Neuro-Dynamic Programming: An Overview and Recent Results

Publication:5391735

DOI: 10.1007/978-3-540-69995-8_11
zbMath: 1209.90343
OpenAlex: W1593485202
MaRDI QID: Q5391735

Dimitri P. Bertsekas

Publication date: 7 April 2011

Published in: Operations Research Proceedings

Full work available at URL: https://doi.org/10.1007/978-3-540-69995-8_11




Related Items (40)

Dynamic control in multi-item production/inventory systems
Approximate policy iteration: a survey and some new methods
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
Parameter-free sampled fictitious play for solving deterministic dynamic programming problems
On sample size control in sample average approximations for solving smooth stochastic programs
Strategy optimization for controlled Markov process with descriptive complexity constraint
Approximate linear programming for networks: average cost bounds
Solution for a class of closed-loop leader-follower games with convexity conditions on the payoffs
Dynamic admission and service rate control of a queue
A multi-objective approach for PH-graphs with applications to stochastic shortest paths
Q-learning and policy iteration algorithms for stochastic shortest path problems
A stochastic control formalism for dynamic biologically conformal radiation therapy
A survey of motion planning algorithms from the perspective of autonomous UAV guidance
Stochastic optimization for real time service capacity allocation under random service demand
Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
Bisimulations of Probabilistic Boolean Networks
Random exploration of the procedural space for single-view 3D modeling of buildings
Optimization of a pumped-storage fixed-head hydroplant: the bang-singular-bang solution
Matrix-Analytic Methods for Solving Poisson’s Equation with Applications to Markov Chains of GI/G/1-Type
An Efficient Gradient Projection Method for Stochastic Optimal Control Problems
Exact Converging Bounds for Stochastic Dual Dynamic Programming via Fenchel Duality
Optimal stopping with a probabilistic constraint
Approximate dynamic programming and its applications to the design of Phase I cancer trials
Adaptive Simulation Selection for the Discovery of the Ground State Line of Binary Alloys with a Limited Computational Budget
Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains
Statistical analysis of trajectories on Riemannian manifolds: bird migration, hurricane tracking and video surveillance
Data-Efficient Quickest Change Detection with On–Off Observation Control
Optimal Control of Automotive Multivariable Dynamical Systems
Quadratic approximate dynamic programming for input‐affine systems
Coding and control for communication networks
A Retrograde Approximation Algorithm for Multi-player Can’t Stop
Assessment of the Cell Broadband Engine Architecture as a platform to solve closed-loop optimal control problems
Decentralized optimization over tree graphs
Continuous-Time Robust Dynamic Programming
Dispatching to parallel servers. Solutions of Poisson's equation for first-policy improvement
An Open-Loop Approach for a Stochastic Production Planning Problem with Remanufacturing Process
Adaptive dynamic programming in the Hamiltonian-driven framework
A Sufficient Statistic for Influence in Structured Multiagent Environments
Approximate dynamic programming via iterated Bellman inequalities
Average-case performance of rollout algorithms for knapsack problems