Neuro-Dynamic Programming: An Overview and Recent Results

DOI10.1007/978-3-540-69995-8_11zbMath1209.90343OpenAlexW1593485202MaRDI QIDQ5391735

Publication date: 7 April 2011

Published in: Operations Research Proceedings (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/978-3-540-69995-8_11

Mathematics Subject Classification ID

Dynamic programming (90C39) Reasoning under uncertainty in the context of artificial intelligence (68T37)

Related Items (40)

Dynamic control in multi-item production/inventory systems ⋮ Approximate policy iteration: a survey and some new methods ⋮ A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications ⋮ Parameter-free sampled fictitious play for solving deterministic dynamic programming problems ⋮ On sample size control in sample average approximations for solving smooth stochastic programs ⋮ Strategy optimization for controlled Markov process with descriptive complexity constraint ⋮ Approximate linear programming for networks: average cost bounds ⋮ Solution for a class of closed-loop leader-follower games with convexity conditions on the payoffs ⋮ Dynamic admission and service rate control of a queue ⋮ A multi-objective approach for PH-graphs with applications to stochastic shortest paths ⋮ Q-learning and policy iteration algorithms for stochastic shortest path problems ⋮ A stochastic control formalism for dynamic biologically conformal radiation therapy ⋮ A survey of motion planning algorithms from the perspective of autonomous UAV guidance ⋮ Stochastic optimization for real time service capacity allocation under random service demand ⋮ Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model ⋮ Bisimulations of Probabilistic Boolean Networks ⋮ Random exploration of the procedural space for single-view 3D modeling of buildings ⋮ Optimization of a pumped-storage fixed-head hydroplant: the bang-singular-bang solution ⋮ Matrix-Analytic Methods for Solving Poisson’s Equation with Applications to Markov Chains of GI/G/1-Type ⋮ An Efficient Gradient Projection Method for Stochastic Optimal Control Problems ⋮ Exact Converging Bounds for Stochastic Dual Dynamic Programming via Fenchel Duality ⋮ Optimal stopping with a probabilistic constraint ⋮ Approximate dynamic programming and its applications to the design of Phase I cancer trials ⋮ Adaptive Simulation Selection for the Discovery of the Ground State Line of Binary Alloys with a Limited Computational Budget ⋮ Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains ⋮ Statistical analysis of trajectories on Riemannian manifolds: bird migration, hurricane tracking and video surveillance ⋮ Data-Efficient Quickest Change Detection with On–Off Observation Control ⋮ Optimal Control of Automotive Multivariable Dynamical Systems ⋮ Quadratic approximate dynamic programming for input‐affine systems ⋮ Coding and control for communication networks ⋮ A Retrograde Approximation Algorithm for Multi-player Can’t Stop ⋮ Assessment of the Cell Broadband Engine Architecture as a platform to solve closed-loop optimal control problems ⋮ Decentralized optimization over tree graphs ⋮ Continuous-Time Robust Dynamic Programming ⋮ Dispatching to parallel servers. Solutions of Poisson's equation for first-policy improvement ⋮ An Open-Loop Approach for a Stochastic Production Planning Problem with Remanufacturing Process ⋮ Adaptive dynamic programming in the Hamiltonian-driven framework ⋮ A Sufficient Statistic for Influence in Structured Multiagent Environments ⋮ Approximate dynamic programming via iterated Bellman inequalities ⋮ Average-case performance of rollout algorithms for knapsack problems

This page was built for publication: Neuro-Dynamic Programming: An Overview and Recent Results