On the Convergence of Policy Iteration in Stationary Dynamic Programming

Publication:4198357

DOI: 10.1287/moor.4.1.60; zbMath: 0411.90072; OpenAlex: W2042680115; MaRDI QID: Q4198357

Shelby Brumelle, Martin L. Puterman

Publication date: 1979

Published in: Mathematics of Operations Research

Full work available at URL: https://doi.org/10.1287/moor.4.1.60




Related Items (42)

Multilevel techniques for the solution of HJB minimum-time control problems
Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming
Rates of convergence for the policy iteration method for mean field games systems
Numerical approximation of equations involving minimal/maximal operators by successive solution of obstacle problems
Approximations and Optimal Control for State-Dependent Limited Processor Sharing Queues
Approximating Optimal feedback Controllers of Finite Horizon Control Problems Using Hierarchical Tensor Formats
Discrete dynamic programming and viscosity solutions of the Bellman equation
Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory
(Approximate) iterated successive approximations algorithm for sequential decision processes
On linear and super-linear convergence of natural policy gradient algorithm
On the convergence of policy iteration for controlled diffusions
Value-Gradient Based Formulation of Optimal Control Problem and Machine Learning Algorithm
Optimal investment strategies for pension funds with regulation-conform dynamic pension payment management in the absence of guarantees
Optimal polynomial feedback laws for finite horizon control problems
Policy iteration method for time-dependent mean field games systems with non-separable Hamiltonians
Unique Tarski Fixed Points
A note on generalized second-order value iteration in Markov decision processes
Exponential Convergence and Stability of Howard's Policy Improvement Algorithm for Controlled Diffusions
A semi-Lagrangian scheme for a modified version of the Hughes' model for Pedestrian flow
Recent Results in the Approximation of Nonlinear Optimal Control Problems
Unnamed Item
A semi-Lagrangian algorithm in policy space for hybrid optimal control problems
A discrete Hughes model for pedestrian flow on graphs
A mean field games model for finite mixtures of Bernoulli and categorical distributions
Optimal consumption under uncertainty, liquidity constraints, and bounded rationality
Domain decomposition based parallel Howard's algorithm
Policy iteration and Newton-Raphson methods for Markov decision processes under average cost criterion
Multigrid methods for two‐player zero‐sum stochastic games
Unnamed Item
A Fixed Point Approach to Undiscounted Markov Renewal Programs
A neural network-based policy iteration algorithm with global \(H^2\)-superlinear convergence for stochastic games on domains
A numerical method for pricing European options with proportional transaction costs
The primal-dual active set method for a class of nonlinear problems with \(T\)-monotone operators
A policy iteration method for mean field games
NUMERICAL METHODS FOR DIFFERENTIAL GAMES BASED ON PARTIAL DIFFERENTIAL EQUATIONS
Tensor Decomposition Methods for High-dimensional Hamilton--Jacobi--Bellman Equations
Optimal price-threshold control for battery operation with aging phenomenon: a quasiconvex optimization approach
Applications of Markov chain approximation methods to optimal control problems in economics
Continuous vs. discrete time: some computational insights
Two-scale methods for convex envelopes
An Accelerated Value/Policy Iteration Scheme for Optimal Control Problems and Games
The variational calculus and approximation in policy space for Markovian decision processes



