Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
The policy iteration algorithm for average reward Markov decision processes with general state space - MaRDI portal

The policy iteration algorithm for average reward Markov decision processes with general state space

From MaRDI portal
Publication:4395828

DOI10.1109/9.650016zbMath0906.93063OpenAlexW2115558605WikidataQ114991401 ScholiaQ114991401MaRDI QIDQ4395828

Sean P. Meyn

Publication date: 12 August 1998

Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1109/9.650016




Related Items

A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applicationsPotential-based least-squares policy iteration for a parameterized feedback control systemUnnamed ItemThe policy iteration algorithm for a compound Poisson process applied to optimal dividend strategies under a Cramér-Lundberg risk modelAn optimal control approach to day-to-day congestion pricing for stochastic transportation networksThe policy iteration algorithm for average continuous control of piecewise deterministic Markov processesA note on the existence of optimal stationary policies for average Markov decision processes with countable statesOptimal Inventory Control with Jump Diffusion and Nonlinear Dynamics in the DemandStochastic control via direct comparisonOn Iteration Improvement for Averaged Expected Cost Control for One-Dimensional Ergodic DiffusionsWeak convergence and fluid limits in optimal time-to-empty queueing control problemsApproximate receding horizon approach for Markov decision processes: average reward caseCompletion-of-squares: revisited and extendedAverage control of Markov decision processes with Feller transition probabilities and general action spacesAverage Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable PoliciesWeakly coupled event triggered output feedback system in wireless networked control systemsA policy improvement method for constrained average Markov decision processesPlanning for the long run: programming with patient, Pareto responsive preferencesPolicy iteration for continuous-time average reward Markov decision processes in Polish spacesCoding and control for communication networksReliability by design in distributed power transmission networksOn the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded CostsOn structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policiesDispatching to parallel servers. Solutions of Poisson's equation for first-policy improvementSingle sample path-based optimization of Markov chainsDynamic load balancing in parallel queueing systems: stability and optimal controlDynamic safety-stocks for asymptotic optimality in stochastic networks