The Determination of Approximately Optimal Policies in Markov Decision Processes by the Use of Bounds
Publication:3934167
DOI: 10.2307/2581490; zbMath: 0477.90082; OpenAlex: W4253745705; MaRDI QID: Q3934167
Publication date: 1982
Published in: The Journal of the Operational Research Society
Full work available at URL: https://doi.org/10.2307/2581490
Markov decision process; approximately optimal policies; Howard's policy space method; approximation of optimal performance level; derivation of upper and lower bounds
Numerical mathematical programming methods (65K05); Markov and semi-Markov decision processes (90C40)
Related Items (2)
Approximate receding horizon approach for Markov decision processes: average reward case ⋮ Solving infinite horizon discounted Markov decision process problems for a range of discount factors