On iterative optimization ol structured Markov decision processes with discounted rewards
From MaRDI portal
Publication:3221982
DOI10.1080/02331938408842960zbMath0556.90089OpenAlexW2140922432MaRDI QIDQ3221982
Marcel Hendrikx, J. A. E. E. Van Nunen, Jaap Wessels
Publication date: 1984
Published in: Mathematische Operationsforschung und Statistik. Series Optimization (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1080/02331938408842960
iterative methodstest problemssuccessive approximationoptimal policytotal reward criterionsurvey on solution techniques
Numerical mathematical programming methods (65K05) Markov and semi-Markov decision processes (90C40)
Related Items
Serial and parallel value iteration algorithms for discounted Markov decision processes, The numerical exploitation of periodicity in Markov decision processes, On using discrete random models within decision support systems, Optimal claim behaviour for third-party liability insurances or To claim or not to claim: that is the question, Aggregation and disaggregation in Markov decision models for inventory control