Second order optimality in Markov decision chains
DOI: 10.14736/KYB-2017-6-1086
zbMATH Open: 1449.90354
OpenAlex: W2783551506
MaRDI QID: Q4637454
Publication date: 18 April 2018
Published in: Kybernetika
Full work available at URL: http://hdl.handle.net/10338.dmlcz/147086
policy iterations, Markov decision chains, second-order optimality, value iterations, discounted and average models, optimality conditions for transient
Optimality conditions and duality in mathematical programming (90C46), Optimal stochastic control (93E20), Markov and semi-Markov decision processes (90C40)