Splitting Randomized Stationary Policies in Total-Reward Markov Decision Processes
DOI: 10.1287/moor.1110.0525; zbMath: 1243.90233; OpenAlex: W1991591460; MaRDI QID: Q2884309
Uriel G. Rothblum, Eugene A. Feinberg
Publication date: 24 May 2012
Published in: Mathematics of Operations Research
Full work available at URL: https://doi.org/10.1287/moor.1110.0525
Markov decision processes; constrained Markov decision processes; occupancy measures; splitting occupancy measures
Computational methods in Markov chains (60J22); Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20); Markov and semi-Markov decision processes (90C40)