Long-Term Values in Markov Decision Processes and Repeated Games, and a New Distance for Probability Spaces
DOI: 10.1287/MOOR.2016.0814; zbMath: 1364.90350; OpenAlex: W2557272457; MaRDI QID: Q5739144
Publication date: 2 June 2017
Published in: Mathematics of Operations Research
Full work available at URL: http://hdl.handle.net/11385/197395
Wasserstein metric; repeated games; uniform value; gambling houses; characterization of the value; partial observation Markov decision processes
Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20); Stochastic games, stochastic differential games (91A15); Markov and semi-Markov decision processes (90C40)