Bounded Parameter Markov Decision Processes with Average Reward Criterion
Publication:5434055
DOI: 10.1007/978-3-540-72927-3_20
zbMath: 1203.90175
OpenAlex: W2155355065
MaRDI QID: Q5434055
Authors: Ambuj Tewari, Peter L. Bartlett
Publication date: 3 January 2008
Published in: Learning Theory
Full work available at URL: https://doi.org/10.1007/978-3-540-72927-3_20
Related Items (5)
Reachability analysis of uncertain systems using bounded-parameter Markov decision processes ⋮ Adaptive aggregation for reinforcement learning in average reward Markov decision processes ⋮ Policy iteration for bounded-parameter POMDPs ⋮ Robust topological policy iteration for infinite horizon bounded Markov decision processes ⋮ Reinforcement Learning in Robust Markov Decision Processes