Achieving Target State-Action Frequencies in Multichain Average-Reward Markov Decision Processes
From MaRDI portal
Publication:5704096
DOI10.1287/moor.27.3.545.316zbMath1082.90579OpenAlexW2012045703MaRDI QIDQ5704096
Publication date: 11 November 2005
Published in: Mathematics of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1287/moor.27.3.545.316
average reward criterionconstrained Markov decision processesstate-action frequenciesMarkov decision processes with nonstandard reward criteriaMarkov process decision
Markov chains (discrete-time Markov processes on discrete state spaces) (60J10) Markov and semi-Markov decision processes (90C40)
Related Items (2)
Optimal deterministic controller synthesis from steady-state distributions ⋮ Simultaneous determination of production and maintenance schedules using in‐line equipment condition and yield information
This page was built for publication: Achieving Target State-Action Frequencies in Multichain Average-Reward Markov Decision Processes