Detecting optimal and non-optimal actions in average-cost Markov decision processes
Publication:4322051
DOI: 10.2307/3215322; zbMATH Open: 0815.60061; OpenAlex: W2330034154; MaRDI QID: Q4322051
Publication date: 20 March 1995
Published in: Journal of Applied Probability
Full work available at URL: https://doi.org/10.2307/3215322
Markov chains (discrete-time Markov processes on discrete state spaces) (60J10); Optimal stochastic control (93E20); Markov and semi-Markov decision processes (90C40)
Recommendations
- Optimal policies for constrained average-cost Markov decision processes
- Average cost Markov decision processes: Optimality conditions
- Verifiable conditions for average optimality of continuous-time Markov decision processes
- Detection-averse optimal and receding-horizon control for Markov decision processes
- On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes
- Optimality Conditions for Partially Observable Markov Decision Processes
- Markov decision processes in minimization of expected costs