Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion
DOI10.1239/jap/1437658607zbMath1327.90366OpenAlexW1605180102MaRDI QIDQ2949846
Raúl Montes-De-oca, Rolando Cavazos-Cadena, Karel Sladký
Publication date: 2 October 2015
Published in: Journal of Applied Probability (Search for Journal in Brave)
Full work available at URL: https://projecteuclid.org/euclid.jap/1437658607
innovationsKolmogorov inequalitydiscrepancy functionstrong sample-path optimalitydominated convergence theorem for the expected average criterion
Discrete-time Markov processes on general state spaces (60J05) Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40)
Related Items (1)
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion
- Markov decision processes with applications to finance.
- Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains
- Necessary conditions for the optimality equation in average-reward Markov decision processes
- Equivalence of Lyapunov stability criteria in a class of Markov decision processes
- Adaptive Markov control processes
- Almost sure optimality and optimality in probability for stochastic control problems over an infinite time horizon
- Value iteration in average cost Markov control processes on Borel spaces
- Sample path optimality for a Markov optimization problem
- Another set of conditions for Markov decision processes with average sample-path costs
- On Minimum Cost Per Unit Time Control of Markov Chains
- Sample-path average optimality for Markov control processes
- Sample path average optimality of Markov control processes with strictly unbounded cost
- Sample-Path Optimality in Average Markov Decision Chains Under a Double Lyapunov Function Condition
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
- Denumerable controlled Markov chains with average reward criterion: Sample path optimality
- Sample-Path Optimality and Variance-Minimization of Average Cost Markov Control Processes
- On the Stochastic Matrices Associated with Certain Queuing Processes
- Markov decision processes
This page was built for publication: Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion