Sample-Path Optimality in Average Markov Decision Chains Under a Double Lyapunov Function Condition
Publication:4593600
DOI: 10.1007/978-0-8176-8337-5_3 · zbMath: 1374.90400 · OpenAlex: W159673498 · MaRDI QID: Q4593600
Raúl Montes-de-Oca, Rolando Cavazos-Cadena
Publication date: 22 November 2017
Published in: Optimization, Control, and Applications of Stochastic Systems
Full work available at URL: https://doi.org/10.1007/978-0-8176-8337-5_3
Related Items (2)
Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion
A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion