A note on the convergence rate of the value iteration scheme in controlled Markov chains
From MaRDI portal
Publication:1128695
DOI: 10.1016/S0167-6911(97)00097-2 · zbMath: 0902.93070 · OpenAlex: W1982510099 · MaRDI QID: Q1128695
Publication date: 13 August 1998
Published in: Systems & Control Letters
Full work available at URL: https://doi.org/10.1016/s0167-6911(97)00097-2
Keywords: Markov decision processes; geometric convergence rate; long-run average cost criterion; simultaneous Doeblin condition
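The keywords point at the paper's theme: for controlled Markov chains satisfying a simultaneous Doeblin condition, value iteration under the long-run average cost criterion converges geometrically in the span seminorm. A minimal sketch of that behavior, using an invented 2-state, 2-action MDP (the transition matrices and costs below are illustrative assumptions, not data from the paper):

```python
import numpy as np

# Hypothetical 2-state, 2-action MDP for illustration only.
# P[a, s, s'] = transition probability; c[a, s] = one-stage cost.
P = np.array([
    [[0.9, 0.1], [0.2, 0.8]],   # action 0
    [[0.5, 0.5], [0.6, 0.4]],   # action 1
])
c = np.array([
    [1.0, 3.0],
    [2.0, 0.5],
])

def value_iteration_spans(P, c, n_iter=50):
    """Run undiscounted value iteration and record the span seminorm
    of successive differences V_{n+1} - V_n, which under a Doeblin
    condition shrinks geometrically toward the optimal average cost."""
    V = np.zeros(P.shape[1])
    spans = []
    for _ in range(n_iter):
        Q = c + P @ V             # Q[a, s] = c(a, s) + sum_s' P(a, s, s') V(s')
        V_new = Q.min(axis=0)     # Bellman minimization over actions
        diff = V_new - V
        spans.append(diff.max() - diff.min())
        V = V_new
    return spans

spans = value_iteration_spans(P, c)
# The recorded spans decay roughly geometrically, so the differences
# V_{n+1} - V_n flatten to a constant: the optimal average cost.
```

Tracking the span (max minus min) of the difference vector, rather than a norm of V itself, is the standard stopping diagnostic for average-cost value iteration, since V grows linearly in n while the span contracts.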
Related Items (3)
- Open Problem—Convergence and Asymptotic Optimality of the Relative Value Iteration in Ergodic Control
- Asymptotic behavior of the value functions of discrete-time discounted optimal control
- Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion
Cites Work
- Value iteration in countable state average cost Markov decision processes with unbounded costs
- Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains
- Adaptive Markov control processes
- Dynamic programming, Markov chains, and the method of successive approximations
- Iterative solution of the functional equations of undiscounted Markov renewal programming
- Value iteration in a class of average controlled Markov chains with unbounded costs: necessary and sufficient conditions for pointwise convergence
- On Minimum Cost Per Unit Time Control of Markov Chains
- Value Iteration in a Class of Communicating Markov Decision Chains with the Average Cost Criterion
- The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
This page was built for publication: A note on the convergence rate of the value iteration scheme in controlled Markov chains