Adaptive control of average Markov decision chains under the Lyapunov stability condition
Publication:1397000
DOI: 10.1007/S001860100138
zbMath: 1031.90059
OpenAlex: W2058725764
MaRDI QID: Q1397000
Publication date: 16 July 2003
Published in: Mathematical Methods of Operations Research
Full work available at URL: https://doi.org/10.1007/s001860100138
Consistent estimation; Adaptive optimal policy; Discrepancy function; Non stationary value iteration; Pause Control; Schweitzer's Transformation