Real-Time Reinforcement Learning of Constrained Markov Decision Processes with Weak Derivatives
From MaRDI portal
Publication:4925757
zbMath1294.90068arXiv1110.4946MaRDI QIDQ4925757
Felisa J. Vázquez-Abad, Vikram Krishnamurthy
Publication date: 12 June 2013
Full work available at URL: https://arxiv.org/abs/1110.4946
This page was built for publication: Real-Time Reinforcement Learning of Constrained Markov Decision Processes with Weak Derivatives