Is Q-learning Provably Efficient?

From MaRDI portal
Publication:6304056

arXiv: 1807.03765
MaRDI QID: Q6304056

Michael I. Jordan, Chi Jin, Zeyuan Allen-Zhu, Sebastien Bubeck

Publication date: 10 July 2018




Has companion code repository: https://github.com/microsoft/intrepid








