Regret Bounds for Restless Markov Bandits
From MaRDI portal
Publication:3164822
DOI10.1007/978-3-642-34106-9_19zbMath1386.91056OpenAlexW2149943599MaRDI QIDQ3164822
Ronald Ortner, Daniil Ryabko, Rémi Munos, Peter Auer
Publication date: 16 October 2012
Published in: Lecture Notes in Computer Science (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/978-3-642-34106-9_19
Decision theory (91B06) Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Probabilistic games; gambling (91A60)
This page was built for publication: Regret Bounds for Restless Markov Bandits