Online Learning of Rested and Restless Bandits
From MaRDI portal
Publication:2989865
DOI10.1109/TIT.2012.2198613zbMath1366.91041arXiv1102.3508OpenAlexW3102381603MaRDI QIDQ2989865
Publication date: 8 June 2017
Published in: IEEE Transactions on Information Theory (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1102.3508
Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Rationality and learning in game theory (91A26) Probabilistic games; gambling (91A60)
Related Items (3)
Unnamed Item ⋮ An online algorithm for the risk-aware restless bandit ⋮ Game of Thrones: Fully Distributed Learning for Multiplayer Bandits
This page was built for publication: Online Learning of Rested and Restless Bandits