Pure Exploration for Multi-Armed Bandit Problems
From MaRDI portal
Publication:6208466
DOI10.1007/978-3-642-04414-4_7zbMath1262.68061arXiv0802.2655MaRDI QIDQ6208466
Sébastien Bubeck, Rémi Munos, Gilles Stoltz
Publication date: 19 February 2008
Computational learning theory (68Q32) Learning and adaptive systems in artificial intelligence (68T05) Probabilistic games; gambling (91A60)
This page was built for publication: Pure Exploration for Multi-Armed Bandit Problems