scientific article
From MaRDI portal
Publication:3121140
zbMath: 1419.91144, MaRDI QID: Q3121140
No author found.
Publication date: 20 March 2019
Full work available at URL: http://mathnet.ru/eng/mgta209
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Learning and adaptive systems in artificial intelligence (68T05) ⋮ Probabilistic games; gambling (91A60) ⋮ Decision theory for games (91A35)
Related Items (4)
Customization of J. Bather's UCB strategy for a Gaussian multiarmed bandit ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Two-armed bandit problem and batch version of the mirror descent algorithm
Cites Work
- Unnamed Item
- Unnamed Item
- Asymptotically efficient adaptive allocation rules
- Adaptive treatment allocation and the multi-armed bandit problem
- Online linear optimization and adaptive routing
- A One-Armed Bandit Problem with a Concomitant Variable
- The Nonstochastic Multiarmed Bandit Problem
- Finite-time analysis of the multiarmed bandit problem