On ergodic two-armed bandits
Publication:417067
DOI: 10.1214/10-AAP751, zbMath: 1275.62056, arXiv: 0905.0463, OpenAlex: W2136855133, MaRDI QID: Q417067
Pierre Tarrès, Pierre Vandekerkhove
Publication date: 13 May 2012
Published in: The Annals of Applied Probability
Full work available at URL: https://arxiv.org/abs/0905.0463
Stochastic approximation (62L20); Sequential statistical design (62L05); Statistical aspects of information-theoretic topics (62B10)
Cites Work
- A penalized bandit algorithm
- A two armed bandit type problem
- When can the two-armed bandit algorithm be trusted?
- Stochastic algorithms
- The law of the iterated logarithm for additive functionals of Markov chains
- On the linear model with two absorbing barriers
- Stochastic approximation with averaging innovation applied to Finance
- A two armed bandit type problem revisited
- How Fast Is the Bandit?
- Learning Automata - A Survey
- Use of Stochastic Automata for Parameter Self-Optimization with Multimodal Performance Criteria