Infinite Arms Bandit: Optimality via Confidence Bounds
From MaRDI portal
Publication:5089465
DOI10.5705/ss.202020.0242OpenAlexW3174216023MaRDI QIDQ5089465
Publication date: 19 July 2022
Published in: Statistica Sinica (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1805.11793
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- The multi-armed bandit problem with covariates
- Kullback-Leibler upper confidence bounds for optimal sequential allocation
- Irreversible adaptive allocation rules
- Asymptotically efficient adaptive allocation rules
- Adaptive treatment allocation and the multi-armed bandit problem
- Bandit problems with infinitely many arms
- Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates
- Optimal adaptive policies for sequential allocation problems
- The multi-armed bandit problem: an efficient nonparametric solution
- Continuum Armed Bandit Problem of Few Variables in High Dimensions
- Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
- The Continuum-Armed Bandit Problem
- Regret and Convergence Bounds for a Class of Continuum-Armed Bandit Problems
- A linear response bandit problem
- Learning to Optimize via Posterior Sampling
- Bandit problems with side observations
- Improved Rates for the Stochastic Continuum-Armed Bandit Problem
- Finite-time analysis of the multiarmed bandit problem
This page was built for publication: Infinite Arms Bandit: Optimality via Confidence Bounds