A penalized bandit algorithm
From MaRDI portal
Publication:1039016
DOI: 10.1214/EJP.v13-489
zbMath: 1206.62139
arXiv: math/0510384
MaRDI QID: Q1039016
Damien Lamberton, Gilles Pagès
Publication date: 20 November 2009
Published in: Electronic Journal of Probability
Full work available at URL: https://arxiv.org/abs/math/0510384
Related Items (6)
Stochastic approximation of quasi-stationary distributions on compact spaces and applications ⋮ Regret bounds for Narendra-Shapiro bandit algorithms ⋮ On ergodic two-armed bandits ⋮ Convergence in models with bounded expected relative hazard rates ⋮ Nonlinear randomized urn models: a stochastic approximation viewpoint ⋮ Some simple but challenging Markov processes