scientific article; zbMATH DE number 6982344
From MaRDI portal
Publication:4558206
zbMath1462.68158MaRDI QIDQ4558206
Sylvain Lamprier, Thibault Gisselbrecht, Patrick Gallinari
Publication date: 21 November 2018
Full work available at URL: http://jmlr.csail.mit.edu/papers/v19/17-693.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Parametric tolerance and confidence regions (62F25) Learning and adaptive systems in artificial intelligence (68T05) Online algorithms; streaming algorithms (68W27) Compound decision problems in statistical decision theory (62C25)
Cites Work
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Asymptotically efficient adaptive allocation rules
- Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis
- Linearly Parameterized Bandits
- Self-Normalized Processes
- 10.1162/153244303321897663
- Finite-time analysis of the multiarmed bandit problem
- Unnamed Item
This page was built for publication: