An information-theoretic analysis of Thompson sampling (Q2810878)
From MaRDI portal
scientific article; zbMATH DE number 6589482
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | An information-theoretic analysis of Thompson sampling | scientific article; zbMATH DE number 6589482 | |
Statements
6 June 2016
Thompson sampling
online optimization
multi-armed bandit
information theory
regret bounds
0.9306549
0.9018026
0.8978109
0.8957436
0.8842326
0.8818612
An information-theoretic analysis of Thompson sampling (English)