Regret bounds and minimax policies under partial monitoring (Q2896165)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Regret bounds and minimax policies under partial monitoring |
scientific article; zbMATH DE number 6055580
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Regret bounds and minimax policies under partial monitoring |
scientific article; zbMATH DE number 6055580 |
Statements
13 July 2012
0 references
bandits (adversarial and stochastic)
0 references
regret bound
0 references
minimax rate
0 references
label efficient
0 references
upper confidence bound (UCB) policy
0 references
online learning
0 references
prediction with limited feedback
0 references
Regret bounds and minimax policies under partial monitoring (English)
0 references