A Survey of Preference-Based Online Learning with Bandit Algorithms (Q2938721)
From MaRDI portal
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | A Survey of Preference-Based Online Learning with Bandit Algorithms |
scientific article |
Statements
A Survey of Preference-Based Online Learning with Bandit Algorithms (English)
0 references
14 January 2015
0 references
multi-armed bandits
0 references
online learning
0 references
preference learning
0 references
ranking
0 references
top-k selection
0 references
exploration/exploitation
0 references
cumulative regret
0 references
sample complexity
0 references
PAC learning
0 references