Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits (Q6453396)
From MaRDI portal
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits | preprint article from arXiv | |
Statements
2 October 2023
cs.LG
math.OC
stat.ML
Qiwei Di
Tao Jin
Yue Wu
Heyang Zhao
Farzad Farnoud
Quanquan Gu