Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback (Q6437978)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback |
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback |
preprint article from arXiv |
Statements
25 May 2023
0 references
cs.LG
0 references
math.ST
0 references
stat.TH
0 references
Yiliu Wang
0 references
Wei Chen
0 references
Milan Vojnović
0 references