Minimax Off-Policy Evaluation for Multi-Armed Bandits (Q5096994)
From MaRDI portal
scientific article; zbMATH DE number 7573342
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Minimax Off-Policy Evaluation for Multi-Armed Bandits |
scientific article; zbMATH DE number 7573342 |
Statements
Minimax Off-Policy Evaluation for Multi-Armed Bandits (English)
0 references
19 August 2022
0 references
off-policy evaluation
0 references
multi-armed bandit
0 references
bounded rewards
0 references
minimax rate-optimal procedures
0 references