Robust control of the multi-armed bandit problem (Q2095215)
From MaRDI portal
scientific article; zbMATH DE number 7614227
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Robust control of the multi-armed bandit problem | scientific article; zbMATH DE number 7614227 | |
Statements
Robust control of the multi-armed bandit problem (English)
9 November 2022
multiarmed bandit
index policies
Bellman equation
robust Markov decision processes
uncertain transition matrix
project selection