On the Gittins index for multiarmed bandits (Q1203758)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: On the Gittins index for multiarmed bandits |
scientific article; zbMATH DE number 120366
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | On the Gittins index for multiarmed bandits |
scientific article; zbMATH DE number 120366 |
Statements
On the Gittins index for multiarmed bandits (English)
0 references
22 February 1993
0 references
The authors reprove the optimality of the Gittins index policy for the multiarmed bandit problem in a simple, intuitive way. Previous research is reviewed in the light of this new proof and it is shown that the optimal value function is a submodular set function of the available projects.
0 references
sequential methods
0 references
Gittins index policy
0 references
multiarmed bandit problem
0 references
0.9565189
0 references
0.9474607
0 references
0.9232319
0 references
0.88932216
0 references
0.8876237
0 references