Maximizing the length of a success run for many-armed bandits (Q1053621)
From MaRDI portal
scientific article; zbMATH DE number 3819495
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Maximizing the length of a success run for many-armed bandits | scientific article; zbMATH DE number 3819495 | |
Statements
Maximizing the length of a success run for many-armed bandits (English)
1983
gambling with discounting
many-armed bandits
sequential decisions
stay on a winner rule
Bernoulli populations
maximization of expected total reward
super-regularity
existence of optimal strategy
sampling strategy