Randomization in the two-armed bandit problem (Q750006)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Randomization in the two-armed bandit problem |
scientific article; zbMATH DE number 4174042
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Randomization in the two-armed bandit problem |
scientific article; zbMATH DE number 4174042 |
Statements
Randomization in the two-armed bandit problem (English)
0 references
1990
0 references
This paper gives an elementary proof of the existence of optimal solutions to a general form of the continuous-time two-armed bandit. The formulation is the same as that used by \textit{G. Mazziotto} and \textit{A. Millet} [Stochastics 22, 251-288 (1987; Zbl 0643.60040)]; however the topological embedding of the set of randomized optimal increasing paths is new and enables a resolution of the problem that requires only straightforward topological arguments. Also, one of the conditions in Mazziotto and Millet's paper can be removed, yielding a stronger result.
0 references
randomization
0 references
existence of optimal solutions
0 references
continuous-time two-armed bandit
0 references