Exact solution of the Bellman equation for a \(\beta\)-discounted reward in a two-armed bandit with switching arms (Q1307621)
From MaRDI portal
scientific article; zbMATH DE number 1359780
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Exact solution of the Bellman equation for a \(\beta\)-discounted reward in a two-armed bandit with switching arms | scientific article; zbMATH DE number 1359780 | |
Statements
Exact solution of the Bellman equation for a \(\beta\)-discounted reward in a two-armed bandit with switching arms (English)
14 February 2000
Summary: We consider the symmetric Poissonian two-armed bandit problem. For the case of switching arms, only one of which creates reward, we solve explicitly the Bellman equation for a \(\beta\)-discounted reward and prove that a myopic policy is optimal.
two-armed bandit
switching arms
Bellman equation
\(\beta\)-discounted reward