Exact solution of the Bellman equation for a \(\beta\)-discounted reward in a two-armed bandit with switching arms (Q1307621)

scientific article; zbMATH DE number 1359780

Language	Label	Description	Also known as
English	Exact solution of the Bellman equation for a \(\beta\)-discounted reward in a two-armed bandit with switching arms	scientific article; zbMATH DE number 1359780

Statements

instance of

scholarly article

0 references

title

Exact solution of the Bellman equation for a \(\beta\)-discounted reward in a two-armed bandit with switching arms (English)

0 references

author

Doncho S. Donchev

0 references

published in

Journal of Applied Mathematics and Stochastic Analysis

0 references

publication date

14 February 2000

0 references

full work available at URL

https://eudml.org/doc/48335

0 references

review text

Summary: We consider the symmetric Poissonian two-armed bandit problem. For the case of switching arms, only one of which creates reward, we solve explicitly the Bellman equation for a \(\beta\)-discounted reward and prove that a myopic policy is optimal.

0 references

zbMATH Keywords

two-armed bandit

0 references

switching arms

0 references

Bellman equation

0 references

\(\beta\)-discounted reward

0 references

MaRDI profile type

Publication