An optimal stopping zero-sum game in discrete-time multi-armed bandit processes
From MaRDI portal
Publication:4256753
DOI: 10.1080/02522667.1998.10699387
zbMath: 0957.91031
OpenAlex: W1988645820
MaRDI QID: Q4256753
Publication date: 2 August 1999
Published in: Journal of Information and Optimization Sciences
Full work available at URL: https://doi.org/10.1080/02522667.1998.10699387
zero-sum games, Bellman's equation, optimal stopping times, Markov strategies, multi-armed bandit processes, optimal Markov strategies, Markov stopping times, bandit games