Partially Observed Markov Decision Process Multiarmed Bandits—Structural Results
DOI10.1287/moor.1080.0371zbMath1231.90373OpenAlexW1998039896MaRDI QIDQ3169035
Vikram Krishnamurthy, B. Wahlberg
Publication date: 27 April 2011
Published in: Mathematics of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1287/moor.1080.0371
likelihood ratio orderingstochastic approximation algorithmopportunistic schedulingmonotone policiespartially observed Markov decision processmultiarmed bandits
Deterministic scheduling theory in operations research (90B35) Approximation methods and heuristics in mathematical programming (90C59) Markov and semi-Markov decision processes (90C40)
Related Items (3)
Uses Software
This page was built for publication: Partially Observed Markov Decision Process Multiarmed Bandits—Structural Results