Four proofs of Gittins' multiarmed bandit theorem
From MaRDI portal
Publication:333080
DOI10.1007/s10479-013-1523-0zbMath1348.60065OpenAlexW2109643282MaRDI QIDQ333080
Publication date: 9 November 2016
Published in: Annals of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s10479-013-1523-0
Queueing theory (aspects of probability theory) (60K25) Dynamic programming (90C39) Stopping times; optimal stopping problems; gambling theory (60G40) Markov and semi-Markov decision processes (90C40)
Related Items (5)
Approximation algorithms for stochastic combinatorial optimization problems ⋮ MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT ⋮ Decomposing risk in an exploitation-exploration problem with endogenous termination time ⋮ Gittins' theorem under uncertainty ⋮ Robust control of the multi-armed bandit problem
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- The multi-armed bandit, with constraints
- A generalized Gittins index for a Markov chain and its recursive calculation
- Arm-acquiring bandits
- On the Gittins index for multiarmed bandits
- Multiple feedback at a single-server station
- Multi-armed bandits in discrete and continuous time
- Discrete multiarmed bandits and multiparameter processes
- A short proof of the Gittins index theorem
- Multi-armed bandit problem revisited
- Dynamic allocation indices for restless projects and queueing admission control: a polyhedral approach
- Almost optimal policies for stochastic systems which almost satisfy conservation laws
- Finite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimality
- Restless bandits, partial conservation laws and indexability
- A (2/3)n3 Fast-Pivoting Algorithm for the Gittins Index and Optimal Stopping of a Markov Chain
- Multi‐Armed Bandit Allocation Indices
- Scheduling for Minimum Total Loss Using Service Time Distributions
- Addendum to ‘On an index policy for restless bandits'
- Branching Bandit Processes
- Parallel Scheduling of Multiclass M/M/m Queues: Approximate and Heavy-Traffic Optimization of Achievable Performance
- Extensions of the multiarmed bandit problem: The discounted case
- The Multi-Armed Bandit Problem: Decomposition and Computation
- Characterization and Optimization of Achievable Performance in General Queueing Systems
- On an index policy for restless bandits
- Multiclass Queueing Systems: Polymatroidal Structure and Optimal Scheduling Control
- Dynamic Scheduling of a Multiclass Queue: Discount Optimality
- Optimal Control of Single-Server Queuing Networks and Multi-Class M/G/1 Queues with Feedback
- Time-Sharing Service Systems. I
- Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems
- The Achievable Region Approach to the Optimal Control of Stochastic Systems
- Restless Bandit Marginal Productivity Indices, Diminishing Returns, and Optimal Control of Make-to-Order/Make-to-Stock M/G/1 Queues
- Risk-Sensitive and Risk-Neutral Multiarmed Bandits
This page was built for publication: Four proofs of Gittins' multiarmed bandit theorem