On the Gittins index for multiarmed bandits

From MaRDI portal
Publication:1203758

DOI10.1214/aoap/1177005588zbMath0763.60021OpenAlexW1996859119WikidataQ55920221 ScholiaQ55920221MaRDI QIDQ1203758

Richard R. Weber

Publication date: 22 February 1993

Published in: The Annals of Applied Probability (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1214/aoap/1177005588




Related Items (30)

Gambling Under Unknown Probabilities as Proxy for Real World Decisions Under UncertaintyMulti-armed bandit problem revisitedOpen Bandit Processes with Uncountable States and Time-Backward EffectsOptimistic Gittins IndicesMulti-armed bandit processes with optimal selection of the operating timesOn Gittins' index theorem in continuous timeFour proofs of Gittins' multiarmed bandit theoremKullback-Leibler upper confidence bounds for optimal sequential allocationThe multi-armed bandit, with constraintsThe archievable region method in the optimal control of queueing systems; formulations, bounds and policiesOptimal activation of halting multi‐armed bandit modelsIndex policy for multiarmed bandit problem with dynamic risk measuresMULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENTON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITSEmpirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problemsOptimal Dynamic Information AcquisitionDynamic priority allocation via restless bandit marginal productivity indicesInformation-gain computation in the \textsc{Fifth} systemStochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocationReading policies for joins: an asymptotic analysisStopped decision processes in conjunction with general utilityOn the Gittins index in the M/G/1 queueUnnamed ItemIndependently Expiring Multiarmed BanditsSurvey of linear programming for standard and nonstandard Markovian control problems. Part II: ApplicationsEfficiency in lung transplant allocation strategiesGittins' theorem under uncertaintyTechnical Note—A Note on the Equivalence of Upper Confidence Bounds and Gittins Indices for Patient AgentsMulti-armed bandits in discrete and continuous timeMulti-armed bandit models for the optimal design of clinical trials: benefits and challenges




This page was built for publication: On the Gittins index for multiarmed bandits