Gittins' theorem under uncertainty
From MaRDI portal
Publication:2076662
DOI10.1214/22-EJP742zbMath1485.91060arXiv1907.05689OpenAlexW4210669552MaRDI QIDQ2076662
Tanut Treetanthiploet, Samuel N. Cohen
Publication date: 22 February 2022
Published in: Electronic Journal of Probability (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1907.05689
Related Items (2)
Gambling Under Unknown Probabilities as Proxy for Real World Decisions Under Uncertainty ⋮ Index policy for multiarmed bandit problem with dynamic risk measures
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Stochastic finance. An introduction in discrete time.
- Four proofs of Gittins' multiarmed bandit theorem
- Representing filtration consistent nonlinear expectations as \(g\)-expectations in general probability spaces
- A theory of Markovian time-inconsistent stochastic control in discrete time
- On time-inconsistent stochastic control in continuous time
- Data-driven nonlinear expectations for statistical uncertainty in decisions
- Risk-averse dynamic programming for Markov decision processes
- Dynamic risk measures: Time consistency and risk measures from BMO martingales
- A general theory of finite state backward stochastic difference equations
- Asymptotically efficient adaptive allocation rules
- Multi-armed bandits with discount factor near one: The Bernoulli case
- On the Gittins index for multiarmed bandits
- Discrete multiarmed bandits and multiparameter processes
- Dynamic allocation problems in continuous time
- Reflected solutions of backward SDE's, and related obstacle problems for PDE's
- Optimal learning and experimentation in bandit problems.
- Recursive construction of confidence regions
- The K-armed bandit problem with multiple priors
- Time-inconsistent optimal control problems and the equilibrium HJB equation
- Lagrangian relaxation and constraint generation for allocation and advanced scheduling
- A stochastic representation theorem with applications to optimization and obstacle problems.
- Optimal stopping under ambiguity in continuous time
- Coherent multiperiod risk adjusted values and Bellman's principle
- Dynamic coherent risk measures
- Conditional and dynamic convex risk measures
- On Gittins' index theorem in continuous time
- Coherent Measures of Risk
- Time-Inconsistent Stochastic Linear--Quadratic Control
- Backward Stochastic Difference Equations and Nearly Time-Consistent Nonlinear Expectations
- Convex risk measures and the dynamics of their penalty functions
- RISK MEASURES AND CAPITAL REQUIREMENTS FOR PROCESSES
- Optimal Stopping With Multiple Priors
- Optimal stopping and dynamic allocation
- Prospect Theory: An Analysis of Decision under Risk
- Backward Stochastic Differential Equations in Finance
- A Tutorial on Thompson Sampling
- General Gittins index processes in discrete time.
- The Nonstochastic Multiarmed Bandit Problem
- Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
- Stochastic Calculus and Applications
- Robust Control of Markov Decision Processes with Uncertain Transition Matrices
- Stationary Ordinal Utility and Impatience
- On the Existence of a Consistent Course of Action when Tastes are Changing
- Robust Dynamic Programming
- Some aspects of the sequential design of experiments
This page was built for publication: Gittins' theorem under uncertainty