Pages that link to "Item:Q1203758"
From MaRDI portal
The following pages link to On the Gittins index for multiarmed bandits (Q1203758):
Displaying 50 items.
- Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges (Q254442) (← links)
- Four proofs of Gittins' multiarmed bandit theorem (Q333080) (← links)
- Kullback-Leibler upper confidence bounds for optimal sequential allocation (Q366995) (← links)
- The multi-armed bandit, with constraints (Q378726) (← links)
- Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation (Q490352) (← links)
- Dynamic priority allocation via restless bandit marginal productivity indices (Q926578) (← links)
- A generalized Gittins index for a Markov chain and its recursive calculation (Q945795) (← links)
- Multi-armed bandits with simple arms (Q1095862) (← links)
- Multi-armed bandits in discrete and continuous time (Q1296724) (← links)
- On a new approach to the analysis of complex multi-armed bandits (Q1298696) (← links)
- Discrete multiarmed bandits and multiparameter processes (Q1317211) (← links)
- A short proof of the Gittins index theorem (Q1327612) (← links)
- Multi-armed bandit problem revisited (Q1337211) (← links)
- Information-gain computation in the \textsc{Fifth} system (Q1726365) (← links)
- The archievable region method in the optimal control of queueing systems; formulations, bounds and policies (Q1923638) (← links)
- On the optimality of the Gittins index rule for multi-armed bandits with multiple plays (Q1974590) (← links)
- Efficiency in lung transplant allocation strategies (Q2044231) (← links)
- Gittins' theorem under uncertainty (Q2076662) (← links)
- Robust control of the multi-armed bandit problem (Q2095215) (← links)
- On the Gittins index in the M/G/1 queue (Q2269488) (← links)
- Multi-armed bandit processes with optimal selection of the operating times (Q2387146) (← links)
- Reading policies for joins: an asymptotic analysis (Q2467117) (← links)
- Finite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimality (Q2564701) (← links)
- On Gittins' index theorem in continuous time (Q2642040) (← links)
- Stopped decision processes in conjunction with general utility (Q2766113) (← links)
- Sensitivity of the gittins index in the contiuous time two-armed bandit problem (Q2785416) (← links)
- Optimal learning with non-Gaussian rewards (Q2806349) (← links)
- A \((2/3)n^{3}\) fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain (Q2892371) (← links)
- Computing a classic index for finite-horizon bandits (Q2899118) (← links)
- GSP with General Independent Click-through-Rates (Q2937002) (← links)
- A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements (Q3169016) (← links)
- Partially Observed Markov Decision Process Multiarmed Bandits—Structural Results (Q3169035) (← links)
- Tax problems in the undiscounted case (Q3367746) (← links)
- Index Policies for Shooting Problems (Q3392111) (← links)
- The Multi-Armed Bandit Problem: Decomposition and Computation (Q3755256) (← links)
- A note on gittins indices for pharmaceutical research (Q3981888) (← links)
- Evaluating policies for generalized bandits via a notion of duality (Q4519117) (← links)
- (Q4558474) (← links)
- Survey of linear programming for standard and nonstandard Markovian control problems. Part II: Applications (Q4698111) (← links)
- Independently Expiring Multiarmed Bandits (Q4950727) (← links)
- Technical Note—A Note on the Equivalence of Upper Confidence Bounds and Gittins Indices for Patient Agents (Q4994155) (← links)
- Optimistic Gittins Indices (Q5060515) (← links)
- Open Bandit Processes with Uncountable States and Time-Backward Effects (Q5299564) (← links)
- MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT (Q5358026) (← links)
- ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS (Q5358114) (← links)
- Gambling Under Unknown Probabilities as Proxy for Real World Decisions Under Uncertainty (Q5885234) (← links)
- Stationary multi-choice bandit problems. (Q5958100) (← links)
- Optimal activation of halting multi‐armed bandit models (Q6057028) (← links)
- Index policy for multiarmed bandit problem with dynamic risk measures (Q6090163) (← links)
- Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems (Q6167036) (← links)