Pages that link to "Item:Q378726"
From MaRDI portal
The following pages link to The multi-armed bandit, with constraints (Q378726):
Displaying 12 items.
- Four proofs of Gittins' multiarmed bandit theorem (Q333080) (← links)
- Multi-armed bandits with episode context (Q766259) (← links)
- An asymptotically optimal strategy for constrained multi-armed bandit problems (Q784789) (← links)
- Robust control of the multi-armed bandit problem (Q2095215) (← links)
- The Irrevocable Multiarmed Bandit Problem (Q3098762) (← links)
- On the reduction of total‐cost and average‐cost MDPs to discounted MDPs (Q3120606) (← links)
- Coordinating Pricing and Inventory Replenishment with Nonparametric Demand Learning (Q5129177) (← links)
- Bandits with Global Convex Constraints and Objective (Q5129206) (← links)
- MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT (Q5358026) (← links)
- Risk-Sensitive and Risk-Neutral Multiarmed Bandits (Q5388036) (← links)
- Index policy for multiarmed bandit problem with dynamic risk measures (Q6090163) (← links)
- Multi-armed bandits with censored consumption of resources (Q6097147) (← links)