Pages that link to "Item:Q2665165"
From MaRDI portal
The following pages link to An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems (Q2665165):
- Sample mean based index policies by O(log n) regret for the multi-armed bandit problem (Q4862097)
- An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback (Q5361319)
- An Index-based Deterministic Asymptotically Optimal Algorithm for Constrained Multi-armed Bandit Problems (Q6346060)