Pages that link to "Item:Q2810758"
From MaRDI portal
The following pages link to On the complexity of best-arm identification in multi-armed bandit models (Q2810758):
Displaying 31 items.
- Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization (Q72746) (← links)
- Approximation algorithms for stochastic combinatorial optimization problems (Q290321) (← links)
- A unified framework for stochastic optimization (Q1719609) (← links)
- Time-uniform, nonparametric, nonasymptotic confidence sequences (Q2039804) (← links)
- Best arm identification in generalized linear bandits (Q2060547) (← links)
- A PAC algorithm in relative precision for bandit problem with costly sampling (Q2084297) (← links)
- Choosing the best arm with guaranteed confidence (Q2096406) (← links)
- The pure exploration problem with general reward functions depending on full distributions (Q2102381) (← links)
- Sequential estimation of quantiles with applications to A/B testing and best-arm identification (Q2137037) (← links)
- Fano's inequality for random variables (Q2218038) (← links)
- Active ranking from pairwise comparisons and when parametric assumptions do not help (Q2284367) (← links)
- A bad arm existence checking problem: how to utilize asymmetric problem structure? (Q2303673) (← links)
- Good arm identification via bandit feedback (Q2425222) (← links)
- An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems (Q2665165) (← links)
- Polynomial-Time Algorithms for Multiple-Arm Identification with Full-Bandit Feedback (Q3386400) (← links)
- Learning the distribution with largest mean: two bandit frameworks (Q4606431) (← links)
- Sequential controlled sensing for composite multihypothesis testing (Q4959347) (← links)
- (Q4998863) (← links)
- (Q4998871) (← links)
- (Q4998911) (← links)
- On the Bias, Risk, and Consistency of Sample Means in Multi-armed Bandits (Q5018902) (← links)
- (Q5053268) (← links)
- Robust Learning of Consumer Preferences (Q5080653) (← links)
- Simple Bayesian Algorithms for Best-Arm Identification (Q5144786) (← links)
- Best Arm Identification for Contaminated Bandits (Q5214178) (← links)
- Explore First, Exploit Next: The True Shape of Regret in Bandit Problems (Q5219722) (← links)
- Learning Theory and Kernel Machines (Q5305867) (← links)
- Satisficing in Time-Sensitive Bandit Learning (Q5870357) (← links)
- Treatment recommendation with distributional targets (Q6163253) (← links)
- Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control (Q6631108) (← links)
- Pair-matching: link prediction with adaptive queries (Q6652703) (← links)