scientific article

From MaRDI portal
Publication:2810758

zbMath1360.62433arXiv1407.4443MaRDI QIDQ2810758

Emilie Kaufmann, Olivier Cappé, Aurélien Garivier

Publication date: 6 June 2016

Full work available at URL: https://arxiv.org/abs/1407.4443

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (27)

Best Arm Identification for Contaminated BanditsApproximation algorithms for stochastic combinatorial optimization problemsSequential estimation of quantiles with applications to A/B testing and best-arm identificationRobust Learning of Consumer PreferencesAn index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problemsUnnamed ItemUnnamed ItemUnnamed ItemGood arm identification via bandit feedbackTreatment recommendation with distributional targetsLearning the distribution with largest mean: two bandit frameworksA unified framework for stochastic optimizationFano's inequality for random variablesSimple Bayesian Algorithms for Best-Arm IdentificationUnnamed ItemHyperband: A Novel Bandit-Based Approach to Hyperparameter OptimizationActive ranking from pairwise comparisons and when parametric assumptions do not helpTime-uniform, nonparametric, nonasymptotic confidence sequencesSequential controlled sensing for composite multihypothesis testingExplore First, Exploit Next: The True Shape of Regret in Bandit ProblemsA bad arm existence checking problem: how to utilize asymmetric problem structure?Polynomial-Time Algorithms for Multiple-Arm Identification with Full-Bandit FeedbackA PAC algorithm in relative precision for bandit problem with costly samplingChoosing the best arm with guaranteed confidenceThe pure exploration problem with general reward functions depending on full distributionsOn the Bias, Risk, and Consistency of Sample Means in Multi-armed BanditsSatisficing in Time-Sensitive Bandit Learning




This page was built for publication: