scientific article; zbMATH DE number 6253919 - MaRDI portal


From MaRDI portal
Publication:5396654

zbMath: 1280.91038; MaRDI QID: Q5396654

Rémi Munos, Csaba Szepesvári, Gilles Stoltz, Sébastien Bubeck

Publication date: 3 February 2014

Full work available at URL: http://www.jmlr.org/papers/v12/bubeck11a.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (25)

Distributed Bayesian: A Continuous Distributed Constraint Optimization Problem Solver
Unnamed Item
Continuous Assortment Optimization with Logit Choice Probabilities and Incomplete Information
Adaptive-treed bandits
Information theory for ranking and selection
Multi-armed bandits with censored consumption of resources
Nonparametric learning for impulse control problems -- exploration vs. exploitation
Treatment recommendation with distributional targets
Gaussian process bandits with adaptive discretization
Deep learning for ranking response surfaces with applications to optimal stopping problems
Learning in Combinatorial Optimization: What and How to Explore
Filtered Poisson process bandit on a continuum
A derivative-free optimization algorithm for the efficient minimization of functions obtained via statistical averaging
Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization
Unnamed Item
Learning to Optimize via Information-Directed Sampling
Learning‐based iterative modular adaptive control for nonlinear systems
Derivative-free optimization methods
Learning to Optimize via Posterior Sampling
Nonparametric Pricing Analytics with Customer Covariates
Stochastic continuum-armed bandits with additive models: minimax regrets and adaptive algorithm
A Primal–Dual Learning Algorithm for Personalized Dynamic Pricing with an Inventory Constraint
Satisficing in Time-Sensitive Bandit Learning
Sequential Design for Ranking Response Surfaces
On two continuum armed bandit problems in high dimensions



