Improved Rates for the Stochastic Continuum-Armed Bandit Problem

Publication:5434068

DOI: 10.1007/978-3-540-72927-3_33
zbMath: 1203.68134
OpenAlex: W1521084402
MaRDI QID: Q5434068

Peter Auer, Csaba Szepesvári, Ronald Ortner

Publication date: 3 January 2008

Published in: Learning Theory

Full work available at URL: https://doi.org/10.1007/978-3-540-72927-3_33

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)
Stochastic learning and adaptive control (93E35)
Probabilistic games; gambling (91A60)

Related Items (18)

Smoothness-Adaptive Contextual Bandits
Continuous Assortment Optimization with Logit Choice Probabilities and Incomplete Information
Adaptive-treed bandits
Learning approximately optimal contracts
Learning approximately optimal contracts
Nonparametric learning for impulse control problems -- exploration vs. exploitation
Estimation and inference for minimizer and minimum of convex functions: optimality, adaptivity and uncertainty principles
Online Regret Bounds for Markov Decision Processes with Deterministic Transitions
Coordinating Pricing and Inventory Replenishment with Nonparametric Demand Learning
Online regret bounds for Markov decision processes with deterministic transitions
Infinite Arms Bandit: Optimality via Confidence Bounds
Randomized allocation with arm elimination in a bandit problem with covariates
On Incomplete Learning and Certainty-Equivalence Control
Dynamic Pricing with Multiple Products and Partially Specified Demand Distribution
Nonparametric Pricing Analytics with Customer Covariates
Stochastic continuum-armed bandits with additive models: minimax regrets and adaptive algorithm
A Primal–Dual Learning Algorithm for Personalized Dynamic Pricing with an Inventory Constraint
On two continuum armed bandit problems in high dimensions