Stochastic Convex Optimization with Bandit Feedback
From MaRDI portal
Publication:5300524
DOI10.1137/110850827zbMath1270.90107arXiv1107.1744OpenAlexW2567882221MaRDI QIDQ5300524
Alekh Agarwal, Alexander Rakhlin, Dean P. Foster, Sham M. Kakade, Daniel Hsu
Publication date: 27 June 2013
Published in: SIAM Journal on Optimization (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1107.1744
Convex programming (90C25) Derivative-free methods and methods using generalized derivatives (90C56) Learning and adaptive systems in artificial intelligence (68T05)
Related Items
Random gradient-free minimization of convex functions, An Accelerated Method for Derivative-Free Smooth Stochastic Convex Optimization, Operations research applications of dichotomous search, Technical note: <scp>Finite‐time</scp> regret analysis of <scp>Kiefer‐Wolfowitz</scp> stochastic approximation algorithm and nonparametric <scp>multi‐product</scp> dynamic pricing with unknown demand, Zeroth-order feedback optimization for cooperative multi-agent systems, Technical Note—Nonstationary Stochastic Optimization Under Lp,q-Variation Measures, Exploratory distributions for convex functions, Non-Stationary Stochastic Optimization, Derivative-free optimization methods, Stochastic continuum-armed bandits with additive models: minimax regrets and adaptive algorithm, Optimal Policy for Dynamic Assortment Planning Under Multinomial Logit Models