The Continuum-Armed Bandit Problem
From MaRDI portal
Publication:4862444
DOI10.1137/S0363012992237273zbMath0848.93069MaRDI QIDQ4862444
Publication date: 8 February 1996
Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)
Asymptotic properties of nonparametric inference (62G20) Stochastic learning and adaptive control (93E35) Sequential statistical design (62L05)
Related Items (19)
Smoothness-Adaptive Contextual Bandits ⋮ Unnamed Item ⋮ Adaptive-treed bandits ⋮ Learning approximately optimal contracts ⋮ Learning approximately optimal contracts ⋮ An asymptotically optimal policy for finite support models in the multiarmed bandit problem ⋮ Control-data separation and logical condition propagation for efficient inference on probabilistic programs ⋮ Treatment recommendation with distributional targets ⋮ Learning in Combinatorial Optimization: What and How to Explore ⋮ Online linear optimization and adaptive routing ⋮ Filtered Poisson process bandit on a continuum ⋮ Infinite Arms Bandit: Optimality via Confidence Bounds ⋮ A revision game of experimentation on a common threshold ⋮ Optimal learning for sequential sampling with non-parametric beliefs ⋮ Optimal learning with a local parametric belief model ⋮ Nonparametric Pricing Analytics with Customer Covariates ⋮ Stochastic continuum-armed bandits with additive models: minimax regrets and adaptive algorithm ⋮ A General Theory of MultiArmed Bandit Processes with Constrained Arm Switches ⋮ On two continuum armed bandit problems in high dimensions
This page was built for publication: The Continuum-Armed Bandit Problem