Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates
From MaRDI portal
Publication:1848931
DOI10.1214/aos/1015362186zbMath1012.62088OpenAlexW2107822634MaRDI QIDQ1848931
Publication date: 14 November 2002
Published in: The Annals of Statistics (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1214/aos/1015362186
Nonparametric regression and quantile regression (62G08) Sequential statistical design (62L05) Compound decision problems in statistical decision theory (62C25)
Related Items
A non-parametric solution to the multi-armed bandit problem with covariates, Woodroofe's one-armed bandit problem revisited, Bandit and covariate processes, with finite or non-denumerable set of arms, A linear response bandit problem, Smoothness-Adaptive Contextual Bandits, The multi-armed bandit problem with covariates, One-armed bandit process with a covariate, Knowledge integration using problem spaces: A study in resource-constrained project scheduling, Technical note—Knowledge gradient for selection with covariates: Consistency and computation, A comparison between two treatments in a clinical trial with an ethical allocation design, Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection, Transfer learning for contextual multi-armed bandits, Modeling item-item similarities for personalized recommendations on Yahoo! front page, A reinforcement learning approach to personalized learning recommendation systems, Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards, A distribution-free approach for selecting better treatment through an ethical allocation, Infinite Arms Bandit: Optimality via Confidence Bounds, Randomized allocation with arm elimination in a bandit problem with covariates, Response‐adaptive randomization for multi‐arm clinical trials using the forward looking Gittins index rule, Statistical Inference for Online Decision Making via Stochastic Gradient Descent, Covariate-adjusted response-adaptive randomization for multi-arm clinical trials using a modified forward looking Gittins index rule, Nonparametric Pricing Analytics with Customer Covariates, Statistical Inference for Online Decision Making: In a Contextual Bandit Setting
Cites Work
- Asymptotically efficient adaptive allocation rules
- One-armed bandit problems with covariates
- Consistent nonparametric regression. Discussion
- Minimum contrast estimators on sieves: Exponential bounds and rates of convergence
- Bandit problems with infinitely many arms
- On the strong universal consistency of nearest neighbor regression function estimates
- Histogram regression estimation using data-dependent partitions
- Weak convergence and empirical processes. With applications to statistics
- Covariate models for bernoulli bandits
- A One-Armed Bandit Problem with a Concomitant Variable
- Machine learning and nonparametric bandit theory
- Some aspects of the sequential design of experiments
- Convergence of stochastic processes
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item