Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates

DOI: 10.1214/aos/1015362186
zbMath: 1012.62088
OpenAlex: W2107822634
MaRDI QID: Q1848931

Dan Zhu, Yuhong Yang

Publication date: 14 November 2002

Published in: The Annals of Statistics

Full work available at URL: https://doi.org/10.1214/aos/1015362186



Related Items

A non-parametric solution to the multi-armed bandit problem with covariates
Woodroofe's one-armed bandit problem revisited
Bandit and covariate processes, with finite or non-denumerable set of arms
A linear response bandit problem
Smoothness-Adaptive Contextual Bandits
The multi-armed bandit problem with covariates
One-armed bandit process with a covariate
Knowledge integration using problem spaces: A study in resource-constrained project scheduling
Technical note—Knowledge gradient for selection with covariates: Consistency and computation
A comparison between two treatments in a clinical trial with an ethical allocation design
Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection
Transfer learning for contextual multi-armed bandits
Modeling item-item similarities for personalized recommendations on Yahoo! front page
A reinforcement learning approach to personalized learning recommendation systems
Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards
A distribution-free approach for selecting better treatment through an ethical allocation
Infinite Arms Bandit: Optimality via Confidence Bounds
Randomized allocation with arm elimination in a bandit problem with covariates
Response‐adaptive randomization for multi‐arm clinical trials using the forward looking Gittins index rule
Statistical Inference for Online Decision Making via Stochastic Gradient Descent
Covariate-adjusted response-adaptive randomization for multi-arm clinical trials using a modified forward looking Gittins index rule
Nonparametric Pricing Analytics with Customer Covariates
Statistical Inference for Online Decision Making: In a Contextual Bandit Setting



Cites Work