Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates

From MaRDI portal
Publication:1848931

DOI: 10.1214/aos/1015362186
zbMath: 1012.62088
OpenAlex: W2107822634
MaRDI QID: Q1848931

Dan Zhu, Yuhong Yang

Publication date: 14 November 2002

Published in: The Annals of Statistics

Full work available at URL: https://doi.org/10.1214/aos/1015362186



Related Items

A non-parametric solution to the multi-armed bandit problem with covariates
Woodroofe's one-armed bandit problem revisited
Bandit and covariate processes, with finite or non-denumerable set of arms
A linear response bandit problem
Smoothness-Adaptive Contextual Bandits
The multi-armed bandit problem with covariates
One-armed bandit process with a covariate
Knowledge integration using problem spaces: A study in resource-constrained project scheduling
Technical note—Knowledge gradient for selection with covariates: Consistency and computation
A comparison between two treatments in a clinical trial with an ethical allocation design
Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection
Transfer learning for contextual multi-armed bandits
Modeling item-item similarities for personalized recommendations on Yahoo! front page
A reinforcement learning approach to personalized learning recommendation systems
Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards
A distribution-free approach for selecting better treatment through an ethical allocation
Infinite Arms Bandit: Optimality via Confidence Bounds
Randomized allocation with arm elimination in a bandit problem with covariates
Response‐adaptive randomization for multi‐arm clinical trials using the forward looking Gittins index rule
Statistical Inference for Online Decision Making via Stochastic Gradient Descent
Covariate-adjusted response-adaptive randomization for multi-arm clinical trials using a modified forward looking Gittins index rule
Nonparametric Pricing Analytics with Customer Covariates
Statistical Inference for Online Decision Making: In a Contextual Bandit Setting



Cites Work