A One-Armed Bandit Problem with a Concomitant Variable

Publication:3885012

DOI: 10.2307/2286402; zbMath: 0442.62063; OpenAlex: W4240211193; MaRDI QID: Q3885012

Michael B. Woodroofe

Publication date: 1979

Full work available at URL: https://doi.org/10.2307/2286402

zbMATH Keywords

concomitant variable; myopic policy; sequential allocation; asymptotically optimal policies; one-armed bandit problem

Mathematics Subject Classification ID

Bayesian inference (62F15); Sequential statistical design (62L05)

Related Items

Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints
Isotonic smoothing splines under sequential designs
A non-parametric solution to the multi-armed bandit problem with covariates
Woodroofe's one-armed bandit problem revisited
Bandit and covariate processes, with finite or non-denumerable set of arms
A linear response bandit problem
Smoothness-Adaptive Contextual Bandits
MULTI-ARMED BANDITS WITH COVARIATES: THEORY AND APPLICATIONS
The multi-armed bandit problem with covariates
Covariate models for Bernoulli bandits
One-armed bandit process with a covariate
Transfer learning for contextual multi-armed bandits
Optimal Bayesian strategies for the infinite-armed Bernoulli bandit
Unnamed Item
Arbitrary side observations in bandit problems
Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards
On the optimal amount of experimentation in sequential decision problems
Randomized allocation with arm elimination in a bandit problem with covariates
Bayesian Incentive-Compatible Bandit Exploration
Statistical Inference for Online Decision Making via Stochastic Gradient Descent
Covariate-adjusted response-adaptive randomization for multi-arm clinical trials using a modified forward looking Gittins index rule
Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates
Statistical Inference for Online Decision Making: In a Contextual Bandit Setting
