A One-Armed Bandit Problem with a Concomitant Variable
From MaRDI portal
Publication:3885012
DOI10.2307/2286402zbMath0442.62063OpenAlexW4240211193MaRDI QIDQ3885012
Publication date: 1979
Full work available at URL: https://doi.org/10.2307/2286402
concomitant variablemyopic policysequential allocationasymptotically optimal policiesone-armed bandit problem
Related Items
Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints, Isotonic smoothing splines under sequential designs, A non-parametric solution to the multi-armed bandit problem with covariates, Woodroofe's one-armed bandit problem revisited, Bandit and covariate processes, with finite or non-denumerable set of arms, A linear response bandit problem, Smoothness-Adaptive Contextual Bandits, MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS, The multi-armed bandit problem with covariates, Covariate models for bernoulli bandits, One-armed bandit process with a covariate, Transfer learning for contextual multi-armed bandits, Optimal Bayesian strategies for the infinite-armed Bernoulli bandit, Unnamed Item, Arbitrary side observations in bandit problems, Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards, On the optimal amount of experimentation in sequential decision problems, Randomized allocation with arm elimination in a bandit problem with covariates, Bayesian Incentive-Compatible Bandit Exploration, Statistical Inference for Online Decision Making via Stochastic Gradient Descent, Covariate-adjusted response-adaptive randomization for multi-arm clinical trials using a modified forward looking Gittins index rule, Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates, Statistical Inference for Online Decision Making: In a Contextual Bandit Setting