Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Statistical Inference for Online Decision Making: In a Contextual Bandit Setting - MaRDI portal

Statistical Inference for Online Decision Making: In a Contextual Bandit Setting

From MaRDI portal

Publication:5857145

Jump to:navigation, search

DOI10.1080/01621459.2020.1770098zbMath1457.62041arXiv2010.07283OpenAlexW3030165768MaRDI QIDQ5857145

Haoyu Chen, Rui Song, Wen-Bin Lu

Publication date: 30 March 2021

Published in: Journal of the American Statistical Association (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2010.07283

zbMATH Keywords

statistical inference model misspecification online decision making epsilon-greedy inverse propensity weighted estimator

Mathematics Subject Classification ID

Nonparametric estimation (62G05) Sequential statistical analysis (62L10) Compound decision problems in statistical decision theory (62C25)

Related Items

A Single-Index Model With a Surface-Link for Optimizing Individualized Dose Rules, Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5857145&oldid=30703580"