Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
The multi-armed bandit problem: an efficient nonparametric solution - MaRDI portal

The multi-armed bandit problem: an efficient nonparametric solution

From MaRDI portal

Publication:2176624

Jump to:navigation, search

DOI10.1214/19-AOS1809zbMath1442.62180arXiv1703.08285OpenAlexW3007054292MaRDI QIDQ2176624

Hock Peng Chan

Publication date: 5 May 2020

Published in: The Annals of Statistics (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1703.08285

zbMATH Keywords

efficiency subsampling Thompson sampling KL-UCB upper confidence bound (UCB)

Mathematics Subject Classification ID

Nonparametric tolerance and confidence regions (62G15) Sequential statistical design (62L05) Optimal stopping in statistics (62L15) Compound decision problems in statistical decision theory (62C25)

Related Items (2)

A non-parametric solution to the multi-armed bandit problem with covariates ⋮ Infinite Arms Bandit: Optimality via Confidence Bounds

Cites Work

This page was built for publication: The multi-armed bandit problem: an efficient nonparametric solution

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2176624&oldid=14691759"