Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits Under Realizability
From MaRDI portal
Publication: 5868941
DOI: 10.1287/moor.2021.1193
OpenAlex: W3013725403
MaRDI QID: Q5868941
Publication date: 26 September 2022
Published in: Mathematics of Operations Research
Full work available at URL: https://arxiv.org/abs/2003.12699
Keywords: computational efficiency, statistical learning, contextual bandit, offline regression, online-to-offline reduction
MSC classification: Computational learning theory (68Q32); General nonlinear regression (62J02); Learning and adaptive systems in artificial intelligence (68T05); Sequential statistical design (62L05)
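The keywords above describe the paper's central idea: reducing contextual bandits to an offline regression oracle, with action selection done by inverse-gap weighting over the oracle's predicted rewards. A minimal sketch of that weighting step (the function name, `y_hat`, and the choice of `gamma` are illustrative assumptions, not the paper's exact pseudocode):

```python
import numpy as np

def inverse_gap_weighting(y_hat, gamma):
    """Map predicted rewards for K actions to a sampling distribution.

    Non-greedy actions a get probability 1 / (K + gamma * (max_y - y_hat[a]));
    the greedy action receives the remaining mass. Larger gamma concentrates
    probability on the empirically best action (more exploitation).
    """
    K = len(y_hat)
    best = int(np.argmax(y_hat))
    # Inverse-gap probabilities for every action relative to the best one.
    p = 1.0 / (K + gamma * (y_hat[best] - np.asarray(y_hat, dtype=float)))
    # Greedy action absorbs whatever mass is left so that p sums to 1.
    p[best] = 0.0
    p[best] = 1.0 - p.sum()
    return p
```

An action is then drawn from this distribution (e.g. with `np.random.choice(K, p=p)`), the observed reward is logged, and the regression oracle is periodically refit on the accumulated (context, action, reward) data.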
Related Items (3)
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning ⋮ Dealing with expert bias in collective decision-making ⋮ Recent advances in reinforcement learning in finance
Cites Work
- Batched bandit problems
- Empirical entropy, minimax regret and minimax risk
- Information-theoretic determination of minimax rates of convergence
- Reinforcement learning with immediate rewards and linear hypotheses
- Cryptographic hardness for learning intersections of halfspaces
- Regret Minimization for Reserve Prices in Second-Price Auctions
- A Tutorial on Thompson Sampling
- The Nonstochastic Multiarmed Bandit Problem
- Deep Neural Networks for Estimation and Inference
- Bandit Algorithms
- Online Decision Making with High-Dimensional Covariates
- Introduction to Multi-Armed Bandits
- The computational power of optimization in online learning