ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS - MaRDI portal

From MaRDI portal

Publication:5358114

DOI: 10.1017/S0269964816000279; zbMath: 1414.91105; arXiv: 1607.05970; OpenAlex: W3104196082; MaRDI QID: Q5358114

No author found.

Publication date: 19 September 2017

Published in: Probability in the Engineering and Informational Sciences

Full work available at URL: https://arxiv.org/abs/1607.05970

zbMATH Keywords

stochastic dynamic programming

Mathematics Subject Classification ID

Decision theory (91B06)

Stochastic programming (90C15)

Dynamic programming (90C39)

Related Items (1)

Optimal Online Learning for Nonlinear Belief Models Using Discrete Priors

Uses Software

EGO
