Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Machine learning and nonparametric bandit theory - MaRDI portal

Machine learning and nonparametric bandit theory

From MaRDI portal

Publication:4850249

Jump to:navigation, search

DOI10.1109/9.400491zbMath0883.62090OpenAlexW2143552356MaRDI QIDQ4850249

Sidney Yakowitz, Tze Leung Lai

Publication date: 9 October 1995

Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL: https://semanticscholar.org/paper/fe611ab14edf4c19fa90e03da42589b0e9c5d5ec

zbMATH Keywords

controlled Markov processes risk growth

Mathematics Subject Classification ID

Stochastic learning and adaptive control (93E35) Optimal stopping in statistics (62L15) Nonparametric inference (62G99)

Related Items (10)

Bandit and covariate processes, with finite or non-denumerable set of arms ⋮ A linear response bandit problem ⋮ MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS ⋮ Tuning Bandit Algorithms in Stochastic Environments ⋮ Arbitrary side observations in bandit problems ⋮ Active learning in heteroscedastic noise ⋮ Convergence of least squares learning in self-referential discontinuous stochastic models. ⋮ Exploration-exploitation tradeoff using variance estimates in multi-armed bandits ⋮ Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates ⋮ Sequential design with applications to the trim-loss problem

This page was built for publication: Machine learning and nonparametric bandit theory

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4850249&oldid=19194572"