Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Combining multiple strategies for multiarmed bandit problems and asymptotic optimality - MaRDI portal

Combining multiple strategies for multiarmed bandit problems and asymptotic optimality (Q892592)

From MaRDI portal

Jump to:navigation, search

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

Please use this page instead for the normal view: Combining multiple strategies for multiarmed bandit problems and asymptotic optimality

scientific article; zbMATH DE number 6511718

Language	Label	Description	Also known as
English	Combining multiple strategies for multiarmed bandit problems and asymptotic optimality	scientific article; zbMATH DE number 6511718

Statements

scholarly article

0 references

Combining multiple strategies for multiarmed bandit problems and asymptotic optimality (English)

0 references

Hyeong Soo Chang

0 references

0 references

Journal of Control Science and Engineering

0 references

publication date

19 November 2015

0 references

Summary: This brief paper provides a simple algorithm that selects a strategy at each time in a given set of multiple strategies for stochastic multiarmed bandit problems, thereby playing the arm by the chosen strategy at each time. The algorithm follows the idea of the probabilistic \(\epsilon_t\)-switching in the \(\epsilon_t\)-greedy strategy and is asymptotically optimal in the sense that the selected strategy converges to the best in the set under some conditions on the strategies in the set and the sequence of \(\epsilon_t\).

0 references

zbMATH Keywords

multiarmed bandit problems

0 references

asymptotic optimality

0 references

multiple strategies

0 references

MaRDI profile type

0 references

full work available at URL

https://doi.org/10.1155/2015/264953

0 references

Online learning methods for networking

0 references

Prediction, Learning, and Games

0 references

Some aspects of the sequential design of experiments

0 references

0 references

Randomised allocation of treatments in sequential trials

0 references

Finite-time analysis of the multiarmed bandit problem

0 references

0 references

The Nonstochastic Multiarmed Bandit Problem

0 references

Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems

0 references

Combining expert advice in reactive environments

0 references

Recommended article

An asymptotically optimal strategy for constrained multi-armed bandit problems

Similarity Score

0.9391129

Recommender Run

Recommender Run 3

0 references

ASYMPTOTICALLY OPTIMAL MULTI-ARMED BANDIT POLICIES UNDER A COST CONSTRAINT

Similarity Score

0.9114468

Recommender Run

Recommender Run 3

0 references

Similarity Score

0.90940464

Recommender Run

Recommender Run 3

0 references

Asymptotically optimal algorithms for budgeted multiple play bandits

Similarity Score

0.89920986

Recommender Run

Recommender Run 3

0 references

On nearly selfoptimizing strategies for multiarmed bandit problems with controlled arms

Similarity Score

0.8969173

Recommender Run

Recommender Run 3

0 references

Combinatorial multi-armed bandit and its extension to probabilistically triggered arms

Similarity Score

0.892969

Recommender Run

Recommender Run 3

0 references

Sequential Multi-Hypothesis Testing in Multi-Armed Bandit Problems: An Approach for Asymptotic Optimality

Similarity Score

0.88904816

Recommender Run

Recommender Run 3

0 references

An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits

Similarity Score

0.88738525

Recommender Run

Recommender Run 3

0 references

Multi-armed bandits in discrete and continuous time

Similarity Score

0.886415

Recommender Run

Recommender Run 3

0 references

Identifiers

zbMATH Open document ID

0 references

10.1155/2015/264953

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

zbMATH DE Number

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:892592

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q892592&oldid=42748924"