Partially Observed Markov Decision Process Multiarmed Bandits—Structural Results - MaRDI portal

Publication:3169035

DOI: 10.1287/moor.1080.0371; zbMath: 1231.90373; OpenAlex: W1998039896; MaRDI QID: Q3169035

Vikram Krishnamurthy, B. Wahlberg

Publication date: 27 April 2011

Published in: Mathematics of Operations Research

Full work available at URL: https://doi.org/10.1287/moor.1080.0371

zbMATH Keywords

likelihood ratio ordering; stochastic approximation algorithm; opportunistic scheduling; monotone policies; partially observed Markov decision process; multiarmed bandits

Mathematics Subject Classification ID

Deterministic scheduling theory in operations research (90B35); Approximation methods and heuristics in mathematical programming (90C59); Markov and semi-Markov decision processes (90C40)

Related Items (3)

Ambiguous partially observable Markov decision processes: structural results and applications ⋮ Optimal Threshold Policies for Multivariate Stopping-Time POMDPs ⋮ Game of Thrones: Fully Distributed Learning for Multiplayer Bandits

Uses Software

POMDPS
