Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
The Determination of Approximately Optimal Policies in Markov Decision Processes by the Use of Bounds - MaRDI portal

The Determination of Approximately Optimal Policies in Markov Decision Processes by the Use of Bounds

From MaRDI portal

Publication:3934167

Jump to:navigation, search

DOI10.2307/2581490zbMath0477.90082OpenAlexW4253745705MaRDI QIDQ3934167

Douglas J. White

Publication date: 1982

Published in: The Journal of the Operational Research Society (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.2307/2581490

zbMATH Keywords

Markov decision process approximately optimal policies Howard's policy space method approximation of optimal performance level derivation of upper and lower bounds

Mathematics Subject Classification ID

Numerical mathematical programming methods (65K05) Markov and semi-Markov decision processes (90C40)

Related Items (2)

Approximate receding horizon approach for Markov decision processes: average reward case ⋮ Solving infinite horizon discounted Markov decision process problems for a range of discount factors

This page was built for publication: The Determination of Approximately Optimal Policies in Markov Decision Processes by the Use of Bounds

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3934167&oldid=17616730"