Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
On theory and algorithms for Markov decision problems with the total reward criterion - MaRDI portal

On theory and algorithms for Markov decision problems with the total reward criterion

From MaRDI portal

Publication:1144500

Jump to:navigation, search

DOI10.1007/BF01719273zbMath0443.90108OpenAlexW2002346494MaRDI QIDQ1144500

Jaap Wessels, J. A. E. E. Van Nunen

Publication date: 1979

Published in: OR Spektrum (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/bf01719273

zbMATH Keywords

survey algorithms convergence results existence results Markov decision problems theory total reward criterion successive approximation methods unbounded rewards action depending stopping times

Mathematics Subject Classification ID

Numerical mathematical programming methods (65K05) Markov renewal processes, semi-Markov processes (60K15) Markov and semi-Markov decision processes (90C40) Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02)

Related Items

Finite state dynamic programming with the total reward criterion, Action-dependent stopping times and Markov decision process with unbounded rewards

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1144500&oldid=13198721"