Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
scientific article; zbMATH DE number 970511 - MaRDI portal

scientific article; zbMATH DE number 970511

From MaRDI portal

Publication:5688680

Jump to:navigation, search

zbMATH Open0873.90107MaRDI QIDQ5688680

Roberto S. Acosta Abreu

Publication date: 23 January 1997

Title of this publication is not available (Why is that?)

zbMATH Keywords

value iteration average reward unknown parameters adaptive policies optimal control of Markovian systems

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)

Related Items (6)

Discrete-time Markov decision processes with first passage models ⋮ Markov decision processes ⋮ Decision Problems for Interval Markov Chains ⋮ Markov decision processes ⋮ Analysis for some properties of discrete time Markov decision processes ⋮ Multitime scale markov decision processes

This page was built for publication:

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5688680)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5688680&oldid=30856511"