A note on generalized second-order value iteration in Markov decision processes

Publication:6145054

DOI: 10.1007/s10957-023-02309-x
MaRDI QID: Q6145054

Villavarayan Antony Vijesh, Mohammed Shahid Abdulla, Shreyas Sumithra Rudresha

Publication date: 8 January 2024

Published in: Journal of Optimization Theory and Applications

zbMATH Keywords

Markov decision processes, reinforcement learning, value iteration, Q-learning

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)
