Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Recurrent policy gradients - MaRDI portal

Recurrent policy gradients

From MaRDI portal

Publication:3588966

Jump to:navigation, search

DOI10.1093/jigpal/jzp049zbMath1214.68304OpenAlexW2103581399MaRDI QIDQ3588966

Jürgen Schmidhuber, Jan Peters, Alexander Förster, Daan Wierstra

Publication date: 10 September 2010

Published in: Logic Journal of IGPL (Search for Journal in Brave)

Full work available at URL: http://doc.rero.ch/record/293283/files/jzp049.pdf

zbMATH Keywords

reinforcement learning recurrent neural networks POMDPs partially observable Markov decision problems policy gradient methods

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)

Related Items (2)

Unnamed Item ⋮ Machine learning for combinatorial optimization: a methodological tour d'horizon

Uses Software

POMDPS

This page was built for publication: Recurrent policy gradients

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3588966&oldid=16998521"