Recurrent policy gradients

From MaRDI portal
Publication:3588966

DOI: 10.1093/jigpal/jzp049
zbMATH: 1214.68304
OpenAlex: W2103581399
MaRDI QID: Q3588966

Jürgen Schmidhuber, Jan Peters, Alexander Förster, Daan Wierstra

Publication date: 10 September 2010

Published in: Logic Journal of IGPL

Full work available at URL: http://doc.rero.ch/record/293283/files/jzp049.pdf


zbMATH Keywords

reinforcement learning; recurrent neural networks; POMDPs; partially observable Markov decision problems; policy gradient methods


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)


Related Items (2)

  • Unnamed Item
  • Machine learning for combinatorial optimization: a methodological tour d'horizon


Uses Software

  • POMDPS
