Recurrent policy gradients
From MaRDI portal
Publication:3588966
DOI: 10.1093/jigpal/jzp049, zbMath: 1214.68304, OpenAlex: W2103581399, MaRDI QID: Q3588966
Jürgen Schmidhuber, Jan Peters, Alexander Förster, Daan Wierstra
Publication date: 10 September 2010
Published in: Logic Journal of IGPL
Full work available at URL: http://doc.rero.ch/record/293283/files/jzp049.pdf
Keywords: reinforcement learning, recurrent neural networks, POMDPs, partially observable Markov decision problems, policy gradient methods