Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation

From MaRDI portal
Publication:889297

DOI10.1016/j.neunet.2014.06.006zbMath1325.68200arXiv1307.5118OpenAlexW2041851440WikidataQ39164602 ScholiaQ39164602MaRDI QIDQ889297

Syogo Mori, Voot Tangkaratt, Jun Morimoto, Masashi Sugiyama, Tingting Zhao

Publication date: 6 November 2015

Published in: Neural Networks (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1307.5118



Related Items


Uses Software


Cites Work