scientific article; zbMATH DE number 6453514
From MaRDI portal
Publication:5260100
DOI: 10.13195/J.KZYJC.2013.1467
zbMath: 1324.68130
MaRDI QID: Q5260100
Chunyuan Zhang, Qingxin Zhu, Sheng Zhong
Publication date: 29 June 2015
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Keywords: temporal difference learning; value function approximation; locally weighted learning; policy approximation