scientific article; zbMATH DE number 6276212
From MaRDI portal
Publication:5405224
zbMath1436.90148MaRDI QIDQ5405224
Mohammad Gheshlaghi Azar, Vicenç Gómez, Hilbert J. Kappen
Publication date: 1 April 2014
Full work available at URL: http://www.jmlr.org/papers/v13/azar12a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Markov decision processesreinforcement learningfunction approximationapproximate dynamic programmingMonte-Carlo methods
Related Items (5)
On linear and super-linear convergence of natural policy gradient algorithm ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Applications of variable discounting dynamic programming to iterated function systems and related problems ⋮ Kernel dynamic policy programming: applicable reinforcement learning to robot systems with high dimensional states
This page was built for publication: