scientific article; zbMATH DE number 1356140
From MaRDI portal
Publication:4700316
zbMath0930.93048MaRDI QIDQ4700316
Publication date: 1 November 1999
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
dynamic programmingintelligent controlreinforcement learningQ-learningtemporal differencemultidimensional action spacesQ-V-learning
Learning and adaptive systems in artificial intelligence (68T05) Multivariable systems, multidimensional control systems (93C35)
Related Items (1)
This page was built for publication: