Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Merge two items
In other projects
MaRDI portal item
Discussion
View source
View history
Purge
English
Log in

Structure in the space of value functions

From MaRDI portal
Publication:1604827
Jump to:navigation, search

DOI10.1023/A:1017944732463zbMath1005.68087OpenAlexW1598748993MaRDI QIDQ1604827

R. Smith

Publication date: 8 July 2002

Published in: Machine Learning (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1023/a:1017944732463


zbMATH Keywords

optimal controldynamic programmingreinforcement learningunsupervised learning


Mathematics Subject Classification ID

Computational learning theory (68Q32) Memory and learning in psychology (91E40)


Related Items (1)

Accelerating autonomous learning by using heuristic selection of actions







This page was built for publication: Structure in the space of value functions

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1604827&oldid=13899953"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
This page was last edited on 1 February 2024, at 02:43.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki