
Fast online \(Q(\lambda)\)

From MaRDI portal
Publication:1275350

DOI: 10.1023/A:1007562800292
zbMath: 0912.68170
OpenAlex: W139877375
MaRDI QID: Q1275350

Jürgen Schmidhuber, Marco A. Wiering

Publication date: 17 January 1999

Published in: Machine Learning

Full work available at URL: https://doi.org/10.1023/a:1007562800292


zbMATH Keywords

reinforcement learning; \(Q\)-learning


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)


Related Items (3)

Open problems in universal induction & intelligence ⋮ Risk-sensitive reinforcement learning algorithms with generalized average criterion ⋮ On-policy concurrent reinforcement learning








This page was last edited on 31 January 2024, at 09:55.