Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Estimation and approximation bounds for gradient-based reinforcement learning - MaRDI portal

Estimation and approximation bounds for gradient-based reinforcement learning

From MaRDI portal

Publication:1604222

Jump to:navigation, search

DOI10.1006/jcss.2001.1793zbMath1052.68108OpenAlexW1983016559MaRDI QIDQ1604222

Jonathan Baxter, Bartlett, Peter L.

Publication date: 4 July 2002

Published in: Journal of Computer and System Sciences (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1006/jcss.2001.1793

zbMATH Keywords

Partially Observable Markov Decision Process

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20)

Related Items (1)

Exploiting random walks for learning

Cites Work

This page was built for publication: Estimation and approximation bounds for gradient-based reinforcement learning

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1604222&oldid=13901324"