Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Reducing reinforcement learning to KWIK online regression - MaRDI portal

Reducing reinforcement learning to KWIK online regression

From MaRDI portal

Publication:616761

Jump to:navigation, search

DOI10.1007/s10472-010-9201-2zbMath1207.68243OpenAlexW2020753891MaRDI QIDQ616761

Michael L. Littman, Lihong Li

Publication date: 12 January 2011

Published in: Annals of Mathematics and Artificial Intelligence (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/s10472-010-9201-2

zbMATH Keywords

reinforcement learning value function approximation exploration knows what it knows (KWIK)online regression PAC-MDP

Mathematics Subject Classification ID

Analysis of algorithms and problem complexity (68Q25) Learning and adaptive systems in artificial intelligence (68T05)

Related Items

Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm ⋮ Unnamed Item ⋮ Knows what it knows: a framework for self-aware learning ⋮ Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains

Uses Software

R-MAX

Cites Work

This page was built for publication: Reducing reinforcement learning to KWIK online regression

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:616761&oldid=12508974"