PAC Bounds for Discounted MDPs

Publication:3164829

DOI: 10.1007/978-3-642-34106-9_26; zbMath: 1367.68233; arXiv: 1202.3890; OpenAlex: W1867103660; Wikidata: Q58012270; Scholia: Q58012270; MaRDI QID: Q3164829

Marcus Hutter, Tor Lattimore

Publication date: 16 October 2012

Published in: Lecture Notes in Computer Science

Full work available at URL: https://arxiv.org/abs/1202.3890

zbMATH Keywords

Markov decision processes; reinforcement learning; sample-complexity; PAC-MDP; exploration exploitation

Mathematics Subject Classification ID

General nonlinear regression (62J02); Learning and adaptive systems in artificial intelligence (68T05)

Related Items (4)

Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model; Recent advances in reinforcement learning in finance; Near-optimal PAC bounds for discounted MDPs; Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes
