Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Adaptive playouts for online learning of policies during Monte Carlo tree search - MaRDI portal

Adaptive playouts for online learning of policies during Monte Carlo tree search

From MaRDI portal

Publication:307776

Jump to:navigation, search

DOI10.1016/J.TCS.2016.06.029zbMath1370.68260OpenAlexW2468569233MaRDI QIDQ307776

Tobias Graf, Marco Platzner

Publication date: 5 September 2016

Published in: Theoretical Computer Science (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/j.tcs.2016.06.029

zbMATH Keywords

reinforcement learning computer Go Monte Carlo tree search adaptive playouts

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20) Combinatorial games (91A46)

Uses Software

PACHI

Cites Work

This page was built for publication: Adaptive playouts for online learning of policies during Monte Carlo tree search

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:307776&oldid=12188497"