Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
scientific article; zbMATH DE number 6860770 - MaRDI portal

scientific article; zbMATH DE number 6860770

From MaRDI portal

Publication:4636970

Jump to:navigation, search

zbMath1434.68446MaRDI QIDQ4636970

Alan Fern, Jervis Pinto

Publication date: 17 April 2018

Full work available at URL: http://jmlr.csail.mit.edu/papers/v18/15-251.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

reductions imitation learning Monte-Carlo tree search online sequential decision-making partial policy partial policy learning

Mathematics Subject Classification ID

Decision theory (91B06) Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40) Online algorithms; streaming algorithms (68W27)

Related Items (2)

A synthesis of automated planning and reinforcement learning for efficient, robust decision-making ⋮ Unnamed Item

Uses Software

Cites Work

This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4636970&oldid=18820968"