Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Batch mode reinforcement learning based on the synthesis of artificial trajectories - MaRDI portal

Batch mode reinforcement learning based on the synthesis of artificial trajectories

From MaRDI portal

Publication:378762

Jump to:navigation, search

DOI10.1007/s10479-012-1248-5zbMath1276.68134OpenAlexW2134689794WikidataQ42258641 ScholiaQ42258641MaRDI QIDQ378762

Damien Ernst, Louis Wehenkel, Raphael Fonteneau, Susan A. Murphy

Publication date: 12 November 2013

Published in: Annals of Operations Research (Search for Journal in Brave)

Full work available at URL: http://europepmc.org/articles/pmc3773886

zbMATH Keywords

optimal control reinforcement learning function approximators artificial trajectories

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Stochastic learning and adaptive control (93E35)

Related Items

Lipschitzness is all you need to tame off-policy generative adversarial imitation learning

Uses Software

Approxrl

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:378762&oldid=12251321"