Smoothing policies and safe policy gradients - MaRDI portal

From MaRDI portal

Publication:6097096

DOI: 10.1007/s10994-022-06232-6
arXiv: 1905.03231
OpenAlex: W2944187456
MaRDI QID: Q6097096

Matteo Pirotta, Matteo Papini, Marcello Restelli

Publication date: 12 June 2023

Published in: Machine Learning

Full work available at URL: https://arxiv.org/abs/1905.03231

zbMATH Keywords

reinforcement learning; policy gradient; safe learning; monotonic improvement

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)
