Global optimality guarantees for policy gradient methods

From MaRDI portal
Publication:6655175

DOI: 10.1287/opre.2021.0014
MaRDI QID: Q6655175

Jalaj Bhandari, Daniel J. Russo

Publication date: 20 December 2024

Published in: Operations Research

zbMATH Keywords

dynamic programming, reinforcement learning, policy iteration, policy gradient methods, gradient dominance

Mathematics Subject Classification ID

Mathematical programming (90Cxx)

This page was last edited on 13 February 2025, at 20:08.