Global optimality guarantees for policy gradient methods
From MaRDI portal
Publication:6655175
DOI: 10.1287/opre.2021.0014
MaRDI QID: Q6655175
Jalaj Bhandari, Daniel J. Russo
Publication date: 20 December 2024
Published in: Operations Research