Policy-based optimization: single-step policy gradient method seen as an evolution strategy
Publication:6365194
arXiv: 2104.06175
MaRDI QID: Q6365194
Jonathan Viquerat, Régis Duvigneau, Elie Hachem, Alexander Kuhnle, Philippe Meliga
Publication date: 13 April 2021