Policy-based optimization: single-step policy gradient method seen as an evolution strategy (Q6365194)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Policy-based optimization: single-step policy gradient method seen as an evolution strategy |
scientific article; zbMATH DE number 900478023
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Policy-based optimization: single-step policy gradient method seen as an evolution strategy |
scientific article; zbMATH DE number 900478023 |
Statements
13 April 2021
0 references
math.OC
0 references