Homotopic policy mirror descent: policy convergence, algorithmic regularization, and improved sample complexity (Q6608040)
scientific article; zbMATH DE number 7915921
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Homotopic policy mirror descent: policy convergence, algorithmic regularization, and improved sample complexity | scientific article; zbMATH DE number 7915921 | |
Statements
Homotopic policy mirror descent: policy convergence, algorithmic regularization, and improved sample complexity (English)
19 September 2024
policy gradient method
local acceleration
policy convergence
sample complexity