Policy mirror descent inherently explores action space (Q6663113)
From MaRDI portal
scientific article; zbMATH DE number 7966996
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Policy mirror descent inherently explores action space | scientific article; zbMATH DE number 7966996 | |
Statements
Policy mirror descent inherently explores action space (English)
14 January 2025
Markov decision process
stochastic policy gradient
exploration
mirror descent
sample complexity