Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes (Q6359420)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes |
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes |
scientific article |
Statements
29 January 2021
0 references
cs.LG
0 references
cs.AI
0 references
math.OC
0 references