Neural Policy Gradient Methods: Global Optimality and Rates of Convergence (Q6324598)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Neural Policy Gradient Methods: Global Optimality and Rates of Convergence |
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Neural Policy Gradient Methods: Global Optimality and Rates of Convergence |
preprint article from arXiv |
Statements
29 August 2019
0 references
cs.LG
0 references
math.OC
0 references
stat.ML
0 references
Lingxiao Wang
0 references
Qi Cai
0 references
Zhuoran Yang
0 references
Zhaoran Wang
0 references