A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning (Q6335899)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning |
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning |
preprint article from arXiv |
Statements
1 March 2020
0 references
cs.LG
0 references
math.OC
0 references
Nhan H. Pham
0 references
Lam M. Nguyen
0 references
Dzung T. Phan
0 references
Phuong Ha Nguyen
0 references
Marten van Dijk
0 references
Quoc Tran-Dinh
0 references