Maximum a Posteriori Policy Optimisation (Q6303189)
From MaRDI portal
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Maximum a Posteriori Policy Optimisation | preprint article from arXiv | |
Statements
14 June 2018
cs.LG
cs.AI
cs.IT
cs.RO
math.IT
stat.ML
Abbas Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Remi Munos
Nicolas Heess
Martin Riedmiller