Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning (Q6401740)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning |
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning |
preprint article from arXiv |
Statements
10 June 2022
0 references
cs.LG
0 references
math.OC
0 references
Ruida Zhou
0 references
Tao Liu
0 references
Dileep Kalathil
0 references
P. R. Kumar
0 references
Chao Tian
0 references