Regularized Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity (Q6440882)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Regularized Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity |
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Regularized Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity |
preprint article from arXiv |
Statements
20 June 2023
0 references
math.OC
0 references
cs.LG
0 references
Runyu Zhang
0 references
Yang Hu
0 references
Na Li
0 references