Proximal reinforcement learning: efficient off-policy evaluation in partially observed Markov decision processes (Q6580512)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Proximal reinforcement learning: efficient off-policy evaluation in partially observed Markov decision processes |
scientific article; zbMATH DE number 7888749
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Proximal reinforcement learning: efficient off-policy evaluation in partially observed Markov decision processes |
scientific article; zbMATH DE number 7888749 |
Statements
Proximal reinforcement learning: efficient off-policy evaluation in partially observed Markov decision processes (English)
0 references
29 July 2024
0 references
machine learning and data science
0 references
offline reinforcement learning
0 references
unmeasured confounding
0 references
semiparametric efficiency
0 references