Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes (Q6381582)
From MaRDI portal
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes | preprint article from arXiv | |
Statements
28 October 2021
cs.LG
math.OC
math.ST
stat.ML
stat.TH
Andrew Bennett
Nathan Kallus