Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process
From MaRDI portal
Publication:6153991
DOI10.1080/01621459.2022.2110878arXiv2202.10589WikidataQ114641964 ScholiaQ114641964MaRDI QIDQ6153991
Hong-Tu Zhu, Chengchun Shi, Unnamed Author, Jin Zhu, Shikai Luo, Rui Song
Publication date: 19 March 2024
Published in: Journal of the American Statistical Association (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2202.10589
statistical inferencereinforcement learningunmeasured confoundersoff-policy evaluationInfinite horizonsridesourcing platforms
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Statistical inference for the mean outcome under a possibly non-unique optimal treatment strategy
- Gaussian approximation of suprema of empirical processes
- Nonparametric regression using deep neural networks with ReLU activation function
- Evaluating marker-guided treatment selection strategies
- Asymptotic Statistics
- Estimation and Inference of Heterogeneous Treatment Effects using Random Forests
- Quantile-Optimal Treatment Regimes
- Optimal Dynamic Treatment Regimes
- A Robust Method for Estimating Optimal Treatment Regimes
- Multiply Robust Causal Inference with Double-Negative Control Adjustment for Categorical Unmeasured Confounding
- Double/debiased machine learning for treatment and structural parameters
- Robust Inference on Population Indirect Causal Effects: The Generalized Front Door Criterion
- Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning
- Optimal Structural Nested Models for Optimal Sequential Decisions
- Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions
- Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health
- Resampling‐based confidence intervals for model‐free robust inference on optimal treatment regimes
This page was built for publication: Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process