Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization
From MaRDI portal
Publication:6631700
DOI10.1080/01621459.2023.2238942MaRDI QIDQ6631700
Fan Zhou, Zhengling Qi, Chengchun Shi, Jianing Wang
Publication date: 1 November 2024
Published in: Journal of the American Statistical Association (Search for Journal in Brave)
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Statistical inference for the mean outcome under a possibly non-unique optimal treatment strategy
- Statistical methods for dynamic treatment regimes. Reinforcement learning, causal inference, and personalized medicine
- Dynamic treatment regimes: technical challenges and applications
- Gaussian approximation of suprema of empirical processes
- Performance guarantees for individualized treatment rules
- Semiparametric theory for causal mediation analysis: efficiency bounds, multiple robustness and sensitivity analysis
- Basic properties of strong mixing conditions. A survey and some open questions
- Fast learning rates for plug-in classifiers
- High-dimensional \(A\)-learning for optimal dynamic treatment regimes
- \({\mathcal Q}\)-learning
- Optimal aggregation of classifiers in statistical learning.
- Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher
- Quantile-Optimal Treatment Regimes
- Constructing dynamic treatment regimes over indefinite time horizons
- Optimal Dynamic Treatment Regimes
- Bounded, Efficient and Multiply Robust Estimation of Average Treatment Effects Using Instrumental Variables
- Multiply Robust Causal Inference with Double-Negative Control Adjustment for Categorical Unmeasured Confounding
- Double/debiased machine learning for treatment and structural parameters
- Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning
- New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes
- Personalized Policy Learning Using Longitudinal Mobile Health Data
This page was built for publication: Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization