Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization

Cites Work

Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Statistical inference for the mean outcome under a possibly non-unique optimal treatment strategy
Statistical methods for dynamic treatment regimes. Reinforcement learning, causal inference, and personalized medicine
Dynamic treatment regimes: technical challenges and applications
Gaussian approximation of suprema of empirical processes
Performance guarantees for individualized treatment rules
Semiparametric theory for causal mediation analysis: efficiency bounds, multiple robustness and sensitivity analysis
Basic properties of strong mixing conditions. A survey and some open questions
Fast learning rates for plug-in classifiers
High-dimensional \(A\)-learning for optimal dynamic treatment regimes
\({\mathcal Q}\)-learning
Optimal aggregation of classifiers in statistical learning.
Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher
Quantile-Optimal Treatment Regimes
Constructing dynamic treatment regimes over indefinite time horizons
Optimal Dynamic Treatment Regimes
Bounded, Efficient and Multiply Robust Estimation of Average Treatment Effects Using Instrumental Variables
Multiply Robust Causal Inference with Double-Negative Control Adjustment for Categorical Unmeasured Confounding
Double/debiased machine learning for treatment and structural parameters
Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning
New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes
Personalized Policy Learning Using Longitudinal Mobile Health Data

This page was built for publication: Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization