Reinforcement Learning in Latent Heterogeneous Environments
From MaRDI portal
Publication:6651418
DOI10.1080/01621459.2024.2308317MaRDI QIDQ6651418
Michael I. Jordan, Elynn Y. Chen, Rui Song
Publication date: 10 December 2024
Published in: Journal of the American Statistical Association (Search for Journal in Brave)
Cites Work
- Nearly unbiased variable selection under minimax concave penalty
- Optimizing active surveillance for prostate cancer using partially observable Markov decision processes
- Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
- Constructing dynamic treatment regimes over indefinite time horizons
- The variance of discounted Markov decision processes
- Discretizing Unobserved Heterogeneity
- Statistical Foundations of Data Science
- Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning
- Grouping Pursuit Through a Regularization Solution Surface
- Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process
- Statistical inference of the value function for reinforcement learning in infinite-horizon settings
This page was built for publication: Reinforcement Learning in Latent Heterogeneous Environments