A smoothed Q‐learning algorithm for estimating optimal dynamic treatment regimes
From MaRDI portal
Publication:5381071
DOI10.1111/sjos.12359zbMath1418.62391OpenAlexW2757814716MaRDI QIDQ5381071
Yanqin Fan, Ming He, Liangjun Su, Xiao-Hua Andrew Zhou
Publication date: 7 June 2019
Published in: Scandinavian Journal of Statistics (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1111/sjos.12359
asymptotic normalityoptimal smoothing parametersequential randomizationWald-type inferenceexceptional law
Applications of statistics to biology and medical sciences; meta analysis (62P10) Sequential estimation (62L12)
This page was built for publication: A smoothed Q‐learning algorithm for estimating optimal dynamic treatment regimes