Pages that link to "Item:Q682295"
From MaRDI portal
The following pages link to Targeted sequential design for targeted learning inference of the optimal treatment rule and its mean reward (Q682295):
Displaying 6 items.
- Statistical inference for the mean outcome under a possibly non-unique optimal treatment strategy (Q282469)
- Performance guarantees for policy learning (Q2227481)
- Statistical Inference for Online Decision Making via Stochastic Gradient Descent (Q4999148)
- Statistical Inference for Online Decision Making: In a Contextual Bandit Setting (Q5857145)
- Relaxing the i.i.d. assumption: adaptively minimax optimal regret via root-entropic regularization (Q6183761)
- Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning (Q6651384)