The following pages link to (Q5744820):
Displaying 14 items.
- Doubly robust policy evaluation and optimization (Q252797) (← links)
- Adversarial balancing-based representation learning for causal effect inference with observational data (Q2036786) (← links)
- Variational learning from implicit bandit feedback (Q2071347) (← links)
- Lessons on off-policy methods from a notification component of a chatbot (Q2071403) (← links)
- Constructing effective personalized policies using counterfactual inference from biased data sets with many features (Q2425241) (← links)
- Learning MAX-SAT from contextual examples for combinatorial optimisation (Q2680761) (← links)
- An Efficient Algorithm for Learning with Semi-bandit Feedback (Q2859220) (← links)
- Counterfactual reasoning and learning systems: the example of computational advertising (Q2933945) (← links)
- More Efficient Policy Learning via Optimal Retargeting (Q4999139) (← links)
- (Q5148951) (← links)
- (Q5159398) (← links)
- Learning When-to-Treat Policies (Q5857115) (← links)
- Orthogonal statistical learning (Q6136574) (← links)
- Off-policy evaluation in partially observed Markov decision processes under sequential ignorability (Q6183750) (← links)