Anytime-valid off-policy inference for contextual bandits (Q6414481)
From MaRDI portal
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Anytime-valid off-policy inference for contextual bandits | preprint article from arXiv | |
Statements
19 October 2022
stat.ME
cs.LG
math.ST
stat.ML
stat.TH
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro