Minimax Off-Policy Evaluation for Multi-Armed Bandits (Q5096994)

From MaRDI portal
scientific article; zbMATH DE number 7573342
Language Label Description Also known as
English
Minimax Off-Policy Evaluation for Multi-Armed Bandits
scientific article; zbMATH DE number 7573342

    Statements

    Minimax Off-Policy Evaluation for Multi-Armed Bandits (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    19 August 2022
    0 references
    off-policy evaluation
    0 references
    multi-armed bandit
    0 references
    bounded rewards
    0 references
    minimax rate-optimal procedures
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references