KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints (Q6301530)

From MaRDI portal





preprint article from arXiv
Language Label Description Also known as
English
KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints
preprint article from arXiv

    Statements

    14 May 2018
    0 references
    stat.ML
    0 references
    cs.LG
    0 references
    math.ST
    0 references
    stat.TH
    0 references
    Aurélien Garivier
    0 references
    Hédi Hadiji
    0 references
    Pierre Menard
    0 references
    Gilles Stoltz
    0 references

    Identifiers

    0 references