Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning (Q5089723)

From MaRDI portal
scientific article; zbMATH DE number 7556848
Language Label Description Also known as
English
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning
scientific article; zbMATH DE number 7556848

    Statements

    Identifiers