Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning (Q5089723)
From MaRDI portal
scientific article; zbMATH DE number 7556848
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning |
scientific article; zbMATH DE number 7556848 |
Statements
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning (English)
0 references
15 July 2022
0 references
Thompson sampling
0 references
contextual bandits
0 references
reinforcement learning
0 references
0 references