Gorthaur-EXP3: bandit-based selection from a portfolio of recommendation algorithms balancing the accuracy-diversity dilemma
DOI10.1016/j.ins.2020.08.106zbMath1475.68374OpenAlexW3082976567MaRDI QIDQ2055544
Olivier Camp, Fabien Chhel, Nicolas Gutowski, Tassadit Amghar
Publication date: 1 December 2021
Published in: Information Sciences (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.ins.2020.08.106
multi-armed banditrecommendation systemsapplication of reinforcement learningcontextual multi-armed banditportfolio approach
Learning and adaptive systems in artificial intelligence (68T05) Stopping times; optimal stopping problems; gambling theory (60G40) Theory of languages and software systems (knowledge-based systems, expert systems, etc.) for artificial intelligence (68T35)
Related Items (1)
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Analyzing bandit-based adaptive operator selection mechanisms
- Learning dynamic algorithm portfolios
- Simple explanation of the no-free-lunch theorem and its implications
- Multi-objective Contextual Multi-armed Bandit With a Dominant Objective
- The Nonstochastic Multiarmed Bandit Problem
- 10.1162/153244303321897663
- Algorithm Selection for Combinatorial Search Problems: A Survey
- Some aspects of the sequential design of experiments
This page was built for publication: Gorthaur-EXP3: bandit-based selection from a portfolio of recommendation algorithms balancing the accuracy-diversity dilemma