Interactive Thompson sampling for multi-objective multi-armed bandits
From MaRDI portal
Publication:1990281
DOI: 10.1007/978-3-319-67504-6_2, zbMath: 1398.90082, OpenAlex: W2759684794, MaRDI QID: Q1990281
Luisa M. Zintgraf, Diederik M. Roijers, Ann Nowé
Publication date: 25 October 2018
Full work available at URL: https://doi.org/10.1007/978-3-319-67504-6_2
Management decision making, including multiple objectives (90B50)
Utility theory (91B16)
Software, source code, etc. for problems pertaining to operations research and mathematical programming (90-04)