A modified EXP3 in adversarial bandits with multi-user delayed feedback
From MaRDI portal
Publication:6591641
DOI10.1007/978-3-031-49193-1_20MaRDI QIDQ6591641
Publication date: 22 August 2024
Cites Work
- Title not available (Why is that?)
- Title not available (Why is that?)
- Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges
- Sur les fonctions convexes et les inégalités entre les valeurs moyennes.
- Contextual dependent click bandit algorithm for web recommendation
- A Modular Analysis of Adaptive (Non-)Convex Optimization: Optimism, Composite Objectives, and Variational Bounds
- Bandit Algorithms
This page was built for publication: A modified EXP3 in adversarial bandits with multi-user delayed feedback
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6591641)