Multi-objective multi-armed bandit with lexicographically ordered and satisficing objectives
From MaRDI portal
Publication:2051318
DOI: 10.1007/s10994-021-05956-1
OpenAlex: W3159323250
MaRDI QID: Q2051318
Publication date: 24 November 2021
Published in: Machine Learning
Full work available at URL: http://hdl.handle.net/11693/77165
Cites Work
- Unnamed Item
- Unnamed Item
- Active learning in heteroscedastic noise
- Asymptotically efficient adaptive allocation rules
- Optimal sequential sampling from two populations
- Exceptional Paper—Lexicographic Orders, Utilities and Decision Rules: A Survey
- Satisficing in Multi-Armed Bandit Problems
- Multi-objective Contextual Multi-armed Bandit With a Dominant Objective
- A Structured Multiarmed Bandit Problem and the Greedy Policy
- Bandit Algorithms
- Explore First, Exploit Next: The True Shape of Regret in Bandit Problems
- Multicriteria Optimization