Player-optimal stable regret for bandit learning in matching markets
From MaRDI portal
Publication:6538584
DOI10.1137/1.9781611977554.ch55MaRDI QIDQ6538584
Publication date: 14 May 2024
This page was built for publication: Player-optimal stable regret for bandit learning in matching markets