Robust and efficient algorithms for conversational contextual bandit
From MaRDI portal
Publication:6180012
DOI: 10.1016/J.INS.2023.119993, MaRDI QID: Q6180012
Ming-Sheng Shang, Haoran Gu, Yun-Ni Xia, Xiaoyu Shi, Hong Xie
Publication date: 18 January 2024
Published in: Information Sciences
Keywords: upper confidence bound, regret analysis, conversational contextual bandit, imbalanced conversation feedback