Index-based policies for discounted multi-armed bandits on parallel machines.

From MaRDI portal
Publication:1872472