The Robbins-Isbell Two-Armed-Bandit Problem with Finite Memory
From MaRDI portal
Publication:5343908
DOI10.1214/aoms/1177699897zbMath0133.41701OpenAlexW1973532850MaRDI QIDQ5343908
Carter Vincent Smith, Ronald Pyke
Publication date: 1965
Published in: The Annals of Mathematical Statistics (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1214/aoms/1177699897
Related Items
Unnamed Item, The apparent conflict between estimation and control - a survey of the two-armed bandit problem, Comparison of eigenvectors of irreducible stochastic matrices