Randomized Rules for the Two-Armed-Bandit with Finite Memory
From MaRDI portal
Publication:5578620
DOI10.1214/aoms/1177698038zbMath0187.15203OpenAlexW2046467452MaRDI QIDQ5578620
Publication date: 1968
Published in: The Annals of Mathematical Statistics (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1214/aoms/1177698038
Related Items
Two-Armed Bandit Strategies that Discount Past and Future, The apparent conflict between estimation and control - a survey of the two-armed bandit problem