Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Create a new EntitySchema
Merge two items
In other projects
Discussion
View source
View history
Purge
English
Log in

Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem)

From MaRDI portal
Publication:664264
Jump to:navigation, search

DOI10.1134/S0005117911050092zbMath1235.93268MaRDI QIDQ664264

Alex V. Kolnogorov

Publication date: 1 March 2012

Published in: Automation and Remote Control (Search for Journal in Brave)



Mathematics Subject Classification ID

Stochastic learning and adaptive control (93E35)


Related Items (3)

Gaussian two-armed bandit and optimization of batch data processing ⋮ Poissonian two-armed bandit: a new approach ⋮ Parallel design of robust control in the stochastic environment (the two-armed bandit problem)



Cites Work

  • An Asymptotic Minimax Theorem for the Two Armed Bandit Problem
  • Some Remarks on the Two-Armed Bandit
  • Some aspects of the sequential design of experiments
  • Unnamed Item
  • Unnamed Item
  • Unnamed Item
  • Unnamed Item


This page was built for publication: Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:664264&oldid=12574276"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
MaRDI portal item
This page was last edited on 30 January 2024, at 09:18.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki