scientific article; zbMATH DE number 7596797
Publication:5043718
zbMath 1500.91019, MaRDI QID Q5043718
S. V. Garbar, Alex V. Kolnogorov
Publication date: 6 October 2022
Full work available at URL: http://mathnet.ru/eng/mgta299
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Keywords: dynamic programming, Monte Carlo simulations, minimax approach, multi-armed bandit problem, invariant description, UCB rule, Gaussian multi-armed bandit
Cites Work
- Batched bandit problems
- Adaptive treatment allocation and the multi-armed bandit problem
- On Bayesian index policies for sequential resource allocation
- Gaussian two-armed bandit and optimization of batch data processing
- Gaussian two-armed bandit: limiting description
- An Asymptotic Minimax Theorem for the Two Armed Bandit Problem
- Sequential medical trials
- DOI: 10.1162/153244303321897663
- Bandit Algorithms
- Prediction, Learning, and Games
- Finite-time analysis of the multiarmed bandit problem