Robust control of the multi-armed bandit problem
Publication: 2095215
DOI: 10.1007/s10479-015-1965-7 · zbMath: 1506.90268 · OpenAlex: W3124603229 · MaRDI QID: Q2095215
Aparupa Das Gupta, Felipe Caro
Publication date: 9 November 2022
Published in: Annals of Operations Research
Full work available at URL: https://doi.org/10.1007/s10479-015-1965-7
Keywords: Bellman equation; project selection; index policies; multiarmed bandit; robust Markov decision processes; uncertain transition matrix
Related Items (2)
- Optimal Learning Under Robustness and Time-Consistency
- Computation of weighted sums of rewards for concurrent MDPs
Cites Work
- Robust decomposable Markov decision processes motivated by allocating school budgets
- Four proofs of Gittins' multiarmed bandit theorem
- The multi-armed bandit, with constraints
- Asymptotically efficient adaptive allocation rules
- Arm-acquiring bandits
- Algorithms for evaluating the dynamic allocation index
- Bounded-parameter Markov decision processes
- Lagrangian relaxation and constraint generation for allocation and advanced scheduling
- Optimal adaptive policies for sequential allocation problems
- On the optimality of the Gittins index rule for multi-armed bandits with multiple plays
- A dynamic programming approach to adjustable robust optimization
- Percentile Optimization for Markov Decision Processes with Parameter Uncertainty
- Indexability of bandit problems with response delays
- The Multi-Armed Bandit Problem: Decomposition and Computation
- Funding Criteria for Research, Development, and Exploration Projects
- Markov Decision Processes with Imprecise Transition Probabilities
- Optimal Adaptive Policies for Markov Decision Processes
- Markovian Decision Processes with Uncertain Transition Probabilities
- The Nonstochastic Multiarmed Bandit Problem
- Robust Markov Decision Processes
- Robust Control of Markov Decision Processes with Uncertain Transition Matrices
- Multi-armed bandits under general depreciation and commitment
- Robust Dynamic Programming