Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Merge two items
In other projects
MaRDI portal item
Discussion
View source
View history
Purge
English
Log in

On the value of learning for Bernoulli bandits with unknown parameters

From MaRDI portal
Publication:2730296
Jump to:navigation, search

DOI10.1109/9.887641zbMath0976.90119OpenAlexW2125930088MaRDI QIDQ2730296

Ger Koole, Sandjai Bhulai

Publication date: 5 August 2001

Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL: https://semanticscholar.org/paper/b13e103016215fc8b68ee47f59432ab1e11793fc


zbMATH Keywords

bandit problempartially observed Markov decision problemBayesian adaptive control


Mathematics Subject Classification ID

Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40)


Related Items (1)

Solving two‐armed Bernoulli bandit problems using a Bayesian learning automaton







This page was built for publication: On the value of learning for Bernoulli bandits with unknown parameters

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2730296&oldid=15586827"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
This page was last edited on 3 February 2024, at 13:52.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki