On ergodic two-armed bandits

From MaRDI portal
Publication:417067

DOI: 10.1214/10-AAP751
zbMATH: 1275.62056
arXiv: 0905.0463
OpenAlex: W2136855133
MaRDI QID: Q417067

Pierre Tarrès, Pierre Vandekerkhove

Publication date: 13 May 2012

Published in: The Annals of Applied Probability

Full work available at URL: https://arxiv.org/abs/0905.0463


zbMATH Keywords

convergence; stochastic algorithms


Mathematics Subject Classification ID

Stochastic approximation (62L20); Sequential statistical design (62L05); Statistical aspects of information-theoretic topics (62B10)


Related Items (1)

Convergence in models with bounded expected relative hazard rates




Cites Work

  • Unnamed Item
  • A penalized bandit algorithm
  • A two armed bandit type problem
  • When can the two-armed bandit algorithm be trusted?
  • Stochastic algorithms
  • The law of the iterated logarithm for additive functionals of Markov chains
  • On the linear model with two absorbing barriers
  • Stochastic approximation with averaging innovation applied to Finance
  • A two armed bandit type problem revisited
  • How Fast Is the Bandit?
  • Learning Automata - A Survey
  • Use of Stochastic Automata for Parameter Self-Optimization with Multimodal Performance Criteria




This page was last edited on 30 January 2024, at 03:43.