Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Create a new EntitySchema
Merge two items
In other projects
Discussion
View source
View history
Purge
English
Log in

Reinforcement learning from comparisons: three alternatives are enough, two are not

From MaRDI portal
Publication:1688021
Jump to:navigation, search

DOI10.1214/16-AAP1271zbMath1379.60081OpenAlexW2962773167MaRDI QIDQ1688021

Jean-François Laslier, Benoît Laslier

Publication date: 4 January 2018

Published in: The Annals of Applied Probability (Search for Journal in Brave)

Full work available at URL: https://projecteuclid.org/euclid.aoap/1509696037


zbMATH Keywords

learningtournamenturn process


Mathematics Subject Classification ID

Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Memory and learning in psychology (91E40) Evolutionary games (91A22)


Related Items (3)

An urn model with random multiple drawing and random addition ⋮ Asymptotic behaviour of the one-dimensional ``rock-paper-scissors cyclic cellular automaton ⋮ Multiple drawing multi-colour urns by stochastic approximation






This page was built for publication: Reinforcement learning from comparisons: three alternatives are enough, two are not

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1688021&oldid=14004326"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
MaRDI portal item
This page was last edited on 1 February 2024, at 05:33.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki