A theoretical analysis of temporal difference learning in the iterated prisoner's dilemma game
From MaRDI portal
Publication:1048261
DOI: 10.1007/s11538-009-9424-8
zbMath: 1182.91048
OpenAlex: W2073008835
Wikidata: Q39975754
Scholia: Q39975754
MaRDI QID: Q1048261
Publication date: 11 January 2010
Published in: Bulletin of Mathematical Biology
Full work available at URL: https://doi.org/10.1007/s11538-009-9424-8
Cooperative games (91A12) ⋮ Models of societies, social and urban evolution (91D10) ⋮ Memory and learning in psychology (91E40) ⋮ Rationality and learning in game theory (91A26)
Related Items (4)
Immediate return preference emerged from a synaptic learning rule for return maximization ⋮ Global migration can lead to stronger spatial selection than local migration ⋮ Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated prisoner's dilemma ⋮ Evolution of cooperation facilitated by reinforcement learning with adaptive aspiration levels
Cites Work
- Unnamed Item
- Unnamed Item
- Learning to cooperate with Pavlov and adaptive strategy for the iterated prisoner's dilemma with noise
- The evolution of stochastic strategies in the prisoner's dilemma
- Game-dynamical aspects of the prisoner's dilemma
- Learning behavior in an experimental matching pennies game
- Individual learning in normal form games: Some laboratory results
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Dynamics of internal models in game players
- Practical issues in temporal difference learning
- \({\mathcal Q}\)-learning
- Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term
- Automata, repeated games and noise
- Learning dynamics in social dilemmas
- Chaos in learning a simple two-person game
- Experience-weighted attraction learning in normal form games