The Actor-Critic Learning Is Behind the Matching Law: Matching Versus Optimal Behaviors
Publication:3539961
DOI: 10.1162/NECO.2008.20.1.227
zbMath: 1207.92050
OpenAlex: W2172030907
Wikidata: Q51899607
Scholia: Q51899607
MaRDI QID: Q3539961
Publication date: 19 November 2008
Published in: Neural Computation
Full work available at URL: https://doi.org/10.1162/neco.2008.20.1.227
Related Items (6)
Statistical Mechanics of Reward-Modulated Learning in Decision-Making Networks
A stochastic policy search model for matching behavior
Model-based estimation of subjective values using choice tasks with probabilistic feedback
The relation between reinforcement learning parameters and the influence of reinforcement history on choice behavior
Dynamical Regimes in Neural Network Models of Matching Behavior
Operant Matching as a Nash Equilibrium of an Intertemporal Game