A stochastic policy search model for matching behavior
From MaRDI portal
Publication:350884
DOI10.1007/s11432-011-4304-xzbMath1267.68178OpenAlexW2056870039MaRDI QIDQ350884
Zhidong Deng, Zhenbo Cheng, Yu Zhang
Publication date: 3 July 2013
Published in: Science China. Information Sciences (Search for Journal in Brave)
Full work available at URL: http://engine.scichina.com/doi/10.1007/s11432-011-4304-x
Learning and adaptive systems in artificial intelligence (68T05) Neural biology (92C20) Animal behavior (92D50) Measurement and performance in psychology (91E45)
Cites Work
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Dopamine modulation in the basal ganglia locks the gate to working memory
- The Actor-Critic Learning Is Behind the Matching Law: Matching Versus Optimal Behaviors
- Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia
- Unnamed Item
- Unnamed Item
This page was built for publication: A stochastic policy search model for matching behavior