
A stochastic policy search model for matching behavior

From MaRDI portal
Publication:350884

DOI: 10.1007/s11432-011-4304-x
zbMATH: 1267.68178
OpenAlex: W2056870039
MaRDI QID: Q350884

Zhidong Deng, Zhenbo Cheng, Yu Zhang

Publication date: 3 July 2013

Published in: Science China. Information Sciences

Full work available at URL: http://engine.scichina.com/doi/10.1007/s11432-011-4304-x


zbMATH Keywords

reinforcement learning; decision-making model; matching law; policy model


Mathematics Subject Classification ID

  • Learning and adaptive systems in artificial intelligence (68T05)
  • Neural biology (92C20)
  • Animal behavior (92D50)
  • Measurement and performance in psychology (91E45)




Cites Work

  • Simple statistical gradient-following algorithms for connectionist reinforcement learning
  • Dopamine modulation in the basal ganglia locks the gate to working memory
  • The Actor-Critic Learning Is Behind the Matching Law: Matching Versus Optimal Behaviors
  • Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia
  • Unnamed Item
  • Unnamed Item

