Long-Term Reward Prediction in TD Models of the Dopamine System
From MaRDI portal
Publication:4409377
DOI10.1162/089976602760407973zbMath1021.92005OpenAlexW2171535819WikidataQ40621633 ScholiaQ40621633MaRDI QIDQ4409377
Nathaniel D. Daw, David S. Touretzky
Publication date: 22 October 2003
Published in: Neural Computation (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1162/089976602760407973
Related Items (9)
Multiple model-based reinforcement learning explains dopamine neuronal activity ⋮ Computational algorithms and neuronal network models underlying decision processes ⋮ Neural systems implicated in delayed and probabilistic reinforcement ⋮ The Actor-Critic Learning Is Behind the Matching Law: Matching Versus Optimal Behaviors ⋮ Hyperbolically Discounted Temporal Difference Learning ⋮ Internal-Time Temporal Difference Model for Neural Value-Based Decision Making ⋮ Representation and Timing in Theories of the Dopamine System ⋮ A Neurocomputational Model for Cocaine Addiction ⋮ Reinforcement learning in the brain
Cites Work
This page was built for publication: Long-Term Reward Prediction in TD Models of the Dopamine System