Cargando…

Evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal

The phasic dopamine error signal is currently argued to be synonymous with the prediction error in Sutton and Barto (1987, 1998) model-free reinforcement learning algorithm (Schultz et al., 1997). This theory argues that phasic dopamine reflects a cached-value signal that endows reward-predictive cu...

Descripción completa

Detalles Bibliográficos
Autores principales: Sharpe, Melissa J., Schoenbaum, Geoffrey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6136434/
https://www.ncbi.nlm.nih.gov/pubmed/29269085
http://dx.doi.org/10.1016/j.nlm.2017.12.002
_version_ 1783354998139125760
author Sharpe, Melissa J.
Schoenbaum, Geoffrey
author_facet Sharpe, Melissa J.
Schoenbaum, Geoffrey
author_sort Sharpe, Melissa J.
collection PubMed
description The phasic dopamine error signal is currently argued to be synonymous with the prediction error in Sutton and Barto (1987, 1998) model-free reinforcement learning algorithm (Schultz et al., 1997). This theory argues that phasic dopamine reflects a cached-value signal that endows reward-predictive cues with the scalar value inherent in reward. Such an interpretation does not envision a role for dopamine in more complex cognitive representations between events which underlie many forms of associative learning, restricting the role dopamine can play in learning. The cached-value hypothesis of dopamine makes three concrete predictions about when a phasic dopamine response should be seen and what types of learning this signal should be able to promote. We discuss these predictions in light of recent evidence which we believe provide particularly strong tests of their validity. In doing so, we find that while the phasic dopamine signal conforms to a cached-value account in some circumstances, other evidence demonstrate that this signal is not restricted to a model-free cached-value reinforcement learning signal. In light of this evidence, we argue that the phasic dopamine signal functions more generally to signal violations of expectancies to drive real-world associations between events.
format Online
Article
Text
id pubmed-6136434
institution National Center for Biotechnology Information
language English
publishDate 2017
record_format MEDLINE/PubMed
spelling pubmed-61364342018-09-13 Evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal Sharpe, Melissa J. Schoenbaum, Geoffrey Neurobiol Learn Mem Article The phasic dopamine error signal is currently argued to be synonymous with the prediction error in Sutton and Barto (1987, 1998) model-free reinforcement learning algorithm (Schultz et al., 1997). This theory argues that phasic dopamine reflects a cached-value signal that endows reward-predictive cues with the scalar value inherent in reward. Such an interpretation does not envision a role for dopamine in more complex cognitive representations between events which underlie many forms of associative learning, restricting the role dopamine can play in learning. The cached-value hypothesis of dopamine makes three concrete predictions about when a phasic dopamine response should be seen and what types of learning this signal should be able to promote. We discuss these predictions in light of recent evidence which we believe provide particularly strong tests of their validity. In doing so, we find that while the phasic dopamine signal conforms to a cached-value account in some circumstances, other evidence demonstrate that this signal is not restricted to a model-free cached-value reinforcement learning signal. In light of this evidence, we argue that the phasic dopamine signal functions more generally to signal violations of expectancies to drive real-world associations between events. 2017-12-18 2018-09 /pmc/articles/PMC6136434/ /pubmed/29269085 http://dx.doi.org/10.1016/j.nlm.2017.12.002 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/BY-NC-ND/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ).
spellingShingle Article
Sharpe, Melissa J.
Schoenbaum, Geoffrey
Evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal
title Evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal
title_full Evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal
title_fullStr Evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal
title_full_unstemmed Evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal
title_short Evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal
title_sort evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6136434/
https://www.ncbi.nlm.nih.gov/pubmed/29269085
http://dx.doi.org/10.1016/j.nlm.2017.12.002
work_keys_str_mv AT sharpemelissaj evaluationofthehypothesisthatphasicdopamineconstitutesacachedvaluesignal
AT schoenbaumgeoffrey evaluationofthehypothesisthatphasicdopamineconstitutesacachedvaluesignal