Cargando…

Updating dopamine reward signals

Recent work has advanced our knowledge of phasic dopamine reward prediction error signals. The error signal is bidirectional, reflects well the higher order prediction error described by temporal difference learning models, is compatible with model-free and model-based reinforcement learning, report...

Descripción completa

Detalles Bibliográficos
Autor principal: Schultz, Wolfram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Current Biology 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3866681/
https://www.ncbi.nlm.nih.gov/pubmed/23267662
http://dx.doi.org/10.1016/j.conb.2012.11.012
_version_ 1782296204206407680
author Schultz, Wolfram
author_facet Schultz, Wolfram
author_sort Schultz, Wolfram
collection PubMed
description Recent work has advanced our knowledge of phasic dopamine reward prediction error signals. The error signal is bidirectional, reflects well the higher order prediction error described by temporal difference learning models, is compatible with model-free and model-based reinforcement learning, reports the subjective rather than physical reward value during temporal discounting and reflects subjective stimulus perception rather than physical stimulus aspects. Dopamine activations are primarily driven by reward, and to some extent risk, whereas punishment and salience have only limited activating effects when appropriate controls are respected. The signal is homogeneous in terms of time course but heterogeneous in many other aspects. It is essential for synaptic plasticity and a range of behavioural learning situations.
format Online
Article
Text
id pubmed-3866681
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Current Biology
record_format MEDLINE/PubMed
spelling pubmed-38666812013-12-18 Updating dopamine reward signals Schultz, Wolfram Curr Opin Neurobiol Article Recent work has advanced our knowledge of phasic dopamine reward prediction error signals. The error signal is bidirectional, reflects well the higher order prediction error described by temporal difference learning models, is compatible with model-free and model-based reinforcement learning, reports the subjective rather than physical reward value during temporal discounting and reflects subjective stimulus perception rather than physical stimulus aspects. Dopamine activations are primarily driven by reward, and to some extent risk, whereas punishment and salience have only limited activating effects when appropriate controls are respected. The signal is homogeneous in terms of time course but heterogeneous in many other aspects. It is essential for synaptic plasticity and a range of behavioural learning situations. Current Biology 2013-04 /pmc/articles/PMC3866681/ /pubmed/23267662 http://dx.doi.org/10.1016/j.conb.2012.11.012 Text en © 2013 Elsevier Ltd. https://creativecommons.org/licenses/by/3.0/ Open Access under CC BY 3.0 (https://creativecommons.org/licenses/by/3.0/) license
spellingShingle Article
Schultz, Wolfram
Updating dopamine reward signals
title Updating dopamine reward signals
title_full Updating dopamine reward signals
title_fullStr Updating dopamine reward signals
title_full_unstemmed Updating dopamine reward signals
title_short Updating dopamine reward signals
title_sort updating dopamine reward signals
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3866681/
https://www.ncbi.nlm.nih.gov/pubmed/23267662
http://dx.doi.org/10.1016/j.conb.2012.11.012
work_keys_str_mv AT schultzwolfram updatingdopaminerewardsignals