Cargando…
Correlates of reward-predictive value in learning-related hippocampal neural activity
Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning signals that are derived from this algorithm, the predictive value and the prediction error, have been shown to explain changes in neural activity and behavior during learning across species. Here, the predict...
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Wiley Subscription Services, Inc., A Wiley Company
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2742500/ https://www.ncbi.nlm.nih.gov/pubmed/19123250 http://dx.doi.org/10.1002/hipo.20535 |
_version_ | 1782171822064664576 |
---|---|
author | Okatan, Murat |
author_facet | Okatan, Murat |
author_sort | Okatan, Murat |
collection | PubMed |
description | Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning signals that are derived from this algorithm, the predictive value and the prediction error, have been shown to explain changes in neural activity and behavior during learning across species. Here, the predictive value signal is used to explain the time course of learning-related changes in the activity of hippocampal neurons in monkeys performing an associative learning task. The TD algorithm serves as the centerpiece of a joint probability model for the learning-related neural activity and the behavioral responses recorded during the task. The neural component of the model consists of spiking neurons that compete and learn the reward-predictive value of task-relevant input signals. The predictive-value signaled by these neurons influences the behavioral response generated by a stochastic decision stage, which constitutes the behavioral component of the model. It is shown that the time course of the changes in neural activity and behavioral performance generated by the model exhibits key features of the experimental data. The results suggest that information about correct associations may be expressed in the hippocampus before it is detected in the behavior of a subject. In this way, the hippocampus may be among the earliest brain areas to express learning and drive the behavioral changes associated with learning. Correlates of reward-predictive value may be expressed in the hippocampus through rate remapping within spatial memory representations, they may represent reward-related aspects of a declarative or explicit relational memory representation of task contingencies, or they may correspond to reward-related components of episodic memory representations. These potential functions are discussed in connection with hippocampal cell assembly sequences and their reverse reactivation during the awake state. The results provide further support for the proposal that neural processes underlying learning may be implementing a temporal difference-like algorithm. |
format | Text |
id | pubmed-2742500 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Wiley Subscription Services, Inc., A Wiley Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-27425002009-09-15 Correlates of reward-predictive value in learning-related hippocampal neural activity Okatan, Murat Hippocampus Research Article Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning signals that are derived from this algorithm, the predictive value and the prediction error, have been shown to explain changes in neural activity and behavior during learning across species. Here, the predictive value signal is used to explain the time course of learning-related changes in the activity of hippocampal neurons in monkeys performing an associative learning task. The TD algorithm serves as the centerpiece of a joint probability model for the learning-related neural activity and the behavioral responses recorded during the task. The neural component of the model consists of spiking neurons that compete and learn the reward-predictive value of task-relevant input signals. The predictive-value signaled by these neurons influences the behavioral response generated by a stochastic decision stage, which constitutes the behavioral component of the model. It is shown that the time course of the changes in neural activity and behavioral performance generated by the model exhibits key features of the experimental data. The results suggest that information about correct associations may be expressed in the hippocampus before it is detected in the behavior of a subject. In this way, the hippocampus may be among the earliest brain areas to express learning and drive the behavioral changes associated with learning. Correlates of reward-predictive value may be expressed in the hippocampus through rate remapping within spatial memory representations, they may represent reward-related aspects of a declarative or explicit relational memory representation of task contingencies, or they may correspond to reward-related components of episodic memory representations. These potential functions are discussed in connection with hippocampal cell assembly sequences and their reverse reactivation during the awake state. The results provide further support for the proposal that neural processes underlying learning may be implementing a temporal difference-like algorithm. Wiley Subscription Services, Inc., A Wiley Company 2009-05 /pmc/articles/PMC2742500/ /pubmed/19123250 http://dx.doi.org/10.1002/hipo.20535 Text en Copyright © 2009 Wiley-Liss, Inc., A Wiley Company http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation. |
spellingShingle | Research Article Okatan, Murat Correlates of reward-predictive value in learning-related hippocampal neural activity |
title | Correlates of reward-predictive value in learning-related hippocampal neural activity |
title_full | Correlates of reward-predictive value in learning-related hippocampal neural activity |
title_fullStr | Correlates of reward-predictive value in learning-related hippocampal neural activity |
title_full_unstemmed | Correlates of reward-predictive value in learning-related hippocampal neural activity |
title_short | Correlates of reward-predictive value in learning-related hippocampal neural activity |
title_sort | correlates of reward-predictive value in learning-related hippocampal neural activity |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2742500/ https://www.ncbi.nlm.nih.gov/pubmed/19123250 http://dx.doi.org/10.1002/hipo.20535 |
work_keys_str_mv | AT okatanmurat correlatesofrewardpredictivevalueinlearningrelatedhippocampalneuralactivity |