Cargando…

Correlates of reward-predictive value in learning-related hippocampal neural activity

Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning signals that are derived from this algorithm, the predictive value and the prediction error, have been shown to explain changes in neural activity and behavior during learning across species. Here, the predict...

Descripción completa

Detalles Bibliográficos
Autor principal: Okatan, Murat
Formato: Texto
Lenguaje:English
Publicado: Wiley Subscription Services, Inc., A Wiley Company 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2742500/
https://www.ncbi.nlm.nih.gov/pubmed/19123250
http://dx.doi.org/10.1002/hipo.20535
_version_ 1782171822064664576
author Okatan, Murat
author_facet Okatan, Murat
author_sort Okatan, Murat
collection PubMed
description Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning signals that are derived from this algorithm, the predictive value and the prediction error, have been shown to explain changes in neural activity and behavior during learning across species. Here, the predictive value signal is used to explain the time course of learning-related changes in the activity of hippocampal neurons in monkeys performing an associative learning task. The TD algorithm serves as the centerpiece of a joint probability model for the learning-related neural activity and the behavioral responses recorded during the task. The neural component of the model consists of spiking neurons that compete and learn the reward-predictive value of task-relevant input signals. The predictive-value signaled by these neurons influences the behavioral response generated by a stochastic decision stage, which constitutes the behavioral component of the model. It is shown that the time course of the changes in neural activity and behavioral performance generated by the model exhibits key features of the experimental data. The results suggest that information about correct associations may be expressed in the hippocampus before it is detected in the behavior of a subject. In this way, the hippocampus may be among the earliest brain areas to express learning and drive the behavioral changes associated with learning. Correlates of reward-predictive value may be expressed in the hippocampus through rate remapping within spatial memory representations, they may represent reward-related aspects of a declarative or explicit relational memory representation of task contingencies, or they may correspond to reward-related components of episodic memory representations. These potential functions are discussed in connection with hippocampal cell assembly sequences and their reverse reactivation during the awake state. The results provide further support for the proposal that neural processes underlying learning may be implementing a temporal difference-like algorithm.
format Text
id pubmed-2742500
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Wiley Subscription Services, Inc., A Wiley Company
record_format MEDLINE/PubMed
spelling pubmed-27425002009-09-15 Correlates of reward-predictive value in learning-related hippocampal neural activity Okatan, Murat Hippocampus Research Article Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning signals that are derived from this algorithm, the predictive value and the prediction error, have been shown to explain changes in neural activity and behavior during learning across species. Here, the predictive value signal is used to explain the time course of learning-related changes in the activity of hippocampal neurons in monkeys performing an associative learning task. The TD algorithm serves as the centerpiece of a joint probability model for the learning-related neural activity and the behavioral responses recorded during the task. The neural component of the model consists of spiking neurons that compete and learn the reward-predictive value of task-relevant input signals. The predictive-value signaled by these neurons influences the behavioral response generated by a stochastic decision stage, which constitutes the behavioral component of the model. It is shown that the time course of the changes in neural activity and behavioral performance generated by the model exhibits key features of the experimental data. The results suggest that information about correct associations may be expressed in the hippocampus before it is detected in the behavior of a subject. In this way, the hippocampus may be among the earliest brain areas to express learning and drive the behavioral changes associated with learning. Correlates of reward-predictive value may be expressed in the hippocampus through rate remapping within spatial memory representations, they may represent reward-related aspects of a declarative or explicit relational memory representation of task contingencies, or they may correspond to reward-related components of episodic memory representations. These potential functions are discussed in connection with hippocampal cell assembly sequences and their reverse reactivation during the awake state. The results provide further support for the proposal that neural processes underlying learning may be implementing a temporal difference-like algorithm. Wiley Subscription Services, Inc., A Wiley Company 2009-05 /pmc/articles/PMC2742500/ /pubmed/19123250 http://dx.doi.org/10.1002/hipo.20535 Text en Copyright © 2009 Wiley-Liss, Inc., A Wiley Company http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation.
spellingShingle Research Article
Okatan, Murat
Correlates of reward-predictive value in learning-related hippocampal neural activity
title Correlates of reward-predictive value in learning-related hippocampal neural activity
title_full Correlates of reward-predictive value in learning-related hippocampal neural activity
title_fullStr Correlates of reward-predictive value in learning-related hippocampal neural activity
title_full_unstemmed Correlates of reward-predictive value in learning-related hippocampal neural activity
title_short Correlates of reward-predictive value in learning-related hippocampal neural activity
title_sort correlates of reward-predictive value in learning-related hippocampal neural activity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2742500/
https://www.ncbi.nlm.nih.gov/pubmed/19123250
http://dx.doi.org/10.1002/hipo.20535
work_keys_str_mv AT okatanmurat correlatesofrewardpredictivevalueinlearningrelatedhippocampalneuralactivity