Cargando…

Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks

To ensure survival, animals must update the internal representations of their environment in a trial-and-error fashion. Psychological studies of associative learning and neurophysiological analyses of dopaminergic neurons have suggested that this updating process involves the temporal-difference (TD...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wen, Chentao, Ogura, Yukiko, Matsushima, Toshiya
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2016
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5099172/ https://www.ncbi.nlm.nih.gov/pubmed/27877100 http://dx.doi.org/10.3389/fnins.2016.00476

_version_	1782465894639730688
author	Wen, Chentao Ogura, Yukiko Matsushima, Toshiya
author_facet	Wen, Chentao Ogura, Yukiko Matsushima, Toshiya
author_sort	Wen, Chentao
collection	PubMed
description	To ensure survival, animals must update the internal representations of their environment in a trial-and-error fashion. Psychological studies of associative learning and neurophysiological analyses of dopaminergic neurons have suggested that this updating process involves the temporal-difference (TD) method in the basal ganglia network. However, the way in which the component variables of the TD method are implemented at the neuronal level is unclear. To investigate the underlying neural mechanisms, we trained domestic chicks to associate color cues with food rewards. We recorded neuronal activities from the medial striatum or tegmentum in a freely behaving condition and examined how reward omission changed neuronal firing. To compare neuronal activities with the signals assumed in the TD method, we simulated the behavioral task in the form of a finite sequence composed of discrete steps of time. The three signals assumed in the simulated task were the prediction signal, the target signal for updating, and the TD-error signal. In both the medial striatum and tegmentum, the majority of recorded neurons were categorized into three types according to their fitness for three models, though these neurons tended to form a continuum spectrum without distinct differences in the firing rate. Specifically, two types of striatal neurons successfully mimicked the target signal and the prediction signal. A linear summation of these two types of striatum neurons was a good fit for the activity of one type of tegmental neurons mimicking the TD-error signal. The present study thus demonstrates that the striatum and tegmentum can convey the signals critically required for the TD method. Based on the theoretical and neurophysiological studies, together with tract-tracing data, we propose a novel model to explain how the convergence of signals represented in the striatum could lead to the computation of TD error in tegmental dopaminergic neurons.
format	Online Article Text
id	pubmed-5099172
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-50991722016-11-22 Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks Wen, Chentao Ogura, Yukiko Matsushima, Toshiya Front Neurosci Neuroscience To ensure survival, animals must update the internal representations of their environment in a trial-and-error fashion. Psychological studies of associative learning and neurophysiological analyses of dopaminergic neurons have suggested that this updating process involves the temporal-difference (TD) method in the basal ganglia network. However, the way in which the component variables of the TD method are implemented at the neuronal level is unclear. To investigate the underlying neural mechanisms, we trained domestic chicks to associate color cues with food rewards. We recorded neuronal activities from the medial striatum or tegmentum in a freely behaving condition and examined how reward omission changed neuronal firing. To compare neuronal activities with the signals assumed in the TD method, we simulated the behavioral task in the form of a finite sequence composed of discrete steps of time. The three signals assumed in the simulated task were the prediction signal, the target signal for updating, and the TD-error signal. In both the medial striatum and tegmentum, the majority of recorded neurons were categorized into three types according to their fitness for three models, though these neurons tended to form a continuum spectrum without distinct differences in the firing rate. Specifically, two types of striatal neurons successfully mimicked the target signal and the prediction signal. A linear summation of these two types of striatum neurons was a good fit for the activity of one type of tegmental neurons mimicking the TD-error signal. The present study thus demonstrates that the striatum and tegmentum can convey the signals critically required for the TD method. Based on the theoretical and neurophysiological studies, together with tract-tracing data, we propose a novel model to explain how the convergence of signals represented in the striatum could lead to the computation of TD error in tegmental dopaminergic neurons. Frontiers Media S.A. 2016-11-08 /pmc/articles/PMC5099172/ /pubmed/27877100 http://dx.doi.org/10.3389/fnins.2016.00476 Text en Copyright © 2016 Wen, Ogura and Matsushima. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Wen, Chentao Ogura, Yukiko Matsushima, Toshiya Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks
title	Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks
title_full	Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks
title_fullStr	Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks
title_full_unstemmed	Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks
title_short	Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks
title_sort	striatal and tegmental neurons code critical signals for temporal-difference learning of state value in domestic chicks
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5099172/ https://www.ncbi.nlm.nih.gov/pubmed/27877100 http://dx.doi.org/10.3389/fnins.2016.00476
work_keys_str_mv	AT wenchentao striatalandtegmentalneuronscodecriticalsignalsfortemporaldifferencelearningofstatevalueindomesticchicks AT ogurayukiko striatalandtegmentalneuronscodecriticalsignalsfortemporaldifferencelearningofstatevalueindomesticchicks AT matsushimatoshiya striatalandtegmentalneuronscodecriticalsignalsfortemporaldifferencelearningofstatevalueindomesticchicks

Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks

Ejemplares similares