Cargando…

Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization

Generalization is the ability to apply past experience to similar but non-identical situations. It not only affects stimulus-outcome relationships, as observed in conditioning experiments, but may also be essential for adaptive behaviors, which involve the interaction between individuals and their e...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fujita, Yoshihisa, Yagishita, Sho, Kasai, Haruo, Ishii, Shin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2020
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7388898/ https://www.ncbi.nlm.nih.gov/pubmed/32774245 http://dx.doi.org/10.3389/fncom.2020.00066

_version_	1783564388012130304
author	Fujita, Yoshihisa Yagishita, Sho Kasai, Haruo Ishii, Shin
author_facet	Fujita, Yoshihisa Yagishita, Sho Kasai, Haruo Ishii, Shin
author_sort	Fujita, Yoshihisa
collection	PubMed
description	Generalization is the ability to apply past experience to similar but non-identical situations. It not only affects stimulus-outcome relationships, as observed in conditioning experiments, but may also be essential for adaptive behaviors, which involve the interaction between individuals and their environment. Computational modeling could potentially clarify the effect of generalization on adaptive behaviors and how this effect emerges from the underlying computation. Recent neurobiological observation indicated that the striatal dopamine system achieves generalization and subsequent discrimination by updating the corticostriatal synaptic connections in differential response to reward and punishment. In this study, we analyzed how computational characteristics in this neurobiological system affects adaptive behaviors. We proposed a novel reinforcement learning model with multilayer neural networks in which the synaptic weights of only the last layer are updated according to the prediction error. We set fixed connections between the input and hidden layers to maintain the similarity of inputs in the hidden-layer representation. This network enabled fast generalization of reward and punishment learning, and thereby facilitated safe and efficient exploration of spatial navigation tasks. Notably, it demonstrated a quick reward approach and efficient punishment aversion in the early learning phase, compared to algorithms that do not show generalization. However, disturbance of the network that causes noisy generalization and impaired discrimination induced maladaptive valuation. These results suggested the advantage and potential drawback of computation by the striatal dopamine system with regard to adaptive behaviors.
format	Online Article Text
id	pubmed-7388898
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-73888982020-08-07 Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization Fujita, Yoshihisa Yagishita, Sho Kasai, Haruo Ishii, Shin Front Comput Neurosci Neuroscience Generalization is the ability to apply past experience to similar but non-identical situations. It not only affects stimulus-outcome relationships, as observed in conditioning experiments, but may also be essential for adaptive behaviors, which involve the interaction between individuals and their environment. Computational modeling could potentially clarify the effect of generalization on adaptive behaviors and how this effect emerges from the underlying computation. Recent neurobiological observation indicated that the striatal dopamine system achieves generalization and subsequent discrimination by updating the corticostriatal synaptic connections in differential response to reward and punishment. In this study, we analyzed how computational characteristics in this neurobiological system affects adaptive behaviors. We proposed a novel reinforcement learning model with multilayer neural networks in which the synaptic weights of only the last layer are updated according to the prediction error. We set fixed connections between the input and hidden layers to maintain the similarity of inputs in the hidden-layer representation. This network enabled fast generalization of reward and punishment learning, and thereby facilitated safe and efficient exploration of spatial navigation tasks. Notably, it demonstrated a quick reward approach and efficient punishment aversion in the early learning phase, compared to algorithms that do not show generalization. However, disturbance of the network that causes noisy generalization and impaired discrimination induced maladaptive valuation. These results suggested the advantage and potential drawback of computation by the striatal dopamine system with regard to adaptive behaviors. Frontiers Media S.A. 2020-07-22 /pmc/articles/PMC7388898/ /pubmed/32774245 http://dx.doi.org/10.3389/fncom.2020.00066 Text en Copyright © 2020 Fujita, Yagishita, Kasai and Ishii. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Fujita, Yoshihisa Yagishita, Sho Kasai, Haruo Ishii, Shin Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization
title	Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization
title_full	Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization
title_fullStr	Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization
title_full_unstemmed	Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization
title_short	Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization
title_sort	computational characteristics of the striatal dopamine system described by reinforcement learning with fast generalization
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7388898/ https://www.ncbi.nlm.nih.gov/pubmed/32774245 http://dx.doi.org/10.3389/fncom.2020.00066
work_keys_str_mv	AT fujitayoshihisa computationalcharacteristicsofthestriataldopaminesystemdescribedbyreinforcementlearningwithfastgeneralization AT yagishitasho computationalcharacteristicsofthestriataldopaminesystemdescribedbyreinforcementlearningwithfastgeneralization AT kasaiharuo computationalcharacteristicsofthestriataldopaminesystemdescribedbyreinforcementlearningwithfastgeneralization AT ishiishin computationalcharacteristicsofthestriataldopaminesystemdescribedbyreinforcementlearningwithfastgeneralization

Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization

Ejemplares similares