Cargando…

Opponent Learning with Different Representations in the Cortico-Basal Ganglia Circuits

The direct and indirect pathways of the basal ganglia (BG) have been suggested to learn mainly from positive and negative feedbacks, respectively. Since these pathways unevenly receive inputs from different cortical neuron types and/or regions, they may preferentially use different state/action repr...

Descripción completa

Detalles Bibliográficos
Autores principales: Morita, Kenji, Shimomura, Kanji, Kawaguchi, Yasuo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Society for Neuroscience 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9884109/
https://www.ncbi.nlm.nih.gov/pubmed/36653187
http://dx.doi.org/10.1523/ENEURO.0422-22.2023
_version_ 1784879647769493504
author Morita, Kenji
Shimomura, Kanji
Kawaguchi, Yasuo
author_facet Morita, Kenji
Shimomura, Kanji
Kawaguchi, Yasuo
author_sort Morita, Kenji
collection PubMed
description The direct and indirect pathways of the basal ganglia (BG) have been suggested to learn mainly from positive and negative feedbacks, respectively. Since these pathways unevenly receive inputs from different cortical neuron types and/or regions, they may preferentially use different state/action representations. We explored whether such a combined use of different representations, coupled with different learning rates from positive and negative reward prediction errors (RPEs), has computational benefits. We modeled animal as an agent equipped with two learning systems, each of which adopted individual representation (IR) or successor representation (SR) of states. With varying the combination of IR or SR and also the learning rates from positive and negative RPEs in each system, we examined how the agent performed in a dynamic reward navigation task. We found that combination of SR-based system learning mainly from positive RPEs and IR-based system learning mainly from negative RPEs could achieve a good performance in the task, as compared with other combinations. In such a combination of appetitive SR-based and aversive IR-based systems, both systems show activities of comparable magnitudes with opposite signs, consistent with the suggested profiles of the two BG pathways. Moreover, the architecture of such a combination provides a novel coherent explanation for the functional significance and underlying mechanism of diverse findings about the cortico-BG circuits. These results suggest that particularly combining different representations with appetitive and aversive learning could be an effective learning strategy in certain dynamic environments, and it might actually be implemented in the cortico-BG circuits.
format Online
Article
Text
id pubmed-9884109
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Society for Neuroscience
record_format MEDLINE/PubMed
spelling pubmed-98841092023-01-30 Opponent Learning with Different Representations in the Cortico-Basal Ganglia Circuits Morita, Kenji Shimomura, Kanji Kawaguchi, Yasuo eNeuro Research Article: New Research The direct and indirect pathways of the basal ganglia (BG) have been suggested to learn mainly from positive and negative feedbacks, respectively. Since these pathways unevenly receive inputs from different cortical neuron types and/or regions, they may preferentially use different state/action representations. We explored whether such a combined use of different representations, coupled with different learning rates from positive and negative reward prediction errors (RPEs), has computational benefits. We modeled animal as an agent equipped with two learning systems, each of which adopted individual representation (IR) or successor representation (SR) of states. With varying the combination of IR or SR and also the learning rates from positive and negative RPEs in each system, we examined how the agent performed in a dynamic reward navigation task. We found that combination of SR-based system learning mainly from positive RPEs and IR-based system learning mainly from negative RPEs could achieve a good performance in the task, as compared with other combinations. In such a combination of appetitive SR-based and aversive IR-based systems, both systems show activities of comparable magnitudes with opposite signs, consistent with the suggested profiles of the two BG pathways. Moreover, the architecture of such a combination provides a novel coherent explanation for the functional significance and underlying mechanism of diverse findings about the cortico-BG circuits. These results suggest that particularly combining different representations with appetitive and aversive learning could be an effective learning strategy in certain dynamic environments, and it might actually be implemented in the cortico-BG circuits. Society for Neuroscience 2023-01-25 /pmc/articles/PMC9884109/ /pubmed/36653187 http://dx.doi.org/10.1523/ENEURO.0422-22.2023 Text en Copyright © 2023 Morita et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.
spellingShingle Research Article: New Research
Morita, Kenji
Shimomura, Kanji
Kawaguchi, Yasuo
Opponent Learning with Different Representations in the Cortico-Basal Ganglia Circuits
title Opponent Learning with Different Representations in the Cortico-Basal Ganglia Circuits
title_full Opponent Learning with Different Representations in the Cortico-Basal Ganglia Circuits
title_fullStr Opponent Learning with Different Representations in the Cortico-Basal Ganglia Circuits
title_full_unstemmed Opponent Learning with Different Representations in the Cortico-Basal Ganglia Circuits
title_short Opponent Learning with Different Representations in the Cortico-Basal Ganglia Circuits
title_sort opponent learning with different representations in the cortico-basal ganglia circuits
topic Research Article: New Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9884109/
https://www.ncbi.nlm.nih.gov/pubmed/36653187
http://dx.doi.org/10.1523/ENEURO.0422-22.2023
work_keys_str_mv AT moritakenji opponentlearningwithdifferentrepresentationsinthecorticobasalgangliacircuits
AT shimomurakanji opponentlearningwithdifferentrepresentationsinthecorticobasalgangliacircuits
AT kawaguchiyasuo opponentlearningwithdifferentrepresentationsinthecorticobasalgangliacircuits