Cargando…

Learning predictive cognitive maps with spiking neurons during behavior and replays

The hippocampus has been proposed to encode environments using a representation that contains predictive information about likely future states, called the successor representation. However, it is not clear how such a representation could be learned in the hippocampal circuit. Here, we propose a pla...

Descripción completa

Detalles Bibliográficos
Autores principales: Bono, Jacopo, Zannone, Sara, Pedrosa, Victor, Clopath, Claudia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: eLife Sciences Publications, Ltd 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019888/
https://www.ncbi.nlm.nih.gov/pubmed/36927625
http://dx.doi.org/10.7554/eLife.80671
_version_ 1784908129439318016
author Bono, Jacopo
Zannone, Sara
Pedrosa, Victor
Clopath, Claudia
author_facet Bono, Jacopo
Zannone, Sara
Pedrosa, Victor
Clopath, Claudia
author_sort Bono, Jacopo
collection PubMed
description The hippocampus has been proposed to encode environments using a representation that contains predictive information about likely future states, called the successor representation. However, it is not clear how such a representation could be learned in the hippocampal circuit. Here, we propose a plasticity rule that can learn this predictive map of the environment using a spiking neural network. We connect this biologically plausible plasticity rule to reinforcement learning, mathematically and numerically showing that it implements the TD-lambda algorithm. By spanning these different levels, we show how our framework naturally encompasses behavioral activity and replays, smoothly moving from rate to temporal coding, and allows learning over behavioral timescales with a plasticity rule acting on a timescale of milliseconds. We discuss how biological parameters such as dwelling times at states, neuronal firing rates and neuromodulation relate to the delay discounting parameter of the TD algorithm, and how they influence the learned representation. We also find that, in agreement with psychological studies and contrary to reinforcement learning theory, the discount factor decreases hyperbolically with time. Finally, our framework suggests a role for replays, in both aiding learning in novel environments and finding shortcut trajectories that were not experienced during behavior, in agreement with experimental data.
format Online
Article
Text
id pubmed-10019888
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher eLife Sciences Publications, Ltd
record_format MEDLINE/PubMed
spelling pubmed-100198882023-03-17 Learning predictive cognitive maps with spiking neurons during behavior and replays Bono, Jacopo Zannone, Sara Pedrosa, Victor Clopath, Claudia eLife Computational and Systems Biology The hippocampus has been proposed to encode environments using a representation that contains predictive information about likely future states, called the successor representation. However, it is not clear how such a representation could be learned in the hippocampal circuit. Here, we propose a plasticity rule that can learn this predictive map of the environment using a spiking neural network. We connect this biologically plausible plasticity rule to reinforcement learning, mathematically and numerically showing that it implements the TD-lambda algorithm. By spanning these different levels, we show how our framework naturally encompasses behavioral activity and replays, smoothly moving from rate to temporal coding, and allows learning over behavioral timescales with a plasticity rule acting on a timescale of milliseconds. We discuss how biological parameters such as dwelling times at states, neuronal firing rates and neuromodulation relate to the delay discounting parameter of the TD algorithm, and how they influence the learned representation. We also find that, in agreement with psychological studies and contrary to reinforcement learning theory, the discount factor decreases hyperbolically with time. Finally, our framework suggests a role for replays, in both aiding learning in novel environments and finding shortcut trajectories that were not experienced during behavior, in agreement with experimental data. eLife Sciences Publications, Ltd 2023-03-16 /pmc/articles/PMC10019888/ /pubmed/36927625 http://dx.doi.org/10.7554/eLife.80671 Text en © 2023, Bono, Zannone et al https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited.
spellingShingle Computational and Systems Biology
Bono, Jacopo
Zannone, Sara
Pedrosa, Victor
Clopath, Claudia
Learning predictive cognitive maps with spiking neurons during behavior and replays
title Learning predictive cognitive maps with spiking neurons during behavior and replays
title_full Learning predictive cognitive maps with spiking neurons during behavior and replays
title_fullStr Learning predictive cognitive maps with spiking neurons during behavior and replays
title_full_unstemmed Learning predictive cognitive maps with spiking neurons during behavior and replays
title_short Learning predictive cognitive maps with spiking neurons during behavior and replays
title_sort learning predictive cognitive maps with spiking neurons during behavior and replays
topic Computational and Systems Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019888/
https://www.ncbi.nlm.nih.gov/pubmed/36927625
http://dx.doi.org/10.7554/eLife.80671
work_keys_str_mv AT bonojacopo learningpredictivecognitivemapswithspikingneuronsduringbehaviorandreplays
AT zannonesara learningpredictivecognitivemapswithspikingneuronsduringbehaviorandreplays
AT pedrosavictor learningpredictivecognitivemapswithspikingneuronsduringbehaviorandreplays
AT clopathclaudia learningpredictivecognitivemapswithspikingneuronsduringbehaviorandreplays