Cargando…
Learning predictive cognitive maps with spiking neurons during behavior and replays
The hippocampus has been proposed to encode environments using a representation that contains predictive information about likely future states, called the successor representation. However, it is not clear how such a representation could be learned in the hippocampal circuit. Here, we propose a pla...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
eLife Sciences Publications, Ltd
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019888/ https://www.ncbi.nlm.nih.gov/pubmed/36927625 http://dx.doi.org/10.7554/eLife.80671 |
_version_ | 1784908129439318016 |
---|---|
author | Bono, Jacopo Zannone, Sara Pedrosa, Victor Clopath, Claudia |
author_facet | Bono, Jacopo Zannone, Sara Pedrosa, Victor Clopath, Claudia |
author_sort | Bono, Jacopo |
collection | PubMed |
description | The hippocampus has been proposed to encode environments using a representation that contains predictive information about likely future states, called the successor representation. However, it is not clear how such a representation could be learned in the hippocampal circuit. Here, we propose a plasticity rule that can learn this predictive map of the environment using a spiking neural network. We connect this biologically plausible plasticity rule to reinforcement learning, mathematically and numerically showing that it implements the TD-lambda algorithm. By spanning these different levels, we show how our framework naturally encompasses behavioral activity and replays, smoothly moving from rate to temporal coding, and allows learning over behavioral timescales with a plasticity rule acting on a timescale of milliseconds. We discuss how biological parameters such as dwelling times at states, neuronal firing rates and neuromodulation relate to the delay discounting parameter of the TD algorithm, and how they influence the learned representation. We also find that, in agreement with psychological studies and contrary to reinforcement learning theory, the discount factor decreases hyperbolically with time. Finally, our framework suggests a role for replays, in both aiding learning in novel environments and finding shortcut trajectories that were not experienced during behavior, in agreement with experimental data. |
format | Online Article Text |
id | pubmed-10019888 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | eLife Sciences Publications, Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-100198882023-03-17 Learning predictive cognitive maps with spiking neurons during behavior and replays Bono, Jacopo Zannone, Sara Pedrosa, Victor Clopath, Claudia eLife Computational and Systems Biology The hippocampus has been proposed to encode environments using a representation that contains predictive information about likely future states, called the successor representation. However, it is not clear how such a representation could be learned in the hippocampal circuit. Here, we propose a plasticity rule that can learn this predictive map of the environment using a spiking neural network. We connect this biologically plausible plasticity rule to reinforcement learning, mathematically and numerically showing that it implements the TD-lambda algorithm. By spanning these different levels, we show how our framework naturally encompasses behavioral activity and replays, smoothly moving from rate to temporal coding, and allows learning over behavioral timescales with a plasticity rule acting on a timescale of milliseconds. We discuss how biological parameters such as dwelling times at states, neuronal firing rates and neuromodulation relate to the delay discounting parameter of the TD algorithm, and how they influence the learned representation. We also find that, in agreement with psychological studies and contrary to reinforcement learning theory, the discount factor decreases hyperbolically with time. Finally, our framework suggests a role for replays, in both aiding learning in novel environments and finding shortcut trajectories that were not experienced during behavior, in agreement with experimental data. eLife Sciences Publications, Ltd 2023-03-16 /pmc/articles/PMC10019888/ /pubmed/36927625 http://dx.doi.org/10.7554/eLife.80671 Text en © 2023, Bono, Zannone et al https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited. |
spellingShingle | Computational and Systems Biology Bono, Jacopo Zannone, Sara Pedrosa, Victor Clopath, Claudia Learning predictive cognitive maps with spiking neurons during behavior and replays |
title | Learning predictive cognitive maps with spiking neurons during behavior and replays |
title_full | Learning predictive cognitive maps with spiking neurons during behavior and replays |
title_fullStr | Learning predictive cognitive maps with spiking neurons during behavior and replays |
title_full_unstemmed | Learning predictive cognitive maps with spiking neurons during behavior and replays |
title_short | Learning predictive cognitive maps with spiking neurons during behavior and replays |
title_sort | learning predictive cognitive maps with spiking neurons during behavior and replays |
topic | Computational and Systems Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019888/ https://www.ncbi.nlm.nih.gov/pubmed/36927625 http://dx.doi.org/10.7554/eLife.80671 |
work_keys_str_mv | AT bonojacopo learningpredictivecognitivemapswithspikingneuronsduringbehaviorandreplays AT zannonesara learningpredictivecognitivemapswithspikingneuronsduringbehaviorandreplays AT pedrosavictor learningpredictivecognitivemapswithspikingneuronsduringbehaviorandreplays AT clopathclaudia learningpredictivecognitivemapswithspikingneuronsduringbehaviorandreplays |