Cargando…

Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals

Humans can learn several tasks in succession with minimal mutual interference but perform more poorly when trained on multiple tasks at once. The opposite is true for standard deep neural networks. Here, we propose novel computational constraints for artificial neural networks, inspired by earlier w...

Descripción completa

Detalles Bibliográficos
Autores principales: Flesch, Timo, Nagy, David G., Saxe, Andrew, Summerfield, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851563/
https://www.ncbi.nlm.nih.gov/pubmed/36656823
http://dx.doi.org/10.1371/journal.pcbi.1010808
_version_ 1784872427882283008
author Flesch, Timo
Nagy, David G.
Saxe, Andrew
Summerfield, Christopher
author_facet Flesch, Timo
Nagy, David G.
Saxe, Andrew
Summerfield, Christopher
author_sort Flesch, Timo
collection PubMed
description Humans can learn several tasks in succession with minimal mutual interference but perform more poorly when trained on multiple tasks at once. The opposite is true for standard deep neural networks. Here, we propose novel computational constraints for artificial neural networks, inspired by earlier work on gating in the primate prefrontal cortex, that capture the cost of interleaved training and allow the network to learn two tasks in sequence without forgetting. We augment standard stochastic gradient descent with two algorithmic motifs, so-called “sluggish” task units and a Hebbian training step that strengthens connections between task units and hidden units that encode task-relevant information. We found that the “sluggish” units introduce a switch-cost during training, which biases representations under interleaved training towards a joint representation that ignores the contextual cue, while the Hebbian step promotes the formation of a gating scheme from task units to the hidden layer that produces orthogonal representations which are perfectly guarded against interference. Validating the model on previously published human behavioural data revealed that it matches performance of participants who had been trained on blocked or interleaved curricula, and that these performance differences were driven by misestimation of the true category boundary.
format Online
Article
Text
id pubmed-9851563
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-98515632023-01-20 Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals Flesch, Timo Nagy, David G. Saxe, Andrew Summerfield, Christopher PLoS Comput Biol Research Article Humans can learn several tasks in succession with minimal mutual interference but perform more poorly when trained on multiple tasks at once. The opposite is true for standard deep neural networks. Here, we propose novel computational constraints for artificial neural networks, inspired by earlier work on gating in the primate prefrontal cortex, that capture the cost of interleaved training and allow the network to learn two tasks in sequence without forgetting. We augment standard stochastic gradient descent with two algorithmic motifs, so-called “sluggish” task units and a Hebbian training step that strengthens connections between task units and hidden units that encode task-relevant information. We found that the “sluggish” units introduce a switch-cost during training, which biases representations under interleaved training towards a joint representation that ignores the contextual cue, while the Hebbian step promotes the formation of a gating scheme from task units to the hidden layer that produces orthogonal representations which are perfectly guarded against interference. Validating the model on previously published human behavioural data revealed that it matches performance of participants who had been trained on blocked or interleaved curricula, and that these performance differences were driven by misestimation of the true category boundary. Public Library of Science 2023-01-19 /pmc/articles/PMC9851563/ /pubmed/36656823 http://dx.doi.org/10.1371/journal.pcbi.1010808 Text en © 2023 Flesch et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Flesch, Timo
Nagy, David G.
Saxe, Andrew
Summerfield, Christopher
Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals
title Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals
title_full Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals
title_fullStr Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals
title_full_unstemmed Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals
title_short Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals
title_sort modelling continual learning in humans with hebbian context gating and exponentially decaying task signals
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851563/
https://www.ncbi.nlm.nih.gov/pubmed/36656823
http://dx.doi.org/10.1371/journal.pcbi.1010808
work_keys_str_mv AT fleschtimo modellingcontinuallearninginhumanswithhebbiancontextgatingandexponentiallydecayingtasksignals
AT nagydavidg modellingcontinuallearninginhumanswithhebbiancontextgatingandexponentiallydecayingtasksignals
AT saxeandrew modellingcontinuallearninginhumanswithhebbiancontextgatingandexponentiallydecayingtasksignals
AT summerfieldchristopher modellingcontinuallearninginhumanswithhebbiancontextgatingandexponentiallydecayingtasksignals