Cargando…

Overparameterized neural networks implement associative memory

Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mecha...

Descripción completa

Detalles Bibliográficos
Autores principales: Radhakrishnan, Adityanarayanan, Belkin, Mikhail, Uhler, Caroline
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959487/
https://www.ncbi.nlm.nih.gov/pubmed/33067397
http://dx.doi.org/10.1073/pnas.2005013117
_version_ 1783664971660394496
author Radhakrishnan, Adityanarayanan
Belkin, Mikhail
Uhler, Caroline
author_facet Radhakrishnan, Adityanarayanan
Belkin, Mikhail
Uhler, Caroline
author_sort Radhakrishnan, Adityanarayanan
collection PubMed
description Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mechanism for real-valued data. We provide empirical evidence that 1) overparameterized autoencoders store training samples as attractors and thus iterating the learned map leads to sample recovery, and that 2) the same mechanism allows for encoding sequences of examples and serves as an even more efficient mechanism for memory than autoencoding. Theoretically, we prove that when trained on a single example, autoencoders store the example as an attractor. Lastly, by treating a sequence encoder as a composition of maps, we prove that sequence encoding provides a more efficient mechanism for memory than autoencoding.
format Online
Article
Text
id pubmed-7959487
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-79594872021-03-22 Overparameterized neural networks implement associative memory Radhakrishnan, Adityanarayanan Belkin, Mikhail Uhler, Caroline Proc Natl Acad Sci U S A Physical Sciences Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mechanism for real-valued data. We provide empirical evidence that 1) overparameterized autoencoders store training samples as attractors and thus iterating the learned map leads to sample recovery, and that 2) the same mechanism allows for encoding sequences of examples and serves as an even more efficient mechanism for memory than autoencoding. Theoretically, we prove that when trained on a single example, autoencoders store the example as an attractor. Lastly, by treating a sequence encoder as a composition of maps, we prove that sequence encoding provides a more efficient mechanism for memory than autoencoding. National Academy of Sciences 2020-11-03 2020-10-16 /pmc/articles/PMC7959487/ /pubmed/33067397 http://dx.doi.org/10.1073/pnas.2005013117 Text en Copyright © 2020 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Physical Sciences
Radhakrishnan, Adityanarayanan
Belkin, Mikhail
Uhler, Caroline
Overparameterized neural networks implement associative memory
title Overparameterized neural networks implement associative memory
title_full Overparameterized neural networks implement associative memory
title_fullStr Overparameterized neural networks implement associative memory
title_full_unstemmed Overparameterized neural networks implement associative memory
title_short Overparameterized neural networks implement associative memory
title_sort overparameterized neural networks implement associative memory
topic Physical Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959487/
https://www.ncbi.nlm.nih.gov/pubmed/33067397
http://dx.doi.org/10.1073/pnas.2005013117
work_keys_str_mv AT radhakrishnanadityanarayanan overparameterizedneuralnetworksimplementassociativememory
AT belkinmikhail overparameterizedneuralnetworksimplementassociativememory
AT uhlercaroline overparameterizedneuralnetworksimplementassociativememory