Cargando…
Overparameterized neural networks implement associative memory
Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mecha...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959487/ https://www.ncbi.nlm.nih.gov/pubmed/33067397 http://dx.doi.org/10.1073/pnas.2005013117 |
_version_ | 1783664971660394496 |
---|---|
author | Radhakrishnan, Adityanarayanan Belkin, Mikhail Uhler, Caroline |
author_facet | Radhakrishnan, Adityanarayanan Belkin, Mikhail Uhler, Caroline |
author_sort | Radhakrishnan, Adityanarayanan |
collection | PubMed |
description | Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mechanism for real-valued data. We provide empirical evidence that 1) overparameterized autoencoders store training samples as attractors and thus iterating the learned map leads to sample recovery, and that 2) the same mechanism allows for encoding sequences of examples and serves as an even more efficient mechanism for memory than autoencoding. Theoretically, we prove that when trained on a single example, autoencoders store the example as an attractor. Lastly, by treating a sequence encoder as a composition of maps, we prove that sequence encoding provides a more efficient mechanism for memory than autoencoding. |
format | Online Article Text |
id | pubmed-7959487 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-79594872021-03-22 Overparameterized neural networks implement associative memory Radhakrishnan, Adityanarayanan Belkin, Mikhail Uhler, Caroline Proc Natl Acad Sci U S A Physical Sciences Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mechanism for real-valued data. We provide empirical evidence that 1) overparameterized autoencoders store training samples as attractors and thus iterating the learned map leads to sample recovery, and that 2) the same mechanism allows for encoding sequences of examples and serves as an even more efficient mechanism for memory than autoencoding. Theoretically, we prove that when trained on a single example, autoencoders store the example as an attractor. Lastly, by treating a sequence encoder as a composition of maps, we prove that sequence encoding provides a more efficient mechanism for memory than autoencoding. National Academy of Sciences 2020-11-03 2020-10-16 /pmc/articles/PMC7959487/ /pubmed/33067397 http://dx.doi.org/10.1073/pnas.2005013117 Text en Copyright © 2020 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Physical Sciences Radhakrishnan, Adityanarayanan Belkin, Mikhail Uhler, Caroline Overparameterized neural networks implement associative memory |
title | Overparameterized neural networks implement associative memory |
title_full | Overparameterized neural networks implement associative memory |
title_fullStr | Overparameterized neural networks implement associative memory |
title_full_unstemmed | Overparameterized neural networks implement associative memory |
title_short | Overparameterized neural networks implement associative memory |
title_sort | overparameterized neural networks implement associative memory |
topic | Physical Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959487/ https://www.ncbi.nlm.nih.gov/pubmed/33067397 http://dx.doi.org/10.1073/pnas.2005013117 |
work_keys_str_mv | AT radhakrishnanadityanarayanan overparameterizedneuralnetworksimplementassociativememory AT belkinmikhail overparameterizedneuralnetworksimplementassociativememory AT uhlercaroline overparameterizedneuralnetworksimplementassociativememory |