Cargando…
Correspondence between neuroevolution and gradient descent
We show analytically that training a neural network by conditioned stochastic mutation or neuroevolution of its weights is equivalent, in the limit of small mutations, to gradient descent on the loss function in the presence of Gaussian white noise. Averaged over independent realizations of the lear...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8563972/ https://www.ncbi.nlm.nih.gov/pubmed/34728632 http://dx.doi.org/10.1038/s41467-021-26568-2 |
_version_ | 1784593515811962880 |
---|---|
author | Whitelam, Stephen Selin, Viktor Park, Sang-Won Tamblyn, Isaac |
author_facet | Whitelam, Stephen Selin, Viktor Park, Sang-Won Tamblyn, Isaac |
author_sort | Whitelam, Stephen |
collection | PubMed |
description | We show analytically that training a neural network by conditioned stochastic mutation or neuroevolution of its weights is equivalent, in the limit of small mutations, to gradient descent on the loss function in the presence of Gaussian white noise. Averaged over independent realizations of the learning process, neuroevolution is equivalent to gradient descent on the loss function. We use numerical simulation to show that this correspondence can be observed for finite mutations, for shallow and deep neural networks. Our results provide a connection between two families of neural-network training methods that are usually considered to be fundamentally different. |
format | Online Article Text |
id | pubmed-8563972 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-85639722021-11-19 Correspondence between neuroevolution and gradient descent Whitelam, Stephen Selin, Viktor Park, Sang-Won Tamblyn, Isaac Nat Commun Article We show analytically that training a neural network by conditioned stochastic mutation or neuroevolution of its weights is equivalent, in the limit of small mutations, to gradient descent on the loss function in the presence of Gaussian white noise. Averaged over independent realizations of the learning process, neuroevolution is equivalent to gradient descent on the loss function. We use numerical simulation to show that this correspondence can be observed for finite mutations, for shallow and deep neural networks. Our results provide a connection between two families of neural-network training methods that are usually considered to be fundamentally different. Nature Publishing Group UK 2021-11-02 /pmc/articles/PMC8563972/ /pubmed/34728632 http://dx.doi.org/10.1038/s41467-021-26568-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Whitelam, Stephen Selin, Viktor Park, Sang-Won Tamblyn, Isaac Correspondence between neuroevolution and gradient descent |
title | Correspondence between neuroevolution and gradient descent |
title_full | Correspondence between neuroevolution and gradient descent |
title_fullStr | Correspondence between neuroevolution and gradient descent |
title_full_unstemmed | Correspondence between neuroevolution and gradient descent |
title_short | Correspondence between neuroevolution and gradient descent |
title_sort | correspondence between neuroevolution and gradient descent |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8563972/ https://www.ncbi.nlm.nih.gov/pubmed/34728632 http://dx.doi.org/10.1038/s41467-021-26568-2 |
work_keys_str_mv | AT whitelamstephen correspondencebetweenneuroevolutionandgradientdescent AT selinviktor correspondencebetweenneuroevolutionandgradientdescent AT parksangwon correspondencebetweenneuroevolutionandgradientdescent AT tamblynisaac correspondencebetweenneuroevolutionandgradientdescent |