Cargando…

A law of data separation in deep learning

While deep learning has enabled significant advances in many areas of science, its black-box nature hinders architecture design for future artificial intelligence applications and interpretation for high-stakes decision-makings. We addressed this issue by studying the fundamental question of how dee...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Hangfeng, Su, Weijie J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483613/
https://www.ncbi.nlm.nih.gov/pubmed/37639604
http://dx.doi.org/10.1073/pnas.2221704120
_version_ 1785102423797268480
author He, Hangfeng
Su, Weijie J.
author_facet He, Hangfeng
Su, Weijie J.
author_sort He, Hangfeng
collection PubMed
description While deep learning has enabled significant advances in many areas of science, its black-box nature hinders architecture design for future artificial intelligence applications and interpretation for high-stakes decision-makings. We addressed this issue by studying the fundamental question of how deep neural networks process data in the intermediate layers. Our finding is a simple and quantitative law that governs how deep neural networks separate data according to class membership throughout all layers for classification. This law shows that each layer improves data separation at a constant geometric rate, and its emergence is observed in a collection of network architectures and datasets during training. This law offers practical guidelines for designing architectures, improving model robustness and out-of-sample performance, as well as interpreting the predictions.
format Online
Article
Text
id pubmed-10483613
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-104836132023-09-08 A law of data separation in deep learning He, Hangfeng Su, Weijie J. Proc Natl Acad Sci U S A Physical Sciences While deep learning has enabled significant advances in many areas of science, its black-box nature hinders architecture design for future artificial intelligence applications and interpretation for high-stakes decision-makings. We addressed this issue by studying the fundamental question of how deep neural networks process data in the intermediate layers. Our finding is a simple and quantitative law that governs how deep neural networks separate data according to class membership throughout all layers for classification. This law shows that each layer improves data separation at a constant geometric rate, and its emergence is observed in a collection of network architectures and datasets during training. This law offers practical guidelines for designing architectures, improving model robustness and out-of-sample performance, as well as interpreting the predictions. National Academy of Sciences 2023-08-28 2023-09-05 /pmc/articles/PMC10483613/ /pubmed/37639604 http://dx.doi.org/10.1073/pnas.2221704120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Physical Sciences
He, Hangfeng
Su, Weijie J.
A law of data separation in deep learning
title A law of data separation in deep learning
title_full A law of data separation in deep learning
title_fullStr A law of data separation in deep learning
title_full_unstemmed A law of data separation in deep learning
title_short A law of data separation in deep learning
title_sort law of data separation in deep learning
topic Physical Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483613/
https://www.ncbi.nlm.nih.gov/pubmed/37639604
http://dx.doi.org/10.1073/pnas.2221704120
work_keys_str_mv AT hehangfeng alawofdataseparationindeeplearning
AT suweijiej alawofdataseparationindeeplearning
AT hehangfeng lawofdataseparationindeeplearning
AT suweijiej lawofdataseparationindeeplearning