Cargando…
A Geometric Perspective on Information Plane Analysis
Information plane analysis, describing the mutual information between the input and a hidden layer and between a hidden layer and the target over time, has recently been proposed to analyze the training of neural networks. Since the activations of a hidden layer are typically continuous-valued, this...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8228019/ https://www.ncbi.nlm.nih.gov/pubmed/34205211 http://dx.doi.org/10.3390/e23060711 |
_version_ | 1783712644336713728 |
---|---|
author | Basirat, Mina Geiger, Bernhard C. Roth, Peter M. |
author_facet | Basirat, Mina Geiger, Bernhard C. Roth, Peter M. |
author_sort | Basirat, Mina |
collection | PubMed |
description | Information plane analysis, describing the mutual information between the input and a hidden layer and between a hidden layer and the target over time, has recently been proposed to analyze the training of neural networks. Since the activations of a hidden layer are typically continuous-valued, this mutual information cannot be computed analytically and must thus be estimated, resulting in apparently inconsistent or even contradicting results in the literature. The goal of this paper is to demonstrate how information plane analysis can still be a valuable tool for analyzing neural network training. To this end, we complement the prevailing binning estimator for mutual information with a geometric interpretation. With this geometric interpretation in mind, we evaluate the impact of regularization and interpret phenomena such as underfitting and overfitting. In addition, we investigate neural network learning in the presence of noisy data and noisy labels. |
format | Online Article Text |
id | pubmed-8228019 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-82280192021-06-26 A Geometric Perspective on Information Plane Analysis Basirat, Mina Geiger, Bernhard C. Roth, Peter M. Entropy (Basel) Article Information plane analysis, describing the mutual information between the input and a hidden layer and between a hidden layer and the target over time, has recently been proposed to analyze the training of neural networks. Since the activations of a hidden layer are typically continuous-valued, this mutual information cannot be computed analytically and must thus be estimated, resulting in apparently inconsistent or even contradicting results in the literature. The goal of this paper is to demonstrate how information plane analysis can still be a valuable tool for analyzing neural network training. To this end, we complement the prevailing binning estimator for mutual information with a geometric interpretation. With this geometric interpretation in mind, we evaluate the impact of regularization and interpret phenomena such as underfitting and overfitting. In addition, we investigate neural network learning in the presence of noisy data and noisy labels. MDPI 2021-06-03 /pmc/articles/PMC8228019/ /pubmed/34205211 http://dx.doi.org/10.3390/e23060711 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Basirat, Mina Geiger, Bernhard C. Roth, Peter M. A Geometric Perspective on Information Plane Analysis |
title | A Geometric Perspective on Information Plane Analysis |
title_full | A Geometric Perspective on Information Plane Analysis |
title_fullStr | A Geometric Perspective on Information Plane Analysis |
title_full_unstemmed | A Geometric Perspective on Information Plane Analysis |
title_short | A Geometric Perspective on Information Plane Analysis |
title_sort | geometric perspective on information plane analysis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8228019/ https://www.ncbi.nlm.nih.gov/pubmed/34205211 http://dx.doi.org/10.3390/e23060711 |
work_keys_str_mv | AT basiratmina ageometricperspectiveoninformationplaneanalysis AT geigerbernhardc ageometricperspectiveoninformationplaneanalysis AT rothpeterm ageometricperspectiveoninformationplaneanalysis AT basiratmina geometricperspectiveoninformationplaneanalysis AT geigerbernhardc geometricperspectiveoninformationplaneanalysis AT rothpeterm geometricperspectiveoninformationplaneanalysis |