Cargando…
Utilizing Information Bottleneck to Evaluate the Capability of Deep Neural Networks for Image Classification †
Inspired by the pioneering work of the information bottleneck (IB) principle for Deep Neural Networks’ (DNNs) analysis, we thoroughly study the relationship among the model accuracy, [Formula: see text] and [Formula: see text] , where [Formula: see text] and [Formula: see text] are the mutual inform...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514945/ https://www.ncbi.nlm.nih.gov/pubmed/33267170 http://dx.doi.org/10.3390/e21050456 |
_version_ | 1783586704798515200 |
---|---|
author | Cheng, Hao Lian, Dongze Gao, Shenghua Geng, Yanlin |
author_facet | Cheng, Hao Lian, Dongze Gao, Shenghua Geng, Yanlin |
author_sort | Cheng, Hao |
collection | PubMed |
description | Inspired by the pioneering work of the information bottleneck (IB) principle for Deep Neural Networks’ (DNNs) analysis, we thoroughly study the relationship among the model accuracy, [Formula: see text] and [Formula: see text] , where [Formula: see text] and [Formula: see text] are the mutual information of DNN’s output T with input X and label Y. Then, we design an information plane-based framework to evaluate the capability of DNNs (including CNNs) for image classification. Instead of each hidden layer’s output, our framework focuses on the model output T. We successfully apply our framework to many application scenarios arising in deep learning and image classification problems, such as image classification with unbalanced data distribution, model selection, and transfer learning. The experimental results verify the effectiveness of the information plane-based framework: Our framework may facilitate a quick model selection and determine the number of samples needed for each class in the unbalanced classification problem. Furthermore, the framework explains the efficiency of transfer learning in the deep learning area. |
format | Online Article Text |
id | pubmed-7514945 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75149452020-11-09 Utilizing Information Bottleneck to Evaluate the Capability of Deep Neural Networks for Image Classification † Cheng, Hao Lian, Dongze Gao, Shenghua Geng, Yanlin Entropy (Basel) Article Inspired by the pioneering work of the information bottleneck (IB) principle for Deep Neural Networks’ (DNNs) analysis, we thoroughly study the relationship among the model accuracy, [Formula: see text] and [Formula: see text] , where [Formula: see text] and [Formula: see text] are the mutual information of DNN’s output T with input X and label Y. Then, we design an information plane-based framework to evaluate the capability of DNNs (including CNNs) for image classification. Instead of each hidden layer’s output, our framework focuses on the model output T. We successfully apply our framework to many application scenarios arising in deep learning and image classification problems, such as image classification with unbalanced data distribution, model selection, and transfer learning. The experimental results verify the effectiveness of the information plane-based framework: Our framework may facilitate a quick model selection and determine the number of samples needed for each class in the unbalanced classification problem. Furthermore, the framework explains the efficiency of transfer learning in the deep learning area. MDPI 2019-05-01 /pmc/articles/PMC7514945/ /pubmed/33267170 http://dx.doi.org/10.3390/e21050456 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Cheng, Hao Lian, Dongze Gao, Shenghua Geng, Yanlin Utilizing Information Bottleneck to Evaluate the Capability of Deep Neural Networks for Image Classification † |
title | Utilizing Information Bottleneck to Evaluate the Capability of Deep Neural Networks for Image Classification † |
title_full | Utilizing Information Bottleneck to Evaluate the Capability of Deep Neural Networks for Image Classification † |
title_fullStr | Utilizing Information Bottleneck to Evaluate the Capability of Deep Neural Networks for Image Classification † |
title_full_unstemmed | Utilizing Information Bottleneck to Evaluate the Capability of Deep Neural Networks for Image Classification † |
title_short | Utilizing Information Bottleneck to Evaluate the Capability of Deep Neural Networks for Image Classification † |
title_sort | utilizing information bottleneck to evaluate the capability of deep neural networks for image classification † |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514945/ https://www.ncbi.nlm.nih.gov/pubmed/33267170 http://dx.doi.org/10.3390/e21050456 |
work_keys_str_mv | AT chenghao utilizinginformationbottlenecktoevaluatethecapabilityofdeepneuralnetworksforimageclassification AT liandongze utilizinginformationbottlenecktoevaluatethecapabilityofdeepneuralnetworksforimageclassification AT gaoshenghua utilizinginformationbottlenecktoevaluatethecapabilityofdeepneuralnetworksforimageclassification AT gengyanlin utilizinginformationbottlenecktoevaluatethecapabilityofdeepneuralnetworksforimageclassification |