Cargando…
A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application
Nowadays, there is a tradeoff between the deep-learning module-compression ratio and the module accuracy. In this paper, a strategy for refining the pruning quantification and weights based on neural network filters is proposed. Firstly, filters in the neural network were refined into strip-like fil...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371170/ https://www.ncbi.nlm.nih.gov/pubmed/35957176 http://dx.doi.org/10.3390/s22155623 |
_version_ | 1784767054624063488 |
---|---|
author | Zhao, Ming Tong, Xindi Wu, Weixian Wang, Zhen Zhou, Bingxue Huang, Xiaodan |
author_facet | Zhao, Ming Tong, Xindi Wu, Weixian Wang, Zhen Zhou, Bingxue Huang, Xiaodan |
author_sort | Zhao, Ming |
collection | PubMed |
description | Nowadays, there is a tradeoff between the deep-learning module-compression ratio and the module accuracy. In this paper, a strategy for refining the pruning quantification and weights based on neural network filters is proposed. Firstly, filters in the neural network were refined into strip-like filter strips. Then, the evaluation of the filter strips was used to refine the partial importance of the filter, cut off the unimportant filter strips and reorganize the remaining filter strips. Finally, the training of the neural network after recombination was quantified to further compress the computational amount of the neural network. The results show that the method can significantly reduce the computational effort of the neural network and compress the number of parameters in the model. Based on experimental results on ResNet56, this method can reduce the number of parameters to 1/4 and the amount of calculation to 1/5, and the loss of model accuracy is only 0.01. On VGG16, the number of parameters is reduced to 1/14, the amount of calculation is reduced to 1/3, and the accuracy loss is 0.5%. |
format | Online Article Text |
id | pubmed-9371170 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-93711702022-08-12 A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application Zhao, Ming Tong, Xindi Wu, Weixian Wang, Zhen Zhou, Bingxue Huang, Xiaodan Sensors (Basel) Article Nowadays, there is a tradeoff between the deep-learning module-compression ratio and the module accuracy. In this paper, a strategy for refining the pruning quantification and weights based on neural network filters is proposed. Firstly, filters in the neural network were refined into strip-like filter strips. Then, the evaluation of the filter strips was used to refine the partial importance of the filter, cut off the unimportant filter strips and reorganize the remaining filter strips. Finally, the training of the neural network after recombination was quantified to further compress the computational amount of the neural network. The results show that the method can significantly reduce the computational effort of the neural network and compress the number of parameters in the model. Based on experimental results on ResNet56, this method can reduce the number of parameters to 1/4 and the amount of calculation to 1/5, and the loss of model accuracy is only 0.01. On VGG16, the number of parameters is reduced to 1/14, the amount of calculation is reduced to 1/3, and the accuracy loss is 0.5%. MDPI 2022-07-27 /pmc/articles/PMC9371170/ /pubmed/35957176 http://dx.doi.org/10.3390/s22155623 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhao, Ming Tong, Xindi Wu, Weixian Wang, Zhen Zhou, Bingxue Huang, Xiaodan A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application |
title | A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application |
title_full | A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application |
title_fullStr | A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application |
title_full_unstemmed | A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application |
title_short | A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application |
title_sort | novel deep-learning model compression based on filter-stripe group pruning and its iot application |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371170/ https://www.ncbi.nlm.nih.gov/pubmed/35957176 http://dx.doi.org/10.3390/s22155623 |
work_keys_str_mv | AT zhaoming anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT tongxindi anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT wuweixian anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT wangzhen anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT zhoubingxue anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT huangxiaodan anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT zhaoming noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT tongxindi noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT wuweixian noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT wangzhen noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT zhoubingxue noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication AT huangxiaodan noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication |