Cargando…

A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application

Nowadays, there is a tradeoff between the deep-learning module-compression ratio and the module accuracy. In this paper, a strategy for refining the pruning quantification and weights based on neural network filters is proposed. Firstly, filters in the neural network were refined into strip-like fil...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Ming, Tong, Xindi, Wu, Weixian, Wang, Zhen, Zhou, Bingxue, Huang, Xiaodan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371170/
https://www.ncbi.nlm.nih.gov/pubmed/35957176
http://dx.doi.org/10.3390/s22155623
_version_ 1784767054624063488
author Zhao, Ming
Tong, Xindi
Wu, Weixian
Wang, Zhen
Zhou, Bingxue
Huang, Xiaodan
author_facet Zhao, Ming
Tong, Xindi
Wu, Weixian
Wang, Zhen
Zhou, Bingxue
Huang, Xiaodan
author_sort Zhao, Ming
collection PubMed
description Nowadays, there is a tradeoff between the deep-learning module-compression ratio and the module accuracy. In this paper, a strategy for refining the pruning quantification and weights based on neural network filters is proposed. Firstly, filters in the neural network were refined into strip-like filter strips. Then, the evaluation of the filter strips was used to refine the partial importance of the filter, cut off the unimportant filter strips and reorganize the remaining filter strips. Finally, the training of the neural network after recombination was quantified to further compress the computational amount of the neural network. The results show that the method can significantly reduce the computational effort of the neural network and compress the number of parameters in the model. Based on experimental results on ResNet56, this method can reduce the number of parameters to 1/4 and the amount of calculation to 1/5, and the loss of model accuracy is only 0.01. On VGG16, the number of parameters is reduced to 1/14, the amount of calculation is reduced to 1/3, and the accuracy loss is 0.5%.
format Online
Article
Text
id pubmed-9371170
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93711702022-08-12 A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application Zhao, Ming Tong, Xindi Wu, Weixian Wang, Zhen Zhou, Bingxue Huang, Xiaodan Sensors (Basel) Article Nowadays, there is a tradeoff between the deep-learning module-compression ratio and the module accuracy. In this paper, a strategy for refining the pruning quantification and weights based on neural network filters is proposed. Firstly, filters in the neural network were refined into strip-like filter strips. Then, the evaluation of the filter strips was used to refine the partial importance of the filter, cut off the unimportant filter strips and reorganize the remaining filter strips. Finally, the training of the neural network after recombination was quantified to further compress the computational amount of the neural network. The results show that the method can significantly reduce the computational effort of the neural network and compress the number of parameters in the model. Based on experimental results on ResNet56, this method can reduce the number of parameters to 1/4 and the amount of calculation to 1/5, and the loss of model accuracy is only 0.01. On VGG16, the number of parameters is reduced to 1/14, the amount of calculation is reduced to 1/3, and the accuracy loss is 0.5%. MDPI 2022-07-27 /pmc/articles/PMC9371170/ /pubmed/35957176 http://dx.doi.org/10.3390/s22155623 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhao, Ming
Tong, Xindi
Wu, Weixian
Wang, Zhen
Zhou, Bingxue
Huang, Xiaodan
A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application
title A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application
title_full A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application
title_fullStr A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application
title_full_unstemmed A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application
title_short A Novel Deep-Learning Model Compression Based on Filter-Stripe Group Pruning and Its IoT Application
title_sort novel deep-learning model compression based on filter-stripe group pruning and its iot application
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371170/
https://www.ncbi.nlm.nih.gov/pubmed/35957176
http://dx.doi.org/10.3390/s22155623
work_keys_str_mv AT zhaoming anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT tongxindi anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT wuweixian anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT wangzhen anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT zhoubingxue anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT huangxiaodan anoveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT zhaoming noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT tongxindi noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT wuweixian noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT wangzhen noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT zhoubingxue noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication
AT huangxiaodan noveldeeplearningmodelcompressionbasedonfilterstripegrouppruninganditsiotapplication