Cargando…

HMC: Hybrid model compression method based on layer sensitivity grouping

Previous studies have shown that deep models are often over-parameterized, and this parameter redundancy makes deep compression possible. The redundancy of model weight is often manifested as low rank and sparsity. Ignoring any part of the two or the different distributions of these two characterist...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Guoliang, Yu, Shuaiying, Yang, Hao, Nie, Ziling, Wang, Jixiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10561844/
https://www.ncbi.nlm.nih.gov/pubmed/37812605
http://dx.doi.org/10.1371/journal.pone.0292517
_version_ 1785118006570909696
author Yang, Guoliang
Yu, Shuaiying
Yang, Hao
Nie, Ziling
Wang, Jixiang
author_facet Yang, Guoliang
Yu, Shuaiying
Yang, Hao
Nie, Ziling
Wang, Jixiang
author_sort Yang, Guoliang
collection PubMed
description Previous studies have shown that deep models are often over-parameterized, and this parameter redundancy makes deep compression possible. The redundancy of model weight is often manifested as low rank and sparsity. Ignoring any part of the two or the different distributions of these two characteristics in the model will lead to low accuracy and a low compression rate of deep compression. To make full use of the difference between low-rank and sparsity, a unified framework combining low-rank tensor decomposition and structured pruning is proposed: a hybrid model compression method based on sensitivity grouping (HMC). This framework unifies the existing additive hybrid compression method (AHC) and the non-additive hybrid compression method (NaHC) proposed by us into one model. The latter group the network according to the sensitivity difference of the convolutional layer to different compression methods, which can better integrate the low rank and sparsity of the model compared with the former. Experiments show that our approach achieves a better trade-off between test accuracy and compression ratio when compressing the ResNet family of models than other recent compression methods using a single strategy or additive hybrid compression.
format Online
Article
Text
id pubmed-10561844
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105618442023-10-10 HMC: Hybrid model compression method based on layer sensitivity grouping Yang, Guoliang Yu, Shuaiying Yang, Hao Nie, Ziling Wang, Jixiang PLoS One Research Article Previous studies have shown that deep models are often over-parameterized, and this parameter redundancy makes deep compression possible. The redundancy of model weight is often manifested as low rank and sparsity. Ignoring any part of the two or the different distributions of these two characteristics in the model will lead to low accuracy and a low compression rate of deep compression. To make full use of the difference between low-rank and sparsity, a unified framework combining low-rank tensor decomposition and structured pruning is proposed: a hybrid model compression method based on sensitivity grouping (HMC). This framework unifies the existing additive hybrid compression method (AHC) and the non-additive hybrid compression method (NaHC) proposed by us into one model. The latter group the network according to the sensitivity difference of the convolutional layer to different compression methods, which can better integrate the low rank and sparsity of the model compared with the former. Experiments show that our approach achieves a better trade-off between test accuracy and compression ratio when compressing the ResNet family of models than other recent compression methods using a single strategy or additive hybrid compression. Public Library of Science 2023-10-09 /pmc/articles/PMC10561844/ /pubmed/37812605 http://dx.doi.org/10.1371/journal.pone.0292517 Text en © 2023 Yang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Yang, Guoliang
Yu, Shuaiying
Yang, Hao
Nie, Ziling
Wang, Jixiang
HMC: Hybrid model compression method based on layer sensitivity grouping
title HMC: Hybrid model compression method based on layer sensitivity grouping
title_full HMC: Hybrid model compression method based on layer sensitivity grouping
title_fullStr HMC: Hybrid model compression method based on layer sensitivity grouping
title_full_unstemmed HMC: Hybrid model compression method based on layer sensitivity grouping
title_short HMC: Hybrid model compression method based on layer sensitivity grouping
title_sort hmc: hybrid model compression method based on layer sensitivity grouping
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10561844/
https://www.ncbi.nlm.nih.gov/pubmed/37812605
http://dx.doi.org/10.1371/journal.pone.0292517
work_keys_str_mv AT yangguoliang hmchybridmodelcompressionmethodbasedonlayersensitivitygrouping
AT yushuaiying hmchybridmodelcompressionmethodbasedonlayersensitivitygrouping
AT yanghao hmchybridmodelcompressionmethodbasedonlayersensitivitygrouping
AT nieziling hmchybridmodelcompressionmethodbasedonlayersensitivitygrouping
AT wangjixiang hmchybridmodelcompressionmethodbasedonlayersensitivitygrouping