Cargando…
Towards Convolutional Neural Network Acceleration and Compression Based on Simon k-Means
Convolutional Neural Networks (CNNs) are popular models that are widely used in image classification, target recognition, and other fields. Model compression is a common step in transplanting neural networks into embedded devices, and it is often used in the retraining stage. However, it requires a...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9185260/ https://www.ncbi.nlm.nih.gov/pubmed/35684919 http://dx.doi.org/10.3390/s22114298 |
_version_ | 1784724680311046144 |
---|---|
author | Wei, Mingjie Zhao, Yunping Chen, Xiaowen Li, Chen Lu, Jianzhuang |
author_facet | Wei, Mingjie Zhao, Yunping Chen, Xiaowen Li, Chen Lu, Jianzhuang |
author_sort | Wei, Mingjie |
collection | PubMed |
description | Convolutional Neural Networks (CNNs) are popular models that are widely used in image classification, target recognition, and other fields. Model compression is a common step in transplanting neural networks into embedded devices, and it is often used in the retraining stage. However, it requires a high expenditure of time by retraining weight data to atone for the loss of precision. Unlike in prior designs, we propose a novel model compression approach based on Simon k-means, which is specifically designed to support a hardware acceleration scheme. First, we propose an extension algorithm named Simon k-means based on simple k-means. We use Simon k-means to cluster trained weights in convolutional layers and fully connected layers. Second, we reduce the consumption of hardware resources in data movement and storage by using a data storage and index approach. Finally, we provide the hardware implementation of the compressed CNN accelerator. Our evaluations on several classifications show that our design can achieve 5.27× compression and reduce 74.3% of the multiply–accumulate (MAC) operations in AlexNet on the FASHION-MNIST dataset. |
format | Online Article Text |
id | pubmed-9185260 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91852602022-06-11 Towards Convolutional Neural Network Acceleration and Compression Based on Simon k-Means Wei, Mingjie Zhao, Yunping Chen, Xiaowen Li, Chen Lu, Jianzhuang Sensors (Basel) Article Convolutional Neural Networks (CNNs) are popular models that are widely used in image classification, target recognition, and other fields. Model compression is a common step in transplanting neural networks into embedded devices, and it is often used in the retraining stage. However, it requires a high expenditure of time by retraining weight data to atone for the loss of precision. Unlike in prior designs, we propose a novel model compression approach based on Simon k-means, which is specifically designed to support a hardware acceleration scheme. First, we propose an extension algorithm named Simon k-means based on simple k-means. We use Simon k-means to cluster trained weights in convolutional layers and fully connected layers. Second, we reduce the consumption of hardware resources in data movement and storage by using a data storage and index approach. Finally, we provide the hardware implementation of the compressed CNN accelerator. Our evaluations on several classifications show that our design can achieve 5.27× compression and reduce 74.3% of the multiply–accumulate (MAC) operations in AlexNet on the FASHION-MNIST dataset. MDPI 2022-06-06 /pmc/articles/PMC9185260/ /pubmed/35684919 http://dx.doi.org/10.3390/s22114298 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wei, Mingjie Zhao, Yunping Chen, Xiaowen Li, Chen Lu, Jianzhuang Towards Convolutional Neural Network Acceleration and Compression Based on Simon k-Means |
title | Towards Convolutional Neural Network Acceleration and Compression Based on Simon
k-Means |
title_full | Towards Convolutional Neural Network Acceleration and Compression Based on Simon
k-Means |
title_fullStr | Towards Convolutional Neural Network Acceleration and Compression Based on Simon
k-Means |
title_full_unstemmed | Towards Convolutional Neural Network Acceleration and Compression Based on Simon
k-Means |
title_short | Towards Convolutional Neural Network Acceleration and Compression Based on Simon
k-Means |
title_sort | towards convolutional neural network acceleration and compression based on simon
k-means |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9185260/ https://www.ncbi.nlm.nih.gov/pubmed/35684919 http://dx.doi.org/10.3390/s22114298 |
work_keys_str_mv | AT weimingjie towardsconvolutionalneuralnetworkaccelerationandcompressionbasedonsimonkmeans AT zhaoyunping towardsconvolutionalneuralnetworkaccelerationandcompressionbasedonsimonkmeans AT chenxiaowen towardsconvolutionalneuralnetworkaccelerationandcompressionbasedonsimonkmeans AT lichen towardsconvolutionalneuralnetworkaccelerationandcompressionbasedonsimonkmeans AT lujianzhuang towardsconvolutionalneuralnetworkaccelerationandcompressionbasedonsimonkmeans |