Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset?
Driven by the need for the compression of weights in neural networks (NNs), which is especially beneficial for edge devices with a constrained resource, and by the need to utilize the simplest possible quantization model, in this paper, we study the performance of three-bit post-training uniform qua...
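As a rough illustration of the kind of quantizer this record describes, the sketch below applies a symmetric three-bit uniform quantizer with a chosen support region threshold to a weight vector. It is a minimal sketch under stated assumptions, not the authors' implementation: the mid-rise layout, the function name `uniform_quantize`, the parameter name `x_max`, the Gaussian stand-in weights, and the threshold values tried are all hypothetical choices made for illustration.

```python
import numpy as np


def uniform_quantize(weights, x_max, bits=3):
    """Symmetric mid-rise uniform quantizer (assumed layout).

    The support region is [-x_max, x_max], split into 2**bits equal cells.
    Each input is mapped to the midpoint of its cell; inputs outside the
    support region are mapped to the outermost level on that side.
    """
    n_levels = 2 ** bits
    step = 2.0 * x_max / n_levels
    w = np.asarray(weights, dtype=np.float64)
    # Cell index in 0..n_levels-1; clipping handles out-of-range inputs.
    idx = np.clip(np.floor((w + x_max) / step), 0, n_levels - 1)
    return -x_max + (idx + 0.5) * step


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.standard_normal(10_000)  # hypothetical stand-in for trained weights
    for x_max in (2.0, 3.0, 4.0):    # hypothetical threshold choices
        wq = uniform_quantize(w, x_max)
        mse = np.mean((w - wq) ** 2)
        sqnr_db = 10.0 * np.log10(np.mean(w ** 2) / mse)
        print(f"x_max={x_max:.1f}  SQNR={sqnr_db:.2f} dB")
```

Sweeping `x_max` in this way only measures weight distortion; judging the effect on classification accuracy would require loading the quantized weights back into the MLP or CNN and evaluating on the MNIST test set.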
Main authors: | Nikolić, Jelena, Perić, Zoran, Aleksić, Danijela, Tomić, Stefan, Jovanović, Aleksandra |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700806/ https://www.ncbi.nlm.nih.gov/pubmed/34946005 http://dx.doi.org/10.3390/e23121699 |
_version_ | 1784620846783922176 |
---|---|
author | Nikolić, Jelena Perić, Zoran Aleksić, Danijela Tomić, Stefan Jovanović, Aleksandra |
author_sort | Nikolić, Jelena |
collection | PubMed |
description | Driven by the need for the compression of weights in neural networks (NNs), which is especially beneficial for edge devices with a constrained resource, and by the need to utilize the simplest possible quantization model, in this paper, we study the performance of three-bit post-training uniform quantization. The goal is to put various choices of the key parameter of the quantizer in question (support region threshold) in one place and provide a detailed overview of this choice’s impact on the performance of post-training quantization for the MNIST dataset. Specifically, we analyze whether it is possible to preserve the accuracy of the two NN models (MLP and CNN) to a great extent with the very simple three-bit uniform quantizer, regardless of the choice of the key parameter. Moreover, our goal is to answer the question of whether it is of the utmost importance in post-training three-bit uniform quantization, as it is in quantization, to determine the optimal support region threshold value of the quantizer to achieve some predefined accuracy of the quantized neural network (QNN). The results show that the choice of the support region threshold value of the three-bit uniform quantizer does not have such a strong impact on the accuracy of the QNNs, which is not the case with two-bit uniform post-training quantization, when applied in MLP for the same classification task. Accordingly, one can anticipate that due to this special property, the post-training quantization model in question can be greatly exploited. |
format | Online Article Text |
id | pubmed-8700806 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-8700806 2021-12-24 Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset? Nikolić, Jelena Perić, Zoran Aleksić, Danijela Tomić, Stefan Jovanović, Aleksandra Entropy (Basel) Article Driven by the need for the compression of weights in neural networks (NNs), which is especially beneficial for edge devices with a constrained resource, and by the need to utilize the simplest possible quantization model, in this paper, we study the performance of three-bit post-training uniform quantization. The goal is to put various choices of the key parameter of the quantizer in question (support region threshold) in one place and provide a detailed overview of this choice’s impact on the performance of post-training quantization for the MNIST dataset. Specifically, we analyze whether it is possible to preserve the accuracy of the two NN models (MLP and CNN) to a great extent with the very simple three-bit uniform quantizer, regardless of the choice of the key parameter. Moreover, our goal is to answer the question of whether it is of the utmost importance in post-training three-bit uniform quantization, as it is in quantization, to determine the optimal support region threshold value of the quantizer to achieve some predefined accuracy of the quantized neural network (QNN). The results show that the choice of the support region threshold value of the three-bit uniform quantizer does not have such a strong impact on the accuracy of the QNNs, which is not the case with two-bit uniform post-training quantization, when applied in MLP for the same classification task. Accordingly, one can anticipate that due to this special property, the post-training quantization model in question can be greatly exploited. MDPI 2021-12-20 /pmc/articles/PMC8700806/ /pubmed/34946005 http://dx.doi.org/10.3390/e23121699 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
title | Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset? |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700806/ https://www.ncbi.nlm.nih.gov/pubmed/34946005 http://dx.doi.org/10.3390/e23121699 |