Cargando…

Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset?

Driven by the need for the compression of weights in neural networks (NNs), which is especially beneficial for edge devices with a constrained resource, and by the need to utilize the simplest possible quantization model, in this paper, we study the performance of three-bit post-training uniform qua...

Descripción completa

Detalles Bibliográficos
Autores principales: Nikolić, Jelena, Perić, Zoran, Aleksić, Danijela, Tomić, Stefan, Jovanović, Aleksandra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700806/
https://www.ncbi.nlm.nih.gov/pubmed/34946005
http://dx.doi.org/10.3390/e23121699
_version_ 1784620846783922176
author Nikolić, Jelena
Perić, Zoran
Aleksić, Danijela
Tomić, Stefan
Jovanović, Aleksandra
author_facet Nikolić, Jelena
Perić, Zoran
Aleksić, Danijela
Tomić, Stefan
Jovanović, Aleksandra
author_sort Nikolić, Jelena
collection PubMed
description Driven by the need for the compression of weights in neural networks (NNs), which is especially beneficial for edge devices with a constrained resource, and by the need to utilize the simplest possible quantization model, in this paper, we study the performance of three-bit post-training uniform quantization. The goal is to put various choices of the key parameter of the quantizer in question (support region threshold) in one place and provide a detailed overview of this choice’s impact on the performance of post-training quantization for the MNIST dataset. Specifically, we analyze whether it is possible to preserve the accuracy of the two NN models (MLP and CNN) to a great extent with the very simple three-bit uniform quantizer, regardless of the choice of the key parameter. Moreover, our goal is to answer the question of whether it is of the utmost importance in post-training three-bit uniform quantization, as it is in quantization, to determine the optimal support region threshold value of the quantizer to achieve some predefined accuracy of the quantized neural network (QNN). The results show that the choice of the support region threshold value of the three-bit uniform quantizer does not have such a strong impact on the accuracy of the QNNs, which is not the case with two-bit uniform post-training quantization, when applied in MLP for the same classification task. Accordingly, one can anticipate that due to this special property, the post-training quantization model in question can be greatly exploited.
format Online
Article
Text
id pubmed-8700806
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87008062021-12-24 Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset? Nikolić, Jelena Perić, Zoran Aleksić, Danijela Tomić, Stefan Jovanović, Aleksandra Entropy (Basel) Article Driven by the need for the compression of weights in neural networks (NNs), which is especially beneficial for edge devices with a constrained resource, and by the need to utilize the simplest possible quantization model, in this paper, we study the performance of three-bit post-training uniform quantization. The goal is to put various choices of the key parameter of the quantizer in question (support region threshold) in one place and provide a detailed overview of this choice’s impact on the performance of post-training quantization for the MNIST dataset. Specifically, we analyze whether it is possible to preserve the accuracy of the two NN models (MLP and CNN) to a great extent with the very simple three-bit uniform quantizer, regardless of the choice of the key parameter. Moreover, our goal is to answer the question of whether it is of the utmost importance in post-training three-bit uniform quantization, as it is in quantization, to determine the optimal support region threshold value of the quantizer to achieve some predefined accuracy of the quantized neural network (QNN). The results show that the choice of the support region threshold value of the three-bit uniform quantizer does not have such a strong impact on the accuracy of the QNNs, which is not the case with two-bit uniform post-training quantization, when applied in MLP for the same classification task. Accordingly, one can anticipate that due to this special property, the post-training quantization model in question can be greatly exploited. MDPI 2021-12-20 /pmc/articles/PMC8700806/ /pubmed/34946005 http://dx.doi.org/10.3390/e23121699 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Nikolić, Jelena
Perić, Zoran
Aleksić, Danijela
Tomić, Stefan
Jovanović, Aleksandra
Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset?
title Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset?
title_full Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset?
title_fullStr Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset?
title_full_unstemmed Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset?
title_short Whether the Support Region of Three-Bit Uniform Quantizer Has a Strong Impact on Post-Training Quantization for MNIST Dataset?
title_sort whether the support region of three-bit uniform quantizer has a strong impact on post-training quantization for mnist dataset?
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700806/
https://www.ncbi.nlm.nih.gov/pubmed/34946005
http://dx.doi.org/10.3390/e23121699
work_keys_str_mv AT nikolicjelena whetherthesupportregionofthreebituniformquantizerhasastrongimpactonposttrainingquantizationformnistdataset
AT periczoran whetherthesupportregionofthreebituniformquantizerhasastrongimpactonposttrainingquantizationformnistdataset
AT aleksicdanijela whetherthesupportregionofthreebituniformquantizerhasastrongimpactonposttrainingquantizationformnistdataset
AT tomicstefan whetherthesupportregionofthreebituniformquantizerhasastrongimpactonposttrainingquantizationformnistdataset
AT jovanovicaleksandra whetherthesupportregionofthreebituniformquantizerhasastrongimpactonposttrainingquantizationformnistdataset