Cargando…
Design of a 2-Bit Neural Network Quantizer for Laplacian Source
Achieving real-time inference is one of the major issues in contemporary neural network applications, as complex algorithms are frequently being deployed to mobile devices that have constrained storage and computing power. Moving from a full-precision neural network model to a lower representation b...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8393619/ https://www.ncbi.nlm.nih.gov/pubmed/34441074 http://dx.doi.org/10.3390/e23080933 |
_version_ | 1783743767002480640 |
---|---|
author | Perić, Zoran Savić, Milan Simić, Nikola Denić, Bojan Despotović, Vladimir |
author_facet | Perić, Zoran Savić, Milan Simić, Nikola Denić, Bojan Despotović, Vladimir |
author_sort | Perić, Zoran |
collection | PubMed |
description | Achieving real-time inference is one of the major issues in contemporary neural network applications, as complex algorithms are frequently being deployed to mobile devices that have constrained storage and computing power. Moving from a full-precision neural network model to a lower representation by applying quantization techniques is a popular approach to facilitate this issue. Here, we analyze in detail and design a 2-bit uniform quantization model for Laplacian source due to its significance in terms of implementation simplicity, which further leads to a shorter processing time and faster inference. The results show that it is possible to achieve high classification accuracy (more than 96% in the case of MLP and more than 98% in the case of CNN) by implementing the proposed model, which is competitive to the performance of the other quantization solutions with almost optimal precision. |
format | Online Article Text |
id | pubmed-8393619 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83936192021-08-28 Design of a 2-Bit Neural Network Quantizer for Laplacian Source Perić, Zoran Savić, Milan Simić, Nikola Denić, Bojan Despotović, Vladimir Entropy (Basel) Article Achieving real-time inference is one of the major issues in contemporary neural network applications, as complex algorithms are frequently being deployed to mobile devices that have constrained storage and computing power. Moving from a full-precision neural network model to a lower representation by applying quantization techniques is a popular approach to facilitate this issue. Here, we analyze in detail and design a 2-bit uniform quantization model for Laplacian source due to its significance in terms of implementation simplicity, which further leads to a shorter processing time and faster inference. The results show that it is possible to achieve high classification accuracy (more than 96% in the case of MLP and more than 98% in the case of CNN) by implementing the proposed model, which is competitive to the performance of the other quantization solutions with almost optimal precision. MDPI 2021-07-22 /pmc/articles/PMC8393619/ /pubmed/34441074 http://dx.doi.org/10.3390/e23080933 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Perić, Zoran Savić, Milan Simić, Nikola Denić, Bojan Despotović, Vladimir Design of a 2-Bit Neural Network Quantizer for Laplacian Source |
title | Design of a 2-Bit Neural Network Quantizer for Laplacian Source |
title_full | Design of a 2-Bit Neural Network Quantizer for Laplacian Source |
title_fullStr | Design of a 2-Bit Neural Network Quantizer for Laplacian Source |
title_full_unstemmed | Design of a 2-Bit Neural Network Quantizer for Laplacian Source |
title_short | Design of a 2-Bit Neural Network Quantizer for Laplacian Source |
title_sort | design of a 2-bit neural network quantizer for laplacian source |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8393619/ https://www.ncbi.nlm.nih.gov/pubmed/34441074 http://dx.doi.org/10.3390/e23080933 |
work_keys_str_mv | AT periczoran designofa2bitneuralnetworkquantizerforlaplaciansource AT savicmilan designofa2bitneuralnetworkquantizerforlaplaciansource AT simicnikola designofa2bitneuralnetworkquantizerforlaplaciansource AT denicbojan designofa2bitneuralnetworkquantizerforlaplaciansource AT despotovicvladimir designofa2bitneuralnetworkquantizerforlaplaciansource |