Cargando…

Design of a 2-Bit Neural Network Quantizer for Laplacian Source

Achieving real-time inference is one of the major issues in contemporary neural network applications, as complex algorithms are frequently being deployed to mobile devices that have constrained storage and computing power. Moving from a full-precision neural network model to a lower representation b...

Descripción completa

Detalles Bibliográficos
Autores principales: Perić, Zoran, Savić, Milan, Simić, Nikola, Denić, Bojan, Despotović, Vladimir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8393619/
https://www.ncbi.nlm.nih.gov/pubmed/34441074
http://dx.doi.org/10.3390/e23080933
_version_ 1783743767002480640
author Perić, Zoran
Savić, Milan
Simić, Nikola
Denić, Bojan
Despotović, Vladimir
author_facet Perić, Zoran
Savić, Milan
Simić, Nikola
Denić, Bojan
Despotović, Vladimir
author_sort Perić, Zoran
collection PubMed
description Achieving real-time inference is one of the major issues in contemporary neural network applications, as complex algorithms are frequently being deployed to mobile devices that have constrained storage and computing power. Moving from a full-precision neural network model to a lower representation by applying quantization techniques is a popular approach to facilitate this issue. Here, we analyze in detail and design a 2-bit uniform quantization model for Laplacian source due to its significance in terms of implementation simplicity, which further leads to a shorter processing time and faster inference. The results show that it is possible to achieve high classification accuracy (more than 96% in the case of MLP and more than 98% in the case of CNN) by implementing the proposed model, which is competitive to the performance of the other quantization solutions with almost optimal precision.
format Online
Article
Text
id pubmed-8393619
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83936192021-08-28 Design of a 2-Bit Neural Network Quantizer for Laplacian Source Perić, Zoran Savić, Milan Simić, Nikola Denić, Bojan Despotović, Vladimir Entropy (Basel) Article Achieving real-time inference is one of the major issues in contemporary neural network applications, as complex algorithms are frequently being deployed to mobile devices that have constrained storage and computing power. Moving from a full-precision neural network model to a lower representation by applying quantization techniques is a popular approach to facilitate this issue. Here, we analyze in detail and design a 2-bit uniform quantization model for Laplacian source due to its significance in terms of implementation simplicity, which further leads to a shorter processing time and faster inference. The results show that it is possible to achieve high classification accuracy (more than 96% in the case of MLP and more than 98% in the case of CNN) by implementing the proposed model, which is competitive to the performance of the other quantization solutions with almost optimal precision. MDPI 2021-07-22 /pmc/articles/PMC8393619/ /pubmed/34441074 http://dx.doi.org/10.3390/e23080933 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Perić, Zoran
Savić, Milan
Simić, Nikola
Denić, Bojan
Despotović, Vladimir
Design of a 2-Bit Neural Network Quantizer for Laplacian Source
title Design of a 2-Bit Neural Network Quantizer for Laplacian Source
title_full Design of a 2-Bit Neural Network Quantizer for Laplacian Source
title_fullStr Design of a 2-Bit Neural Network Quantizer for Laplacian Source
title_full_unstemmed Design of a 2-Bit Neural Network Quantizer for Laplacian Source
title_short Design of a 2-Bit Neural Network Quantizer for Laplacian Source
title_sort design of a 2-bit neural network quantizer for laplacian source
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8393619/
https://www.ncbi.nlm.nih.gov/pubmed/34441074
http://dx.doi.org/10.3390/e23080933
work_keys_str_mv AT periczoran designofa2bitneuralnetworkquantizerforlaplaciansource
AT savicmilan designofa2bitneuralnetworkquantizerforlaplaciansource
AT simicnikola designofa2bitneuralnetworkquantizerforlaplaciansource
AT denicbojan designofa2bitneuralnetworkquantizerforlaplaciansource
AT despotovicvladimir designofa2bitneuralnetworkquantizerforlaplaciansource