Cargando…

Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems

Emerging resistive random-access memory (ReRAM) has demonstrated great potential in the achievement of the in-memory computing paradigm to overcome the well-known “memory wall” in current von Neumann architecture. The ReRAM crossbar array (RCA) is a promising circuit structure to accelerate the vita...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Chenglong, Xu, Nuo, Wang, Wenqing, Hu, Yihong, Fang, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9143747/
https://www.ncbi.nlm.nih.gov/pubmed/35630134
http://dx.doi.org/10.3390/mi13050667
_version_ 1784715881734995968
author Huang, Chenglong
Xu, Nuo
Wang, Wenqing
Hu, Yihong
Fang, Liang
author_facet Huang, Chenglong
Xu, Nuo
Wang, Wenqing
Hu, Yihong
Fang, Liang
author_sort Huang, Chenglong
collection PubMed
description Emerging resistive random-access memory (ReRAM) has demonstrated great potential in the achievement of the in-memory computing paradigm to overcome the well-known “memory wall” in current von Neumann architecture. The ReRAM crossbar array (RCA) is a promising circuit structure to accelerate the vital multiplication-and-accumulation (MAC) operations in deep neural networks (DNN). However, due to the nonlinear distribution of conductance levels in ReRAM, a large deviation exists in the mapping process when the trained weights that are quantized by linear relationships are directly mapped to the nonlinear conductance values from the realistic ReRAM device. This deviation degrades the inference accuracy of the RCA-based DNN. In this paper, we propose a minimum error substitution based on a conductance-aware quantization method to eliminate the deviation in the mapping process from the weights to the actual conductance values. The method is suitable for multiple ReRAM devices with different non-linear conductance distribution and is also immune to the device variation. The simulation results on LeNet5, AlexNet and VGG16 demonstrate that this method can vastly rescue the accuracy degradation from the non-linear resistance distribution of ReRAM devices compared to the linear quantization method.
format Online
Article
Text
id pubmed-9143747
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91437472022-05-29 Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems Huang, Chenglong Xu, Nuo Wang, Wenqing Hu, Yihong Fang, Liang Micromachines (Basel) Article Emerging resistive random-access memory (ReRAM) has demonstrated great potential in the achievement of the in-memory computing paradigm to overcome the well-known “memory wall” in current von Neumann architecture. The ReRAM crossbar array (RCA) is a promising circuit structure to accelerate the vital multiplication-and-accumulation (MAC) operations in deep neural networks (DNN). However, due to the nonlinear distribution of conductance levels in ReRAM, a large deviation exists in the mapping process when the trained weights that are quantized by linear relationships are directly mapped to the nonlinear conductance values from the realistic ReRAM device. This deviation degrades the inference accuracy of the RCA-based DNN. In this paper, we propose a minimum error substitution based on a conductance-aware quantization method to eliminate the deviation in the mapping process from the weights to the actual conductance values. The method is suitable for multiple ReRAM devices with different non-linear conductance distribution and is also immune to the device variation. The simulation results on LeNet5, AlexNet and VGG16 demonstrate that this method can vastly rescue the accuracy degradation from the non-linear resistance distribution of ReRAM devices compared to the linear quantization method. MDPI 2022-04-24 /pmc/articles/PMC9143747/ /pubmed/35630134 http://dx.doi.org/10.3390/mi13050667 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Huang, Chenglong
Xu, Nuo
Wang, Wenqing
Hu, Yihong
Fang, Liang
Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems
title Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems
title_full Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems
title_fullStr Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems
title_full_unstemmed Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems
title_short Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems
title_sort conductance-aware quantization based on minimum error substitution for non-linear-conductance-state tolerance in neural computing systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9143747/
https://www.ncbi.nlm.nih.gov/pubmed/35630134
http://dx.doi.org/10.3390/mi13050667
work_keys_str_mv AT huangchenglong conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems
AT xunuo conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems
AT wangwenqing conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems
AT huyihong conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems
AT fangliang conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems