Cargando…
Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems
Emerging resistive random-access memory (ReRAM) has demonstrated great potential in the achievement of the in-memory computing paradigm to overcome the well-known “memory wall” in current von Neumann architecture. The ReRAM crossbar array (RCA) is a promising circuit structure to accelerate the vita...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9143747/ https://www.ncbi.nlm.nih.gov/pubmed/35630134 http://dx.doi.org/10.3390/mi13050667 |
_version_ | 1784715881734995968 |
---|---|
author | Huang, Chenglong Xu, Nuo Wang, Wenqing Hu, Yihong Fang, Liang |
author_facet | Huang, Chenglong Xu, Nuo Wang, Wenqing Hu, Yihong Fang, Liang |
author_sort | Huang, Chenglong |
collection | PubMed |
description | Emerging resistive random-access memory (ReRAM) has demonstrated great potential in the achievement of the in-memory computing paradigm to overcome the well-known “memory wall” in current von Neumann architecture. The ReRAM crossbar array (RCA) is a promising circuit structure to accelerate the vital multiplication-and-accumulation (MAC) operations in deep neural networks (DNN). However, due to the nonlinear distribution of conductance levels in ReRAM, a large deviation exists in the mapping process when the trained weights that are quantized by linear relationships are directly mapped to the nonlinear conductance values from the realistic ReRAM device. This deviation degrades the inference accuracy of the RCA-based DNN. In this paper, we propose a minimum error substitution based on a conductance-aware quantization method to eliminate the deviation in the mapping process from the weights to the actual conductance values. The method is suitable for multiple ReRAM devices with different non-linear conductance distribution and is also immune to the device variation. The simulation results on LeNet5, AlexNet and VGG16 demonstrate that this method can vastly rescue the accuracy degradation from the non-linear resistance distribution of ReRAM devices compared to the linear quantization method. |
format | Online Article Text |
id | pubmed-9143747 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91437472022-05-29 Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems Huang, Chenglong Xu, Nuo Wang, Wenqing Hu, Yihong Fang, Liang Micromachines (Basel) Article Emerging resistive random-access memory (ReRAM) has demonstrated great potential in the achievement of the in-memory computing paradigm to overcome the well-known “memory wall” in current von Neumann architecture. The ReRAM crossbar array (RCA) is a promising circuit structure to accelerate the vital multiplication-and-accumulation (MAC) operations in deep neural networks (DNN). However, due to the nonlinear distribution of conductance levels in ReRAM, a large deviation exists in the mapping process when the trained weights that are quantized by linear relationships are directly mapped to the nonlinear conductance values from the realistic ReRAM device. This deviation degrades the inference accuracy of the RCA-based DNN. In this paper, we propose a minimum error substitution based on a conductance-aware quantization method to eliminate the deviation in the mapping process from the weights to the actual conductance values. The method is suitable for multiple ReRAM devices with different non-linear conductance distribution and is also immune to the device variation. The simulation results on LeNet5, AlexNet and VGG16 demonstrate that this method can vastly rescue the accuracy degradation from the non-linear resistance distribution of ReRAM devices compared to the linear quantization method. MDPI 2022-04-24 /pmc/articles/PMC9143747/ /pubmed/35630134 http://dx.doi.org/10.3390/mi13050667 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Huang, Chenglong Xu, Nuo Wang, Wenqing Hu, Yihong Fang, Liang Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems |
title | Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems |
title_full | Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems |
title_fullStr | Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems |
title_full_unstemmed | Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems |
title_short | Conductance-Aware Quantization Based on Minimum Error Substitution for Non-Linear-Conductance-State Tolerance in Neural Computing Systems |
title_sort | conductance-aware quantization based on minimum error substitution for non-linear-conductance-state tolerance in neural computing systems |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9143747/ https://www.ncbi.nlm.nih.gov/pubmed/35630134 http://dx.doi.org/10.3390/mi13050667 |
work_keys_str_mv | AT huangchenglong conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems AT xunuo conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems AT wangwenqing conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems AT huyihong conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems AT fangliang conductanceawarequantizationbasedonminimumerrorsubstitutionfornonlinearconductancestatetoleranceinneuralcomputingsystems |