Cargando…

SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme

Deep neural networks have been deployed in various hardware accelerators, such as graph process units (GPUs), field-program gate arrays (FPGAs), and application specific integrated circuit (ASIC) chips. Normally, a huge amount of computation is required in the inference process, creating significant...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Liang, Zhao, Xin, Zhou, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658340/
https://www.ncbi.nlm.nih.gov/pubmed/36366242
http://dx.doi.org/10.3390/s22218545
_version_ 1784829927032356864
author Chang, Liang
Zhao, Xin
Zhou, Jun
author_facet Chang, Liang
Zhao, Xin
Zhou, Jun
author_sort Chang, Liang
collection PubMed
description Deep neural networks have been deployed in various hardware accelerators, such as graph process units (GPUs), field-program gate arrays (FPGAs), and application specific integrated circuit (ASIC) chips. Normally, a huge amount of computation is required in the inference process, creating significant logic resource overheads. In addition, frequent data accessions between off-chip memory and hardware accelerators create bottlenecks, leading to decline in hardware efficiency. Many solutions have been proposed to reduce hardware overhead and data movements. For example, specific lookup-table (LUT)-based hardware architecture can be used to mitigate computing operation demands. However, typical LUT-based accelerators are affected by computational precision limitation and poor scalability issues. In this paper, we propose a search-based computing scheme based on an LUT solution, which improves computation efficiency by replacing traditional multiplication with a search operation. In addition, the proposed scheme supports different precision multiple-bit widths to meet the needs of different DNN-based applications. We design a reconfigurable computing strategy, which can efficiently adapt to the convolution of different kernel sizes to improve hardware scalability. We implement a search-based architecture, namely SCA, which adopts an on-chip storage mechanism, thus greatly reducing interactions with off-chip memory and alleviating bandwidth pressure. Based on experimental evaluation, the proposed SCA architecture can achieve 92%, 96% and 98% computational utilization for computational precision of 4 bit, 8 bit and 16 bit, respectively. Compared with state-of-the-art LUT-based architecture, the efficiency can be improved four-fold.
format Online
Article
Text
id pubmed-9658340
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96583402022-11-15 SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme Chang, Liang Zhao, Xin Zhou, Jun Sensors (Basel) Article Deep neural networks have been deployed in various hardware accelerators, such as graph process units (GPUs), field-program gate arrays (FPGAs), and application specific integrated circuit (ASIC) chips. Normally, a huge amount of computation is required in the inference process, creating significant logic resource overheads. In addition, frequent data accessions between off-chip memory and hardware accelerators create bottlenecks, leading to decline in hardware efficiency. Many solutions have been proposed to reduce hardware overhead and data movements. For example, specific lookup-table (LUT)-based hardware architecture can be used to mitigate computing operation demands. However, typical LUT-based accelerators are affected by computational precision limitation and poor scalability issues. In this paper, we propose a search-based computing scheme based on an LUT solution, which improves computation efficiency by replacing traditional multiplication with a search operation. In addition, the proposed scheme supports different precision multiple-bit widths to meet the needs of different DNN-based applications. We design a reconfigurable computing strategy, which can efficiently adapt to the convolution of different kernel sizes to improve hardware scalability. We implement a search-based architecture, namely SCA, which adopts an on-chip storage mechanism, thus greatly reducing interactions with off-chip memory and alleviating bandwidth pressure. Based on experimental evaluation, the proposed SCA architecture can achieve 92%, 96% and 98% computational utilization for computational precision of 4 bit, 8 bit and 16 bit, respectively. Compared with state-of-the-art LUT-based architecture, the efficiency can be improved four-fold. MDPI 2022-11-06 /pmc/articles/PMC9658340/ /pubmed/36366242 http://dx.doi.org/10.3390/s22218545 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chang, Liang
Zhao, Xin
Zhou, Jun
SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme
title SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme
title_full SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme
title_fullStr SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme
title_full_unstemmed SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme
title_short SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme
title_sort sca: search-based computing hardware architecture with precision scalable and computation reconfigurable scheme
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658340/
https://www.ncbi.nlm.nih.gov/pubmed/36366242
http://dx.doi.org/10.3390/s22218545
work_keys_str_mv AT changliang scasearchbasedcomputinghardwarearchitecturewithprecisionscalableandcomputationreconfigurablescheme
AT zhaoxin scasearchbasedcomputinghardwarearchitecturewithprecisionscalableandcomputationreconfigurablescheme
AT zhoujun scasearchbasedcomputinghardwarearchitecturewithprecisionscalableandcomputationreconfigurablescheme