Cargando…
SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme
Deep neural networks have been deployed in various hardware accelerators, such as graph process units (GPUs), field-program gate arrays (FPGAs), and application specific integrated circuit (ASIC) chips. Normally, a huge amount of computation is required in the inference process, creating significant...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658340/ https://www.ncbi.nlm.nih.gov/pubmed/36366242 http://dx.doi.org/10.3390/s22218545 |
_version_ | 1784829927032356864 |
---|---|
author | Chang, Liang Zhao, Xin Zhou, Jun |
author_facet | Chang, Liang Zhao, Xin Zhou, Jun |
author_sort | Chang, Liang |
collection | PubMed |
description | Deep neural networks have been deployed in various hardware accelerators, such as graph process units (GPUs), field-program gate arrays (FPGAs), and application specific integrated circuit (ASIC) chips. Normally, a huge amount of computation is required in the inference process, creating significant logic resource overheads. In addition, frequent data accessions between off-chip memory and hardware accelerators create bottlenecks, leading to decline in hardware efficiency. Many solutions have been proposed to reduce hardware overhead and data movements. For example, specific lookup-table (LUT)-based hardware architecture can be used to mitigate computing operation demands. However, typical LUT-based accelerators are affected by computational precision limitation and poor scalability issues. In this paper, we propose a search-based computing scheme based on an LUT solution, which improves computation efficiency by replacing traditional multiplication with a search operation. In addition, the proposed scheme supports different precision multiple-bit widths to meet the needs of different DNN-based applications. We design a reconfigurable computing strategy, which can efficiently adapt to the convolution of different kernel sizes to improve hardware scalability. We implement a search-based architecture, namely SCA, which adopts an on-chip storage mechanism, thus greatly reducing interactions with off-chip memory and alleviating bandwidth pressure. Based on experimental evaluation, the proposed SCA architecture can achieve 92%, 96% and 98% computational utilization for computational precision of 4 bit, 8 bit and 16 bit, respectively. Compared with state-of-the-art LUT-based architecture, the efficiency can be improved four-fold. |
format | Online Article Text |
id | pubmed-9658340 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96583402022-11-15 SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme Chang, Liang Zhao, Xin Zhou, Jun Sensors (Basel) Article Deep neural networks have been deployed in various hardware accelerators, such as graph process units (GPUs), field-program gate arrays (FPGAs), and application specific integrated circuit (ASIC) chips. Normally, a huge amount of computation is required in the inference process, creating significant logic resource overheads. In addition, frequent data accessions between off-chip memory and hardware accelerators create bottlenecks, leading to decline in hardware efficiency. Many solutions have been proposed to reduce hardware overhead and data movements. For example, specific lookup-table (LUT)-based hardware architecture can be used to mitigate computing operation demands. However, typical LUT-based accelerators are affected by computational precision limitation and poor scalability issues. In this paper, we propose a search-based computing scheme based on an LUT solution, which improves computation efficiency by replacing traditional multiplication with a search operation. In addition, the proposed scheme supports different precision multiple-bit widths to meet the needs of different DNN-based applications. We design a reconfigurable computing strategy, which can efficiently adapt to the convolution of different kernel sizes to improve hardware scalability. We implement a search-based architecture, namely SCA, which adopts an on-chip storage mechanism, thus greatly reducing interactions with off-chip memory and alleviating bandwidth pressure. Based on experimental evaluation, the proposed SCA architecture can achieve 92%, 96% and 98% computational utilization for computational precision of 4 bit, 8 bit and 16 bit, respectively. Compared with state-of-the-art LUT-based architecture, the efficiency can be improved four-fold. MDPI 2022-11-06 /pmc/articles/PMC9658340/ /pubmed/36366242 http://dx.doi.org/10.3390/s22218545 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Chang, Liang Zhao, Xin Zhou, Jun SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme |
title | SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme |
title_full | SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme |
title_fullStr | SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme |
title_full_unstemmed | SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme |
title_short | SCA: Search-Based Computing Hardware Architecture with Precision Scalable and Computation Reconfigurable Scheme |
title_sort | sca: search-based computing hardware architecture with precision scalable and computation reconfigurable scheme |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658340/ https://www.ncbi.nlm.nih.gov/pubmed/36366242 http://dx.doi.org/10.3390/s22218545 |
work_keys_str_mv | AT changliang scasearchbasedcomputinghardwarearchitecturewithprecisionscalableandcomputationreconfigurablescheme AT zhaoxin scasearchbasedcomputinghardwarearchitecturewithprecisionscalableandcomputationreconfigurablescheme AT zhoujun scasearchbasedcomputinghardwarearchitecturewithprecisionscalableandcomputationreconfigurablescheme |