Cargando…
Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition
Speech recognition has progressed tremendously in the area of artificial intelligence (AI). However, the performance of the real-time offline Chinese speech recognition neural network accelerator for edge AI needs to be improved. This paper proposes a configurable convolutional neural network accele...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920974/ https://www.ncbi.nlm.nih.gov/pubmed/36772567 http://dx.doi.org/10.3390/s23031530 |
_version_ | 1784887202175516672 |
---|---|
author | Guo, Lunyi Mu, Shining Deng, Yijie Shi, Chaofan Yan, Bo Xiao, Zhuoling |
author_facet | Guo, Lunyi Mu, Shining Deng, Yijie Shi, Chaofan Yan, Bo Xiao, Zhuoling |
author_sort | Guo, Lunyi |
collection | PubMed |
description | Speech recognition has progressed tremendously in the area of artificial intelligence (AI). However, the performance of the real-time offline Chinese speech recognition neural network accelerator for edge AI needs to be improved. This paper proposes a configurable convolutional neural network accelerator based on a lightweight speech recognition model, which can dramatically reduce hardware resource consumption while guaranteeing an acceptable error rate. For convolutional layers, the weights are binarized to reduce the number of model parameters and improve computational and storage efficiency. A multichannel shared computation (MCSC) architecture is proposed to maximize the reuse of weight and feature map data. The binary weight-sharing processing engine (PE) is designed to avoid limiting the number of multipliers. A custom instruction set is established according to the variable length of voice input to configure parameters for adapting to different network structures. Finally, the ping-pong storage method is used when the feature map is an input. We implemented this accelerator on Xilinx ZYNQ XC7Z035 under the working frequency of 150 MHz. The processing time for 2.24 s and 8 s of speech was 69.8 ms and 189.51 ms, respectively, and the convolution performance reached 35.66 GOPS/W. Compared with other computing platforms, accelerators perform better in terms of energy efficiency, power consumption and hardware resource consumption. |
format | Online Article Text |
id | pubmed-9920974 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-99209742023-02-12 Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition Guo, Lunyi Mu, Shining Deng, Yijie Shi, Chaofan Yan, Bo Xiao, Zhuoling Sensors (Basel) Article Speech recognition has progressed tremendously in the area of artificial intelligence (AI). However, the performance of the real-time offline Chinese speech recognition neural network accelerator for edge AI needs to be improved. This paper proposes a configurable convolutional neural network accelerator based on a lightweight speech recognition model, which can dramatically reduce hardware resource consumption while guaranteeing an acceptable error rate. For convolutional layers, the weights are binarized to reduce the number of model parameters and improve computational and storage efficiency. A multichannel shared computation (MCSC) architecture is proposed to maximize the reuse of weight and feature map data. The binary weight-sharing processing engine (PE) is designed to avoid limiting the number of multipliers. A custom instruction set is established according to the variable length of voice input to configure parameters for adapting to different network structures. Finally, the ping-pong storage method is used when the feature map is an input. We implemented this accelerator on Xilinx ZYNQ XC7Z035 under the working frequency of 150 MHz. The processing time for 2.24 s and 8 s of speech was 69.8 ms and 189.51 ms, respectively, and the convolution performance reached 35.66 GOPS/W. Compared with other computing platforms, accelerators perform better in terms of energy efficiency, power consumption and hardware resource consumption. MDPI 2023-01-30 /pmc/articles/PMC9920974/ /pubmed/36772567 http://dx.doi.org/10.3390/s23031530 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Guo, Lunyi Mu, Shining Deng, Yijie Shi, Chaofan Yan, Bo Xiao, Zhuoling Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition |
title | Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition |
title_full | Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition |
title_fullStr | Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition |
title_full_unstemmed | Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition |
title_short | Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition |
title_sort | efficient binary weight convolutional network accelerator for speech recognition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920974/ https://www.ncbi.nlm.nih.gov/pubmed/36772567 http://dx.doi.org/10.3390/s23031530 |
work_keys_str_mv | AT guolunyi efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition AT mushining efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition AT dengyijie efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition AT shichaofan efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition AT yanbo efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition AT xiaozhuoling efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition |