Cargando…

Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition

Speech recognition has progressed tremendously in the area of artificial intelligence (AI). However, the performance of the real-time offline Chinese speech recognition neural network accelerator for edge AI needs to be improved. This paper proposes a configurable convolutional neural network accele...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Lunyi, Mu, Shining, Deng, Yijie, Shi, Chaofan, Yan, Bo, Xiao, Zhuoling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920974/
https://www.ncbi.nlm.nih.gov/pubmed/36772567
http://dx.doi.org/10.3390/s23031530
_version_ 1784887202175516672
author Guo, Lunyi
Mu, Shining
Deng, Yijie
Shi, Chaofan
Yan, Bo
Xiao, Zhuoling
author_facet Guo, Lunyi
Mu, Shining
Deng, Yijie
Shi, Chaofan
Yan, Bo
Xiao, Zhuoling
author_sort Guo, Lunyi
collection PubMed
description Speech recognition has progressed tremendously in the area of artificial intelligence (AI). However, the performance of the real-time offline Chinese speech recognition neural network accelerator for edge AI needs to be improved. This paper proposes a configurable convolutional neural network accelerator based on a lightweight speech recognition model, which can dramatically reduce hardware resource consumption while guaranteeing an acceptable error rate. For convolutional layers, the weights are binarized to reduce the number of model parameters and improve computational and storage efficiency. A multichannel shared computation (MCSC) architecture is proposed to maximize the reuse of weight and feature map data. The binary weight-sharing processing engine (PE) is designed to avoid limiting the number of multipliers. A custom instruction set is established according to the variable length of voice input to configure parameters for adapting to different network structures. Finally, the ping-pong storage method is used when the feature map is an input. We implemented this accelerator on Xilinx ZYNQ XC7Z035 under the working frequency of 150 MHz. The processing time for 2.24 s and 8 s of speech was 69.8 ms and 189.51 ms, respectively, and the convolution performance reached 35.66 GOPS/W. Compared with other computing platforms, accelerators perform better in terms of energy efficiency, power consumption and hardware resource consumption.
format Online
Article
Text
id pubmed-9920974
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99209742023-02-12 Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition Guo, Lunyi Mu, Shining Deng, Yijie Shi, Chaofan Yan, Bo Xiao, Zhuoling Sensors (Basel) Article Speech recognition has progressed tremendously in the area of artificial intelligence (AI). However, the performance of the real-time offline Chinese speech recognition neural network accelerator for edge AI needs to be improved. This paper proposes a configurable convolutional neural network accelerator based on a lightweight speech recognition model, which can dramatically reduce hardware resource consumption while guaranteeing an acceptable error rate. For convolutional layers, the weights are binarized to reduce the number of model parameters and improve computational and storage efficiency. A multichannel shared computation (MCSC) architecture is proposed to maximize the reuse of weight and feature map data. The binary weight-sharing processing engine (PE) is designed to avoid limiting the number of multipliers. A custom instruction set is established according to the variable length of voice input to configure parameters for adapting to different network structures. Finally, the ping-pong storage method is used when the feature map is an input. We implemented this accelerator on Xilinx ZYNQ XC7Z035 under the working frequency of 150 MHz. The processing time for 2.24 s and 8 s of speech was 69.8 ms and 189.51 ms, respectively, and the convolution performance reached 35.66 GOPS/W. Compared with other computing platforms, accelerators perform better in terms of energy efficiency, power consumption and hardware resource consumption. MDPI 2023-01-30 /pmc/articles/PMC9920974/ /pubmed/36772567 http://dx.doi.org/10.3390/s23031530 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Guo, Lunyi
Mu, Shining
Deng, Yijie
Shi, Chaofan
Yan, Bo
Xiao, Zhuoling
Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition
title Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition
title_full Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition
title_fullStr Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition
title_full_unstemmed Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition
title_short Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition
title_sort efficient binary weight convolutional network accelerator for speech recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920974/
https://www.ncbi.nlm.nih.gov/pubmed/36772567
http://dx.doi.org/10.3390/s23031530
work_keys_str_mv AT guolunyi efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition
AT mushining efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition
AT dengyijie efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition
AT shichaofan efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition
AT yanbo efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition
AT xiaozhuoling efficientbinaryweightconvolutionalnetworkacceleratorforspeechrecognition