Cargando…

A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient

Silent speech recognition breaks the limitations of automatic speech recognition when acoustic signals cannot be produced or captured clearly, but still has a long way to go before being ready for any real-life applications. To address this issue, we propose a novel silent speech recognition framewo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wu, Jinghan, Zhang, Yakun, Xie, Liang, Yan, Ye, Zhang, Xu, Liu, Shuang, An, Xingwei, Yin, Erwei, Ming, Dong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9478652/ https://www.ncbi.nlm.nih.gov/pubmed/36119717 http://dx.doi.org/10.3389/fnbot.2022.971446

_version_	1784790619460206592
author	Wu, Jinghan Zhang, Yakun Xie, Liang Yan, Ye Zhang, Xu Liu, Shuang An, Xingwei Yin, Erwei Ming, Dong
author_facet	Wu, Jinghan Zhang, Yakun Xie, Liang Yan, Ye Zhang, Xu Liu, Shuang An, Xingwei Yin, Erwei Ming, Dong
author_sort	Wu, Jinghan
collection	PubMed
description	Silent speech recognition breaks the limitations of automatic speech recognition when acoustic signals cannot be produced or captured clearly, but still has a long way to go before being ready for any real-life applications. To address this issue, we propose a novel silent speech recognition framework based on surface electromyography (sEMG) signals. In our approach, a new deep learning architecture Parallel Inception Convolutional Neural Network (PICNN) is proposed and implemented in our silent speech recognition system, with six inception modules processing six channels of sEMG data, separately and simultaneously. Meanwhile, Mel Frequency Spectral Coefficients (MFSCs) are employed to extract speech-related sEMG features for the first time. We further design and generate a 100-class dataset containing daily life assistance demands for the elderly and disabled individuals. The experimental results obtained from 28 subjects confirm that our silent speech recognition method outperforms state-of-the-art machine learning algorithms and deep learning architectures, achieving the best recognition accuracy of 90.76%. With sEMG data collected from four new subjects, efficient steps of subject-based transfer learning are conducted to further improve the cross-subject recognition ability of the proposed model. Promising results prove that our sEMG-based silent speech recognition system could have high recognition accuracy and steady performance in practical applications.
format	Online Article Text
id	pubmed-9478652
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-94786522022-09-17 A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient Wu, Jinghan Zhang, Yakun Xie, Liang Yan, Ye Zhang, Xu Liu, Shuang An, Xingwei Yin, Erwei Ming, Dong Front Neurorobot Neuroscience Silent speech recognition breaks the limitations of automatic speech recognition when acoustic signals cannot be produced or captured clearly, but still has a long way to go before being ready for any real-life applications. To address this issue, we propose a novel silent speech recognition framework based on surface electromyography (sEMG) signals. In our approach, a new deep learning architecture Parallel Inception Convolutional Neural Network (PICNN) is proposed and implemented in our silent speech recognition system, with six inception modules processing six channels of sEMG data, separately and simultaneously. Meanwhile, Mel Frequency Spectral Coefficients (MFSCs) are employed to extract speech-related sEMG features for the first time. We further design and generate a 100-class dataset containing daily life assistance demands for the elderly and disabled individuals. The experimental results obtained from 28 subjects confirm that our silent speech recognition method outperforms state-of-the-art machine learning algorithms and deep learning architectures, achieving the best recognition accuracy of 90.76%. With sEMG data collected from four new subjects, efficient steps of subject-based transfer learning are conducted to further improve the cross-subject recognition ability of the proposed model. Promising results prove that our sEMG-based silent speech recognition system could have high recognition accuracy and steady performance in practical applications. Frontiers Media S.A. 2022-09-02 /pmc/articles/PMC9478652/ /pubmed/36119717 http://dx.doi.org/10.3389/fnbot.2022.971446 Text en Copyright © 2022 Wu, Zhang, Xie, Yan, Zhang, Liu, An, Yin and Ming. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Wu, Jinghan Zhang, Yakun Xie, Liang Yan, Ye Zhang, Xu Liu, Shuang An, Xingwei Yin, Erwei Ming, Dong A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient
title	A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient
title_full	A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient
title_fullStr	A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient
title_full_unstemmed	A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient
title_short	A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient
title_sort	novel silent speech recognition approach based on parallel inception convolutional neural network and mel frequency spectral coefficient
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9478652/ https://www.ncbi.nlm.nih.gov/pubmed/36119717 http://dx.doi.org/10.3389/fnbot.2022.971446
work_keys_str_mv	AT wujinghan anovelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT zhangyakun anovelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT xieliang anovelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT yanye anovelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT zhangxu anovelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT liushuang anovelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT anxingwei anovelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT yinerwei anovelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT mingdong anovelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT wujinghan novelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT zhangyakun novelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT xieliang novelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT yanye novelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT zhangxu novelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT liushuang novelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT anxingwei novelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT yinerwei novelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient AT mingdong novelsilentspeechrecognitionapproachbasedonparallelinceptionconvolutionalneuralnetworkandmelfrequencyspectralcoefficient

A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient

Ejemplares similares