Cargando…

Application of dynamic time warping optimization algorithm in speech recognition of machine translation

Speech recognition is the foundation of human-computer interaction technology and an important aspect of speech signal processing, with broad application prospects. Therefore, it is very necessary to recognize speech. At present, speech recognition has problems such as low recognition rate, slow rec...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Shaohua, Chen, Zheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10651500/
https://www.ncbi.nlm.nih.gov/pubmed/38027668
http://dx.doi.org/10.1016/j.heliyon.2023.e21625
_version_ 1785136011475419136
author Jiang, Shaohua
Chen, Zheng
author_facet Jiang, Shaohua
Chen, Zheng
author_sort Jiang, Shaohua
collection PubMed
description Speech recognition is the foundation of human-computer interaction technology and an important aspect of speech signal processing, with broad application prospects. Therefore, it is very necessary to recognize speech. At present, speech recognition has problems such as low recognition rate, slow recognition speed, and severe interference from other factors. This paper studied speech recognition based on dynamic time warping (DTW) algorithm. By introducing speech recognition, the specific steps of speech recognition were understood. Before performing speech recognition, the speech that needs to be recognized needs to be converted into a speech sequence using an acoustic model. Then, the DTW algorithm was used to preprocess speech recognition, mainly by sampling and windowing the speech. After preprocessing, speech feature extraction was carried out. After feature extraction was completed, speech recognition was carried out. Through experiments, it can be found that the recognition rate of speech recognition on the basis of DTW algorithm was very high. In a quiet environment, the recognition rate was above 93.85 %, and the average recognition rate of the 10 selected testers was 95.8 %. In a noisy environment, the recognition rate was above 91.4 %, and the average recognition rate of the 10 selected testers was 93 %. In addition to high recognition rate, DTW based speech recognition also had a very fast speed for vocabulary recognition. Based on the DTW algorithm, speech recognition not only has a high recognition rate, but also has a faster recognition speed.
format Online
Article
Text
id pubmed-10651500
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-106515002023-10-27 Application of dynamic time warping optimization algorithm in speech recognition of machine translation Jiang, Shaohua Chen, Zheng Heliyon Research Article Speech recognition is the foundation of human-computer interaction technology and an important aspect of speech signal processing, with broad application prospects. Therefore, it is very necessary to recognize speech. At present, speech recognition has problems such as low recognition rate, slow recognition speed, and severe interference from other factors. This paper studied speech recognition based on dynamic time warping (DTW) algorithm. By introducing speech recognition, the specific steps of speech recognition were understood. Before performing speech recognition, the speech that needs to be recognized needs to be converted into a speech sequence using an acoustic model. Then, the DTW algorithm was used to preprocess speech recognition, mainly by sampling and windowing the speech. After preprocessing, speech feature extraction was carried out. After feature extraction was completed, speech recognition was carried out. Through experiments, it can be found that the recognition rate of speech recognition on the basis of DTW algorithm was very high. In a quiet environment, the recognition rate was above 93.85 %, and the average recognition rate of the 10 selected testers was 95.8 %. In a noisy environment, the recognition rate was above 91.4 %, and the average recognition rate of the 10 selected testers was 93 %. In addition to high recognition rate, DTW based speech recognition also had a very fast speed for vocabulary recognition. Based on the DTW algorithm, speech recognition not only has a high recognition rate, but also has a faster recognition speed. Elsevier 2023-10-27 /pmc/articles/PMC10651500/ /pubmed/38027668 http://dx.doi.org/10.1016/j.heliyon.2023.e21625 Text en © 2023 Published by Elsevier Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Jiang, Shaohua
Chen, Zheng
Application of dynamic time warping optimization algorithm in speech recognition of machine translation
title Application of dynamic time warping optimization algorithm in speech recognition of machine translation
title_full Application of dynamic time warping optimization algorithm in speech recognition of machine translation
title_fullStr Application of dynamic time warping optimization algorithm in speech recognition of machine translation
title_full_unstemmed Application of dynamic time warping optimization algorithm in speech recognition of machine translation
title_short Application of dynamic time warping optimization algorithm in speech recognition of machine translation
title_sort application of dynamic time warping optimization algorithm in speech recognition of machine translation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10651500/
https://www.ncbi.nlm.nih.gov/pubmed/38027668
http://dx.doi.org/10.1016/j.heliyon.2023.e21625
work_keys_str_mv AT jiangshaohua applicationofdynamictimewarpingoptimizationalgorithminspeechrecognitionofmachinetranslation
AT chenzheng applicationofdynamictimewarpingoptimizationalgorithminspeechrecognitionofmachinetranslation