Cargando…
Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition
Remote applications that deal with speech need the speech signal to be compressed. First, speech coding transforms the continuous waveform into a numerical form. Then, the digitized signal is compressed with or without loss of information. This transformation affects the original waveform and degrad...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7340912/ http://dx.doi.org/10.1007/978-3-030-51935-3_24 |
_version_ | 1783555120325197824 |
---|---|
author | Dendani, Bilal Bahi, Halima Sari, Toufik |
author_facet | Dendani, Bilal Bahi, Halima Sari, Toufik |
author_sort | Dendani, Bilal |
collection | PubMed |
description | Remote applications that deal with speech need the speech signal to be compressed. First, speech coding transforms the continuous waveform into a numerical form. Then, the digitized signal is compressed with or without loss of information. This transformation affects the original waveform and degrades performances for further recognition of the speech signal. Meanwhile, the transmission is another source of speech degradation. To restore the original “clean” speech, speech enhancement (SE) is widely used, and deep learning algorithms are state-of-the-art, nowadays. In this paper, the target application is a remote Arabic speech recognition system, and the aim of using SE is to improve the accuracy of the speech recognizer. For that purpose, a Deep Auto Encoder (DAE) is used. The effect of the DAE-based SE is studied through different configurations, and the performances are evaluated through accuracy. The results showed an improvement of about 3.17 between the accuracy prior to the SE and that computed with the enhanced speech. |
format | Online Article Text |
id | pubmed-7340912 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-73409122020-07-08 Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition Dendani, Bilal Bahi, Halima Sari, Toufik Image and Signal Processing Article Remote applications that deal with speech need the speech signal to be compressed. First, speech coding transforms the continuous waveform into a numerical form. Then, the digitized signal is compressed with or without loss of information. This transformation affects the original waveform and degrades performances for further recognition of the speech signal. Meanwhile, the transmission is another source of speech degradation. To restore the original “clean” speech, speech enhancement (SE) is widely used, and deep learning algorithms are state-of-the-art, nowadays. In this paper, the target application is a remote Arabic speech recognition system, and the aim of using SE is to improve the accuracy of the speech recognizer. For that purpose, a Deep Auto Encoder (DAE) is used. The effect of the DAE-based SE is studied through different configurations, and the performances are evaluated through accuracy. The results showed an improvement of about 3.17 between the accuracy prior to the SE and that computed with the enhanced speech. 2020-06-05 /pmc/articles/PMC7340912/ http://dx.doi.org/10.1007/978-3-030-51935-3_24 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Dendani, Bilal Bahi, Halima Sari, Toufik Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition |
title | Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition |
title_full | Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition |
title_fullStr | Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition |
title_full_unstemmed | Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition |
title_short | Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition |
title_sort | speech enhancement based on deep autoencoder for remote arabic speech recognition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7340912/ http://dx.doi.org/10.1007/978-3-030-51935-3_24 |
work_keys_str_mv | AT dendanibilal speechenhancementbasedondeepautoencoderforremotearabicspeechrecognition AT bahihalima speechenhancementbasedondeepautoencoderforremotearabicspeechrecognition AT saritoufik speechenhancementbasedondeepautoencoderforremotearabicspeechrecognition |