Cargando…
Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement
The quality of speech signals is affected by a combination of background noise, reverberation, and other distortions in real-life environments. The processing of such signals presents important challenges for tasks such as voice or speaker recognition. To enhance signals in such challenging conditio...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7297578/ http://dx.doi.org/10.1007/978-3-030-49076-8_29 |
_version_ | 1783547035541045248 |
---|---|
author | Coto-Jiménez, Marvin |
author_facet | Coto-Jiménez, Marvin |
author_sort | Coto-Jiménez, Marvin |
collection | PubMed |
description | The quality of speech signals is affected by a combination of background noise, reverberation, and other distortions in real-life environments. The processing of such signals presents important challenges for tasks such as voice or speaker recognition. To enhance signals in such challenging conditions several deep learning-based methods have been proposed. Those new methods have proven to be effective, in comparison to classical algorithms based on statistical analysis and signal processing. In particular, recurrent neural networks, especially those with long short-term memory (LSTM and BLSTM), have presented surprising results in tasks related to enhancing speech. One of the most challenging aspects of artificial neural networks is to reduce the high computational cost of the training procedure. In this work, we present a comparative study on transfer learning to accelerate and improve traditional training based on random initialization of the internal weights of the networks. The results show the advantage of the proposal in terms of less training time and better results for the task of denoising speech signals at several signal-to-noise ratio levels of white noise. |
format | Online Article Text |
id | pubmed-7297578 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72975782020-06-17 Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement Coto-Jiménez, Marvin Pattern Recognition Article The quality of speech signals is affected by a combination of background noise, reverberation, and other distortions in real-life environments. The processing of such signals presents important challenges for tasks such as voice or speaker recognition. To enhance signals in such challenging conditions several deep learning-based methods have been proposed. Those new methods have proven to be effective, in comparison to classical algorithms based on statistical analysis and signal processing. In particular, recurrent neural networks, especially those with long short-term memory (LSTM and BLSTM), have presented surprising results in tasks related to enhancing speech. One of the most challenging aspects of artificial neural networks is to reduce the high computational cost of the training procedure. In this work, we present a comparative study on transfer learning to accelerate and improve traditional training based on random initialization of the internal weights of the networks. The results show the advantage of the proposal in terms of less training time and better results for the task of denoising speech signals at several signal-to-noise ratio levels of white noise. 2020-04-29 /pmc/articles/PMC7297578/ http://dx.doi.org/10.1007/978-3-030-49076-8_29 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Coto-Jiménez, Marvin Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement |
title | Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement |
title_full | Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement |
title_fullStr | Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement |
title_full_unstemmed | Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement |
title_short | Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement |
title_sort | experimental study on transfer learning in denoising autoencoders for speech enhancement |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7297578/ http://dx.doi.org/10.1007/978-3-030-49076-8_29 |
work_keys_str_mv | AT cotojimenezmarvin experimentalstudyontransferlearningindenoisingautoencodersforspeechenhancement |