Cargando…

Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement

The quality of speech signals is affected by a combination of background noise, reverberation, and other distortions in real-life environments. The processing of such signals presents important challenges for tasks such as voice or speaker recognition. To enhance signals in such challenging conditio...

Descripción completa

Detalles Bibliográficos
Autor principal: Coto-Jiménez, Marvin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7297578/
http://dx.doi.org/10.1007/978-3-030-49076-8_29
_version_ 1783547035541045248
author Coto-Jiménez, Marvin
author_facet Coto-Jiménez, Marvin
author_sort Coto-Jiménez, Marvin
collection PubMed
description The quality of speech signals is affected by a combination of background noise, reverberation, and other distortions in real-life environments. The processing of such signals presents important challenges for tasks such as voice or speaker recognition. To enhance signals in such challenging conditions several deep learning-based methods have been proposed. Those new methods have proven to be effective, in comparison to classical algorithms based on statistical analysis and signal processing. In particular, recurrent neural networks, especially those with long short-term memory (LSTM and BLSTM), have presented surprising results in tasks related to enhancing speech. One of the most challenging aspects of artificial neural networks is to reduce the high computational cost of the training procedure. In this work, we present a comparative study on transfer learning to accelerate and improve traditional training based on random initialization of the internal weights of the networks. The results show the advantage of the proposal in terms of less training time and better results for the task of denoising speech signals at several signal-to-noise ratio levels of white noise.
format Online
Article
Text
id pubmed-7297578
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72975782020-06-17 Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement Coto-Jiménez, Marvin Pattern Recognition Article The quality of speech signals is affected by a combination of background noise, reverberation, and other distortions in real-life environments. The processing of such signals presents important challenges for tasks such as voice or speaker recognition. To enhance signals in such challenging conditions several deep learning-based methods have been proposed. Those new methods have proven to be effective, in comparison to classical algorithms based on statistical analysis and signal processing. In particular, recurrent neural networks, especially those with long short-term memory (LSTM and BLSTM), have presented surprising results in tasks related to enhancing speech. One of the most challenging aspects of artificial neural networks is to reduce the high computational cost of the training procedure. In this work, we present a comparative study on transfer learning to accelerate and improve traditional training based on random initialization of the internal weights of the networks. The results show the advantage of the proposal in terms of less training time and better results for the task of denoising speech signals at several signal-to-noise ratio levels of white noise. 2020-04-29 /pmc/articles/PMC7297578/ http://dx.doi.org/10.1007/978-3-030-49076-8_29 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Coto-Jiménez, Marvin
Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement
title Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement
title_full Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement
title_fullStr Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement
title_full_unstemmed Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement
title_short Experimental Study on Transfer Learning in Denoising Autoencoders for Speech Enhancement
title_sort experimental study on transfer learning in denoising autoencoders for speech enhancement
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7297578/
http://dx.doi.org/10.1007/978-3-030-49076-8_29
work_keys_str_mv AT cotojimenezmarvin experimentalstudyontransferlearningindenoisingautoencodersforspeechenhancement