Cargando…

Speech enhancement using long short term memory with trained speech features and adaptive wiener filter

Speech enhancement is the process of enhancing the clarity and intelligibility of speech signals that have been degraded due to background noise. With the assistance of deep learning, a novel speech signal enhancement model is introduced in this research. The proposed model is divided into two phase...

Descripción completa

Detalles Bibliográficos
Autor principal:	Garg, Anil
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9281249/ https://www.ncbi.nlm.nih.gov/pubmed/35855772 http://dx.doi.org/10.1007/s11042-022-13302-3

_version_	1784746837740093440
author	Garg, Anil
author_facet	Garg, Anil
author_sort	Garg, Anil
collection	PubMed
description	Speech enhancement is the process of enhancing the clarity and intelligibility of speech signals that have been degraded due to background noise. With the assistance of deep learning, a novel speech signal enhancement model is introduced in this research. The proposed model is divided into two phases: (i) Training (ii) Testing. In the training phase, the noise spectrum and signal spectrum are estimated via a Non-negative Matrix Factorization (NMF) from the noisy input signal. Then, Empirical Mean Decomposition (EMD) features are extracted from the Wiener filter. The de-noised signal is acquired from EMD, the bark frequency is evaluated and the Fractional Delta AMS features are extracted. The key contribution of this study is the use of the Long Short Term Memory (LSTM) model to properly estimate the tuning factor η of the Wiener filter for all input signals. The LSTM was trained by the extracted features (EMD) via a modified wiener filter for decomposing the spectral signal and the output of EMD is the denoised enhanced speech signal. A comparative evaluation is carried out between the proposed and existing models in terms of error measures.
format	Online Article Text
id	pubmed-9281249
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-92812492022-07-14 Speech enhancement using long short term memory with trained speech features and adaptive wiener filter Garg, Anil Multimed Tools Appl Article Speech enhancement is the process of enhancing the clarity and intelligibility of speech signals that have been degraded due to background noise. With the assistance of deep learning, a novel speech signal enhancement model is introduced in this research. The proposed model is divided into two phases: (i) Training (ii) Testing. In the training phase, the noise spectrum and signal spectrum are estimated via a Non-negative Matrix Factorization (NMF) from the noisy input signal. Then, Empirical Mean Decomposition (EMD) features are extracted from the Wiener filter. The de-noised signal is acquired from EMD, the bark frequency is evaluated and the Fractional Delta AMS features are extracted. The key contribution of this study is the use of the Long Short Term Memory (LSTM) model to properly estimate the tuning factor η of the Wiener filter for all input signals. The LSTM was trained by the extracted features (EMD) via a modified wiener filter for decomposing the spectral signal and the output of EMD is the denoised enhanced speech signal. A comparative evaluation is carried out between the proposed and existing models in terms of error measures. Springer US 2022-07-14 2023 /pmc/articles/PMC9281249/ /pubmed/35855772 http://dx.doi.org/10.1007/s11042-022-13302-3 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Article Garg, Anil Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
title	Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
title_full	Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
title_fullStr	Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
title_full_unstemmed	Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
title_short	Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
title_sort	speech enhancement using long short term memory with trained speech features and adaptive wiener filter
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9281249/ https://www.ncbi.nlm.nih.gov/pubmed/35855772 http://dx.doi.org/10.1007/s11042-022-13302-3
work_keys_str_mv	AT garganil speechenhancementusinglongshorttermmemorywithtrainedspeechfeaturesandadaptivewienerfilter

Speech enhancement using long short term memory with trained speech features and adaptive wiener filter

Ejemplares similares