Cargando…

Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals

SIMPLE SUMMARY: This paper introduces a new method for cleaning impaired speech by combining Pareto-optimized deep learning with Non-negative Matrix Factorization (NMF). The approach effectively reduces noise in impaired speech while preserving the desired speech quality. The method involves calcula...

Descripción completa

Detalles Bibliográficos
Autores principales:	Maskeliūnas, Rytis, Damaševičius, Robertas, Kulikajevas, Audrius, Pribuišis, Kipras, Ulozaitė-Stanienė, Nora, Uloza, Virgilijus
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10377391/ https://www.ncbi.nlm.nih.gov/pubmed/37509305 http://dx.doi.org/10.3390/cancers15143644

_version_	1785079505895817216
author	Maskeliūnas, Rytis Damaševičius, Robertas Kulikajevas, Audrius Pribuišis, Kipras Ulozaitė-Stanienė, Nora Uloza, Virgilijus
author_facet	Maskeliūnas, Rytis Damaševičius, Robertas Kulikajevas, Audrius Pribuišis, Kipras Ulozaitė-Stanienė, Nora Uloza, Virgilijus
author_sort	Maskeliūnas, Rytis
collection	PubMed
description	SIMPLE SUMMARY: This paper introduces a new method for cleaning impaired speech by combining Pareto-optimized deep learning with Non-negative Matrix Factorization (NMF). The approach effectively reduces noise in impaired speech while preserving the desired speech quality. The method involves calculating the spectrogram of a noisy voice clip, determining a noise threshold, computing a noise-to-signal mask, and smoothing it to avoid abrupt transitions. Using a Pareto-optimized NMF, the modified spectrogram is decomposed into basis functions and weights, allowing for reconstruction of the clean speech spectrogram. The final result is a noise-reduced waveform achieved by inverting the clean speech spectrogram. Experimental results validate the method’s effectiveness in cleaning alaryngeal speech signals, indicating its potential for real-world applications. ABSTRACT: The problem of cleaning impaired speech is crucial for various applications such as speech recognition, telecommunication, and assistive technologies. In this paper, we propose a novel approach that combines Pareto-optimized deep learning with non-negative matrix factorization (NMF) to effectively reduce noise in impaired speech signals while preserving the quality of the desired speech. Our method begins by calculating the spectrogram of a noisy voice clip and extracting frequency statistics. A threshold is then determined based on the desired noise sensitivity, and a noise-to-signal mask is computed. This mask is smoothed to avoid abrupt transitions in noise levels, and the modified spectrogram is obtained by applying the smoothed mask to the signal spectrogram. We then employ a Pareto-optimized NMF to decompose the modified spectrogram into basis functions and corresponding weights, which are used to reconstruct the clean speech spectrogram. The final noise-reduced waveform is obtained by inverting the clean speech spectrogram. Our proposed method achieves a balance between various objectives, such as noise suppression, speech quality preservation, and computational efficiency, by leveraging Pareto optimization in the deep learning model. The experimental results demonstrate the effectiveness of our approach in cleaning alaryngeal speech signals, making it a promising solution for various real-world applications.
format	Online Article Text
id	pubmed-10377391
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-103773912023-07-29 Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals Maskeliūnas, Rytis Damaševičius, Robertas Kulikajevas, Audrius Pribuišis, Kipras Ulozaitė-Stanienė, Nora Uloza, Virgilijus Cancers (Basel) Article SIMPLE SUMMARY: This paper introduces a new method for cleaning impaired speech by combining Pareto-optimized deep learning with Non-negative Matrix Factorization (NMF). The approach effectively reduces noise in impaired speech while preserving the desired speech quality. The method involves calculating the spectrogram of a noisy voice clip, determining a noise threshold, computing a noise-to-signal mask, and smoothing it to avoid abrupt transitions. Using a Pareto-optimized NMF, the modified spectrogram is decomposed into basis functions and weights, allowing for reconstruction of the clean speech spectrogram. The final result is a noise-reduced waveform achieved by inverting the clean speech spectrogram. Experimental results validate the method’s effectiveness in cleaning alaryngeal speech signals, indicating its potential for real-world applications. ABSTRACT: The problem of cleaning impaired speech is crucial for various applications such as speech recognition, telecommunication, and assistive technologies. In this paper, we propose a novel approach that combines Pareto-optimized deep learning with non-negative matrix factorization (NMF) to effectively reduce noise in impaired speech signals while preserving the quality of the desired speech. Our method begins by calculating the spectrogram of a noisy voice clip and extracting frequency statistics. A threshold is then determined based on the desired noise sensitivity, and a noise-to-signal mask is computed. This mask is smoothed to avoid abrupt transitions in noise levels, and the modified spectrogram is obtained by applying the smoothed mask to the signal spectrogram. We then employ a Pareto-optimized NMF to decompose the modified spectrogram into basis functions and corresponding weights, which are used to reconstruct the clean speech spectrogram. The final noise-reduced waveform is obtained by inverting the clean speech spectrogram. Our proposed method achieves a balance between various objectives, such as noise suppression, speech quality preservation, and computational efficiency, by leveraging Pareto optimization in the deep learning model. The experimental results demonstrate the effectiveness of our approach in cleaning alaryngeal speech signals, making it a promising solution for various real-world applications. MDPI 2023-07-16 /pmc/articles/PMC10377391/ /pubmed/37509305 http://dx.doi.org/10.3390/cancers15143644 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Maskeliūnas, Rytis Damaševičius, Robertas Kulikajevas, Audrius Pribuišis, Kipras Ulozaitė-Stanienė, Nora Uloza, Virgilijus Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals
title	Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals
title_full	Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals
title_fullStr	Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals
title_full_unstemmed	Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals
title_short	Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals
title_sort	pareto-optimized non-negative matrix factorization approach to the cleaning of alaryngeal speech signals
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10377391/ https://www.ncbi.nlm.nih.gov/pubmed/37509305 http://dx.doi.org/10.3390/cancers15143644
work_keys_str_mv	AT maskeliunasrytis paretooptimizednonnegativematrixfactorizationapproachtothecleaningofalaryngealspeechsignals AT damaseviciusrobertas paretooptimizednonnegativematrixfactorizationapproachtothecleaningofalaryngealspeechsignals AT kulikajevasaudrius paretooptimizednonnegativematrixfactorizationapproachtothecleaningofalaryngealspeechsignals AT pribuisiskipras paretooptimizednonnegativematrixfactorizationapproachtothecleaningofalaryngealspeechsignals AT ulozaitestanienenora paretooptimizednonnegativematrixfactorizationapproachtothecleaningofalaryngealspeechsignals AT ulozavirgilijus paretooptimizednonnegativematrixfactorizationapproachtothecleaningofalaryngealspeechsignals

Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals

Ejemplares similares