Cargando…

Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort

A deep recurrent neural network (RNN) for reducing transient sounds was developed and its effects on subjective speech intelligibility and listening comfort were investigated. The RNN was trained using sentences spoken with different accents and corrupted by transient sounds, using the clean speech...

Descripción completa

Detalles Bibliográficos
Autores principales: Keshavarzi, Mahmoud, Reichenbach, Tobias, Moore, Brian C. J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8642050/
https://www.ncbi.nlm.nih.gov/pubmed/34606381
http://dx.doi.org/10.1177/23312165211041475
_version_ 1784609611429445632
author Keshavarzi, Mahmoud
Reichenbach, Tobias
Moore, Brian C. J.
author_facet Keshavarzi, Mahmoud
Reichenbach, Tobias
Moore, Brian C. J.
author_sort Keshavarzi, Mahmoud
collection PubMed
description A deep recurrent neural network (RNN) for reducing transient sounds was developed and its effects on subjective speech intelligibility and listening comfort were investigated. The RNN was trained using sentences spoken with different accents and corrupted by transient sounds, using the clean speech as the target. It was tested using sentences spoken by unseen talkers and corrupted by unseen transient sounds. A paired-comparison procedure was used to compare all possible combinations of three conditions for subjective speech intelligibility and listening comfort for two relative levels of the transients. The conditions were: no processing (NP); processing using the RNN; and processing using a multi-channel transient reduction method (MCTR). Ten participants with normal hearing and ten with mild-to-moderate hearing loss participated. For the latter, frequency-dependent linear amplification was applied to all stimuli to compensate for individual audibility losses. For the normal-hearing participants, processing using the RNN was significantly preferred over that for NP for subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for subjective intelligibility, and processing using the MCTR was significantly preferred over that for NP for comfort for the higher transient level only. For the hearing-impaired participants, processing using the RNN was significantly preferred over that for NP for both subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for comfort, and processing using the MCTR was significantly preferred over that for NP for comfort.
format Online
Article
Text
id pubmed-8642050
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-86420502021-12-04 Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort Keshavarzi, Mahmoud Reichenbach, Tobias Moore, Brian C. J. Trends Hear Original Article A deep recurrent neural network (RNN) for reducing transient sounds was developed and its effects on subjective speech intelligibility and listening comfort were investigated. The RNN was trained using sentences spoken with different accents and corrupted by transient sounds, using the clean speech as the target. It was tested using sentences spoken by unseen talkers and corrupted by unseen transient sounds. A paired-comparison procedure was used to compare all possible combinations of three conditions for subjective speech intelligibility and listening comfort for two relative levels of the transients. The conditions were: no processing (NP); processing using the RNN; and processing using a multi-channel transient reduction method (MCTR). Ten participants with normal hearing and ten with mild-to-moderate hearing loss participated. For the latter, frequency-dependent linear amplification was applied to all stimuli to compensate for individual audibility losses. For the normal-hearing participants, processing using the RNN was significantly preferred over that for NP for subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for subjective intelligibility, and processing using the MCTR was significantly preferred over that for NP for comfort for the higher transient level only. For the hearing-impaired participants, processing using the RNN was significantly preferred over that for NP for both subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for comfort, and processing using the MCTR was significantly preferred over that for NP for comfort. SAGE Publications 2021-10-04 /pmc/articles/PMC8642050/ /pubmed/34606381 http://dx.doi.org/10.1177/23312165211041475 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Article
Keshavarzi, Mahmoud
Reichenbach, Tobias
Moore, Brian C. J.
Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort
title Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort
title_full Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort
title_fullStr Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort
title_full_unstemmed Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort
title_short Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort
title_sort transient noise reduction using a deep recurrent neural network: effects on subjective speech intelligibility and listening comfort
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8642050/
https://www.ncbi.nlm.nih.gov/pubmed/34606381
http://dx.doi.org/10.1177/23312165211041475
work_keys_str_mv AT keshavarzimahmoud transientnoisereductionusingadeeprecurrentneuralnetworkeffectsonsubjectivespeechintelligibilityandlisteningcomfort
AT reichenbachtobias transientnoisereductionusingadeeprecurrentneuralnetworkeffectsonsubjectivespeechintelligibilityandlisteningcomfort
AT moorebriancj transientnoisereductionusingadeeprecurrentneuralnetworkeffectsonsubjectivespeechintelligibilityandlisteningcomfort