Cargando…
Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort
A deep recurrent neural network (RNN) for reducing transient sounds was developed and its effects on subjective speech intelligibility and listening comfort were investigated. The RNN was trained using sentences spoken with different accents and corrupted by transient sounds, using the clean speech...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8642050/ https://www.ncbi.nlm.nih.gov/pubmed/34606381 http://dx.doi.org/10.1177/23312165211041475 |
_version_ | 1784609611429445632 |
---|---|
author | Keshavarzi, Mahmoud Reichenbach, Tobias Moore, Brian C. J. |
author_facet | Keshavarzi, Mahmoud Reichenbach, Tobias Moore, Brian C. J. |
author_sort | Keshavarzi, Mahmoud |
collection | PubMed |
description | A deep recurrent neural network (RNN) for reducing transient sounds was developed and its effects on subjective speech intelligibility and listening comfort were investigated. The RNN was trained using sentences spoken with different accents and corrupted by transient sounds, using the clean speech as the target. It was tested using sentences spoken by unseen talkers and corrupted by unseen transient sounds. A paired-comparison procedure was used to compare all possible combinations of three conditions for subjective speech intelligibility and listening comfort for two relative levels of the transients. The conditions were: no processing (NP); processing using the RNN; and processing using a multi-channel transient reduction method (MCTR). Ten participants with normal hearing and ten with mild-to-moderate hearing loss participated. For the latter, frequency-dependent linear amplification was applied to all stimuli to compensate for individual audibility losses. For the normal-hearing participants, processing using the RNN was significantly preferred over that for NP for subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for subjective intelligibility, and processing using the MCTR was significantly preferred over that for NP for comfort for the higher transient level only. For the hearing-impaired participants, processing using the RNN was significantly preferred over that for NP for both subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for comfort, and processing using the MCTR was significantly preferred over that for NP for comfort. |
format | Online Article Text |
id | pubmed-8642050 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-86420502021-12-04 Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort Keshavarzi, Mahmoud Reichenbach, Tobias Moore, Brian C. J. Trends Hear Original Article A deep recurrent neural network (RNN) for reducing transient sounds was developed and its effects on subjective speech intelligibility and listening comfort were investigated. The RNN was trained using sentences spoken with different accents and corrupted by transient sounds, using the clean speech as the target. It was tested using sentences spoken by unseen talkers and corrupted by unseen transient sounds. A paired-comparison procedure was used to compare all possible combinations of three conditions for subjective speech intelligibility and listening comfort for two relative levels of the transients. The conditions were: no processing (NP); processing using the RNN; and processing using a multi-channel transient reduction method (MCTR). Ten participants with normal hearing and ten with mild-to-moderate hearing loss participated. For the latter, frequency-dependent linear amplification was applied to all stimuli to compensate for individual audibility losses. For the normal-hearing participants, processing using the RNN was significantly preferred over that for NP for subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for subjective intelligibility, and processing using the MCTR was significantly preferred over that for NP for comfort for the higher transient level only. For the hearing-impaired participants, processing using the RNN was significantly preferred over that for NP for both subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for comfort, and processing using the MCTR was significantly preferred over that for NP for comfort. SAGE Publications 2021-10-04 /pmc/articles/PMC8642050/ /pubmed/34606381 http://dx.doi.org/10.1177/23312165211041475 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Article Keshavarzi, Mahmoud Reichenbach, Tobias Moore, Brian C. J. Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort |
title | Transient Noise Reduction Using a Deep Recurrent Neural Network:
Effects on Subjective Speech Intelligibility and Listening
Comfort |
title_full | Transient Noise Reduction Using a Deep Recurrent Neural Network:
Effects on Subjective Speech Intelligibility and Listening
Comfort |
title_fullStr | Transient Noise Reduction Using a Deep Recurrent Neural Network:
Effects on Subjective Speech Intelligibility and Listening
Comfort |
title_full_unstemmed | Transient Noise Reduction Using a Deep Recurrent Neural Network:
Effects on Subjective Speech Intelligibility and Listening
Comfort |
title_short | Transient Noise Reduction Using a Deep Recurrent Neural Network:
Effects on Subjective Speech Intelligibility and Listening
Comfort |
title_sort | transient noise reduction using a deep recurrent neural network:
effects on subjective speech intelligibility and listening
comfort |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8642050/ https://www.ncbi.nlm.nih.gov/pubmed/34606381 http://dx.doi.org/10.1177/23312165211041475 |
work_keys_str_mv | AT keshavarzimahmoud transientnoisereductionusingadeeprecurrentneuralnetworkeffectsonsubjectivespeechintelligibilityandlisteningcomfort AT reichenbachtobias transientnoisereductionusingadeeprecurrentneuralnetworkeffectsonsubjectivespeechintelligibilityandlisteningcomfort AT moorebriancj transientnoisereductionusingadeeprecurrentneuralnetworkeffectsonsubjectivespeechintelligibilityandlisteningcomfort |