Cargando…
Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage
The equalization cancellation model is often used to predict the binaural masking level difference. Previously its application to speech in noise has required separate knowledge about the speech and noise signals to maximize the signal-to-noise ratio (SNR). Here, a novel, blind equalization cancella...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7734536/ https://www.ncbi.nlm.nih.gov/pubmed/33305690 http://dx.doi.org/10.1177/2331216520975630 |
_version_ | 1783622492207710208 |
---|---|
author | Hauth, Christopher F. Berning, Simon C. Kollmeier, Birger Brand, Thomas |
author_facet | Hauth, Christopher F. Berning, Simon C. Kollmeier, Birger Brand, Thomas |
author_sort | Hauth, Christopher F. |
collection | PubMed |
description | The equalization cancellation model is often used to predict the binaural masking level difference. Previously its application to speech in noise has required separate knowledge about the speech and noise signals to maximize the signal-to-noise ratio (SNR). Here, a novel, blind equalization cancellation model is introduced that can use the mixed signals. This approach does not require any assumptions about particular sound source directions. It uses different strategies for positive and negative SNRs, with the switching between the two steered by a blind decision stage utilizing modulation cues. The output of the model is a single-channel signal with enhanced SNR, which we analyzed using the speech intelligibility index to compare speech intelligibility predictions. In a first experiment, the model was tested on experimental data obtained in a scenario with spatially separated target and masker signals. Predicted speech recognition thresholds were in good agreement with measured speech recognition thresholds with a root mean square error less than 1 dB. A second experiment investigated signals at positive SNRs, which was achieved using time compressed and low-pass filtered speech. The results demonstrated that binaural unmasking of speech occurs at positive SNRs and that the modulation-based switching strategy can predict the experimental results. |
format | Online Article Text |
id | pubmed-7734536 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-77345362020-12-21 Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage Hauth, Christopher F. Berning, Simon C. Kollmeier, Birger Brand, Thomas Trends Hear Original Article The equalization cancellation model is often used to predict the binaural masking level difference. Previously its application to speech in noise has required separate knowledge about the speech and noise signals to maximize the signal-to-noise ratio (SNR). Here, a novel, blind equalization cancellation model is introduced that can use the mixed signals. This approach does not require any assumptions about particular sound source directions. It uses different strategies for positive and negative SNRs, with the switching between the two steered by a blind decision stage utilizing modulation cues. The output of the model is a single-channel signal with enhanced SNR, which we analyzed using the speech intelligibility index to compare speech intelligibility predictions. In a first experiment, the model was tested on experimental data obtained in a scenario with spatially separated target and masker signals. Predicted speech recognition thresholds were in good agreement with measured speech recognition thresholds with a root mean square error less than 1 dB. A second experiment investigated signals at positive SNRs, which was achieved using time compressed and low-pass filtered speech. The results demonstrated that binaural unmasking of speech occurs at positive SNRs and that the modulation-based switching strategy can predict the experimental results. SAGE Publications 2020-12-11 /pmc/articles/PMC7734536/ /pubmed/33305690 http://dx.doi.org/10.1177/2331216520975630 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by-nc/4.0/ Creative Commons Non Commercial CC BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Article Hauth, Christopher F. Berning, Simon C. Kollmeier, Birger Brand, Thomas Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage |
title | Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage |
title_full | Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage |
title_fullStr | Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage |
title_full_unstemmed | Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage |
title_short | Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage |
title_sort | modeling binaural unmasking of speech using a blind binaural processing stage |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7734536/ https://www.ncbi.nlm.nih.gov/pubmed/33305690 http://dx.doi.org/10.1177/2331216520975630 |
work_keys_str_mv | AT hauthchristopherf modelingbinauralunmaskingofspeechusingablindbinauralprocessingstage AT berningsimonc modelingbinauralunmaskingofspeechusingablindbinauralprocessingstage AT kollmeierbirger modelingbinauralunmaskingofspeechusingablindbinauralprocessingstage AT brandthomas modelingbinauralunmaskingofspeechusingablindbinauralprocessingstage |