Cargando…

Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage

The equalization cancellation model is often used to predict the binaural masking level difference. Previously its application to speech in noise has required separate knowledge about the speech and noise signals to maximize the signal-to-noise ratio (SNR). Here, a novel, blind equalization cancella...

Descripción completa

Detalles Bibliográficos
Autores principales: Hauth, Christopher F., Berning, Simon C., Kollmeier, Birger, Brand, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7734536/
https://www.ncbi.nlm.nih.gov/pubmed/33305690
http://dx.doi.org/10.1177/2331216520975630
_version_ 1783622492207710208
author Hauth, Christopher F.
Berning, Simon C.
Kollmeier, Birger
Brand, Thomas
author_facet Hauth, Christopher F.
Berning, Simon C.
Kollmeier, Birger
Brand, Thomas
author_sort Hauth, Christopher F.
collection PubMed
description The equalization cancellation model is often used to predict the binaural masking level difference. Previously its application to speech in noise has required separate knowledge about the speech and noise signals to maximize the signal-to-noise ratio (SNR). Here, a novel, blind equalization cancellation model is introduced that can use the mixed signals. This approach does not require any assumptions about particular sound source directions. It uses different strategies for positive and negative SNRs, with the switching between the two steered by a blind decision stage utilizing modulation cues. The output of the model is a single-channel signal with enhanced SNR, which we analyzed using the speech intelligibility index to compare speech intelligibility predictions. In a first experiment, the model was tested on experimental data obtained in a scenario with spatially separated target and masker signals. Predicted speech recognition thresholds were in good agreement with measured speech recognition thresholds with a root mean square error less than 1 dB. A second experiment investigated signals at positive SNRs, which was achieved using time compressed and low-pass filtered speech. The results demonstrated that binaural unmasking of speech occurs at positive SNRs and that the modulation-based switching strategy can predict the experimental results.
format Online
Article
Text
id pubmed-7734536
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-77345362020-12-21 Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage Hauth, Christopher F. Berning, Simon C. Kollmeier, Birger Brand, Thomas Trends Hear Original Article The equalization cancellation model is often used to predict the binaural masking level difference. Previously its application to speech in noise has required separate knowledge about the speech and noise signals to maximize the signal-to-noise ratio (SNR). Here, a novel, blind equalization cancellation model is introduced that can use the mixed signals. This approach does not require any assumptions about particular sound source directions. It uses different strategies for positive and negative SNRs, with the switching between the two steered by a blind decision stage utilizing modulation cues. The output of the model is a single-channel signal with enhanced SNR, which we analyzed using the speech intelligibility index to compare speech intelligibility predictions. In a first experiment, the model was tested on experimental data obtained in a scenario with spatially separated target and masker signals. Predicted speech recognition thresholds were in good agreement with measured speech recognition thresholds with a root mean square error less than 1 dB. A second experiment investigated signals at positive SNRs, which was achieved using time compressed and low-pass filtered speech. The results demonstrated that binaural unmasking of speech occurs at positive SNRs and that the modulation-based switching strategy can predict the experimental results. SAGE Publications 2020-12-11 /pmc/articles/PMC7734536/ /pubmed/33305690 http://dx.doi.org/10.1177/2331216520975630 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by-nc/4.0/ Creative Commons Non Commercial CC BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Article
Hauth, Christopher F.
Berning, Simon C.
Kollmeier, Birger
Brand, Thomas
Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage
title Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage
title_full Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage
title_fullStr Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage
title_full_unstemmed Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage
title_short Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage
title_sort modeling binaural unmasking of speech using a blind binaural processing stage
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7734536/
https://www.ncbi.nlm.nih.gov/pubmed/33305690
http://dx.doi.org/10.1177/2331216520975630
work_keys_str_mv AT hauthchristopherf modelingbinauralunmaskingofspeechusingablindbinauralprocessingstage
AT berningsimonc modelingbinauralunmaskingofspeechusingablindbinauralprocessingstage
AT kollmeierbirger modelingbinauralunmaskingofspeechusingablindbinauralprocessingstage
AT brandthomas modelingbinauralunmaskingofspeechusingablindbinauralprocessingstage