Cargando…

Measurement and Prediction of Binaural-Temporal Integration of Speech Reflections

For speech intelligibility in rooms, the temporal integration of speech reflections is typically modeled by separating the room impulse response (RIR) into an early (assumed beneficial for speech intelligibility) and a late part (assumed detrimental). This concept was challenged in this study by emp...

Descripción completa

Detalles Bibliográficos
Autores principales: Rennies, Jan, Warzybok, Anna, Brand, Thomas, Kollmeier, Birger
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6593929/
https://www.ncbi.nlm.nih.gov/pubmed/31234732
http://dx.doi.org/10.1177/2331216519854267
_version_ 1783430153345433600
author Rennies, Jan
Warzybok, Anna
Brand, Thomas
Kollmeier, Birger
author_facet Rennies, Jan
Warzybok, Anna
Brand, Thomas
Kollmeier, Birger
author_sort Rennies, Jan
collection PubMed
description For speech intelligibility in rooms, the temporal integration of speech reflections is typically modeled by separating the room impulse response (RIR) into an early (assumed beneficial for speech intelligibility) and a late part (assumed detrimental). This concept was challenged in this study by employing binaural RIRs with systematically varied interaural phase differences (IPDs) and amplitude of the direct sound and a variable number of reflections delayed by up to 200 ms. Speech recognition thresholds in stationary noise were measured in normal-hearing listeners for 86 conditions. The data showed that direct sound and one or several early speech reflections could be perfectly integrated when they had the same IPD. Early reflections with the same IPD as the noise (but not as the direct sound) could not be perfectly integrated with the direct sound. All conditions in which the dominant speech information was within the early RIR components could be well predicted by a binaural speech intelligibility model using classic early/late separation. In contrast, when amplitude or IPD favored late RIR components, listeners appeared to be capable of focusing on these components rather than on the precedent direct sound. This could not be modeled by an early/late separation window but required a temporal integration window that can be flexibly shifted along the RIR.
format Online
Article
Text
id pubmed-6593929
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-65939292019-07-01 Measurement and Prediction of Binaural-Temporal Integration of Speech Reflections Rennies, Jan Warzybok, Anna Brand, Thomas Kollmeier, Birger Trends Hear Original Article For speech intelligibility in rooms, the temporal integration of speech reflections is typically modeled by separating the room impulse response (RIR) into an early (assumed beneficial for speech intelligibility) and a late part (assumed detrimental). This concept was challenged in this study by employing binaural RIRs with systematically varied interaural phase differences (IPDs) and amplitude of the direct sound and a variable number of reflections delayed by up to 200 ms. Speech recognition thresholds in stationary noise were measured in normal-hearing listeners for 86 conditions. The data showed that direct sound and one or several early speech reflections could be perfectly integrated when they had the same IPD. Early reflections with the same IPD as the noise (but not as the direct sound) could not be perfectly integrated with the direct sound. All conditions in which the dominant speech information was within the early RIR components could be well predicted by a binaural speech intelligibility model using classic early/late separation. In contrast, when amplitude or IPD favored late RIR components, listeners appeared to be capable of focusing on these components rather than on the precedent direct sound. This could not be modeled by an early/late separation window but required a temporal integration window that can be flexibly shifted along the RIR. SAGE Publications 2019-06-25 /pmc/articles/PMC6593929/ /pubmed/31234732 http://dx.doi.org/10.1177/2331216519854267 Text en © The Author(s) 2019 http://creativecommons.org/licenses/by-nc/4.0/ Creative Commons Non Commercial CC BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Article
Rennies, Jan
Warzybok, Anna
Brand, Thomas
Kollmeier, Birger
Measurement and Prediction of Binaural-Temporal Integration of Speech Reflections
title Measurement and Prediction of Binaural-Temporal Integration of Speech Reflections
title_full Measurement and Prediction of Binaural-Temporal Integration of Speech Reflections
title_fullStr Measurement and Prediction of Binaural-Temporal Integration of Speech Reflections
title_full_unstemmed Measurement and Prediction of Binaural-Temporal Integration of Speech Reflections
title_short Measurement and Prediction of Binaural-Temporal Integration of Speech Reflections
title_sort measurement and prediction of binaural-temporal integration of speech reflections
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6593929/
https://www.ncbi.nlm.nih.gov/pubmed/31234732
http://dx.doi.org/10.1177/2331216519854267
work_keys_str_mv AT renniesjan measurementandpredictionofbinauraltemporalintegrationofspeechreflections
AT warzybokanna measurementandpredictionofbinauraltemporalintegrationofspeechreflections
AT brandthomas measurementandpredictionofbinauraltemporalintegrationofspeechreflections
AT kollmeierbirger measurementandpredictionofbinauraltemporalintegrationofspeechreflections