Cargando…
Band importance for speech-in-speech recognition
Predicting masked speech perception typically relies on estimates of the spectral distribution of cues supporting recognition. Current methods for estimating band importance for speech-in-noise use filtered stimuli. These methods are not appropriate for speech-in-speech because filtering can modify...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Acoustical Society of America
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8499852/ https://www.ncbi.nlm.nih.gov/pubmed/34661194 http://dx.doi.org/10.1121/10.0005762 |
_version_ | 1784580370852741120 |
---|---|
author | Buss, Emily Bosen, Adam |
author_facet | Buss, Emily Bosen, Adam |
author_sort | Buss, Emily |
collection | PubMed |
description | Predicting masked speech perception typically relies on estimates of the spectral distribution of cues supporting recognition. Current methods for estimating band importance for speech-in-noise use filtered stimuli. These methods are not appropriate for speech-in-speech because filtering can modify stimulus features affecting auditory stream segregation. Here, band importance is estimated by quantifying the relationship between speech recognition accuracy for full-spectrum speech and the target-to-masker ratio by channel at the output of an auditory filterbank. Preliminary results provide support for this approach and indicate that frequencies below 2 kHz may contribute more to speech recognition in two-talker speech than in speech-shaped noise. |
format | Online Article Text |
id | pubmed-8499852 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Acoustical Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-84998522021-10-14 Band importance for speech-in-speech recognition Buss, Emily Bosen, Adam JASA Express Lett Psychological and Physiological Acoustics Predicting masked speech perception typically relies on estimates of the spectral distribution of cues supporting recognition. Current methods for estimating band importance for speech-in-noise use filtered stimuli. These methods are not appropriate for speech-in-speech because filtering can modify stimulus features affecting auditory stream segregation. Here, band importance is estimated by quantifying the relationship between speech recognition accuracy for full-spectrum speech and the target-to-masker ratio by channel at the output of an auditory filterbank. Preliminary results provide support for this approach and indicate that frequencies below 2 kHz may contribute more to speech recognition in two-talker speech than in speech-shaped noise. Acoustical Society of America 2021-08 2021-08-02 /pmc/articles/PMC8499852/ /pubmed/34661194 http://dx.doi.org/10.1121/10.0005762 Text en © 2021 Author(s). 2691-1191/2021/1(8)/084402/6 https://creativecommons.org/licenses/by/4.0/All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ). |
spellingShingle | Psychological and Physiological Acoustics Buss, Emily Bosen, Adam Band importance for speech-in-speech recognition |
title | Band importance for speech-in-speech recognition |
title_full | Band importance for speech-in-speech recognition |
title_fullStr | Band importance for speech-in-speech recognition |
title_full_unstemmed | Band importance for speech-in-speech recognition |
title_short | Band importance for speech-in-speech recognition |
title_sort | band importance for speech-in-speech recognition |
topic | Psychological and Physiological Acoustics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8499852/ https://www.ncbi.nlm.nih.gov/pubmed/34661194 http://dx.doi.org/10.1121/10.0005762 |
work_keys_str_mv | AT bussemily bandimportanceforspeechinspeechrecognition AT bosenadam bandimportanceforspeechinspeechrecognition |