Cargando…

Band importance for speech-in-speech recognition

Predicting masked speech perception typically relies on estimates of the spectral distribution of cues supporting recognition. Current methods for estimating band importance for speech-in-noise use filtered stimuli. These methods are not appropriate for speech-in-speech because filtering can modify...

Descripción completa

Detalles Bibliográficos
Autores principales: Buss, Emily, Bosen, Adam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Acoustical Society of America 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8499852/
https://www.ncbi.nlm.nih.gov/pubmed/34661194
http://dx.doi.org/10.1121/10.0005762
_version_ 1784580370852741120
author Buss, Emily
Bosen, Adam
author_facet Buss, Emily
Bosen, Adam
author_sort Buss, Emily
collection PubMed
description Predicting masked speech perception typically relies on estimates of the spectral distribution of cues supporting recognition. Current methods for estimating band importance for speech-in-noise use filtered stimuli. These methods are not appropriate for speech-in-speech because filtering can modify stimulus features affecting auditory stream segregation. Here, band importance is estimated by quantifying the relationship between speech recognition accuracy for full-spectrum speech and the target-to-masker ratio by channel at the output of an auditory filterbank. Preliminary results provide support for this approach and indicate that frequencies below 2 kHz may contribute more to speech recognition in two-talker speech than in speech-shaped noise.
format Online
Article
Text
id pubmed-8499852
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Acoustical Society of America
record_format MEDLINE/PubMed
spelling pubmed-84998522021-10-14 Band importance for speech-in-speech recognition Buss, Emily Bosen, Adam JASA Express Lett Psychological and Physiological Acoustics Predicting masked speech perception typically relies on estimates of the spectral distribution of cues supporting recognition. Current methods for estimating band importance for speech-in-noise use filtered stimuli. These methods are not appropriate for speech-in-speech because filtering can modify stimulus features affecting auditory stream segregation. Here, band importance is estimated by quantifying the relationship between speech recognition accuracy for full-spectrum speech and the target-to-masker ratio by channel at the output of an auditory filterbank. Preliminary results provide support for this approach and indicate that frequencies below 2 kHz may contribute more to speech recognition in two-talker speech than in speech-shaped noise. Acoustical Society of America 2021-08 2021-08-02 /pmc/articles/PMC8499852/ /pubmed/34661194 http://dx.doi.org/10.1121/10.0005762 Text en © 2021 Author(s). 2691-1191/2021/1(8)/084402/6 https://creativecommons.org/licenses/by/4.0/All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Psychological and Physiological Acoustics
Buss, Emily
Bosen, Adam
Band importance for speech-in-speech recognition
title Band importance for speech-in-speech recognition
title_full Band importance for speech-in-speech recognition
title_fullStr Band importance for speech-in-speech recognition
title_full_unstemmed Band importance for speech-in-speech recognition
title_short Band importance for speech-in-speech recognition
title_sort band importance for speech-in-speech recognition
topic Psychological and Physiological Acoustics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8499852/
https://www.ncbi.nlm.nih.gov/pubmed/34661194
http://dx.doi.org/10.1121/10.0005762
work_keys_str_mv AT bussemily bandimportanceforspeechinspeechrecognition
AT bosenadam bandimportanceforspeechinspeechrecognition