Cargando…

Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition

Several studies have demonstrated that extended high frequencies (EHFs; >8 kHz) in speech are not only audible but also have some utility for speech recognition, including for speech-in-speech recognition when maskers are facing away from the listener. However, the contribution of EHF spectral ve...

Descripción completa

Detalles Bibliográficos
Autores principales: Trine, Allison, Monson, Brian B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7756042/
https://www.ncbi.nlm.nih.gov/pubmed/33345755
http://dx.doi.org/10.1177/2331216520980299
_version_ 1783626455407656960
author Trine, Allison
Monson, Brian B.
author_facet Trine, Allison
Monson, Brian B.
author_sort Trine, Allison
collection PubMed
description Several studies have demonstrated that extended high frequencies (EHFs; >8 kHz) in speech are not only audible but also have some utility for speech recognition, including for speech-in-speech recognition when maskers are facing away from the listener. However, the contribution of EHF spectral versus temporal information to speech recognition is unknown. Here, we show that access to EHF temporal information improved speech-in-speech recognition relative to speech bandlimited at 8 kHz but that additional access to EHF spectral detail provided an additional small but significant benefit. Results suggest that both EHF spectral structure and the temporal envelope contribute to the observed EHF benefit. Speech recognition performance was quite sensitive to masker head orientation, with a rotation of only 15° providing a highly significant benefit. An exploratory analysis indicated that pure-tone thresholds at EHFs are better predictors of speech recognition performance than low-frequency pure-tone thresholds.
format Online
Article
Text
id pubmed-7756042
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-77560422021-01-07 Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition Trine, Allison Monson, Brian B. Trends Hear Original Article Several studies have demonstrated that extended high frequencies (EHFs; >8 kHz) in speech are not only audible but also have some utility for speech recognition, including for speech-in-speech recognition when maskers are facing away from the listener. However, the contribution of EHF spectral versus temporal information to speech recognition is unknown. Here, we show that access to EHF temporal information improved speech-in-speech recognition relative to speech bandlimited at 8 kHz but that additional access to EHF spectral detail provided an additional small but significant benefit. Results suggest that both EHF spectral structure and the temporal envelope contribute to the observed EHF benefit. Speech recognition performance was quite sensitive to masker head orientation, with a rotation of only 15° providing a highly significant benefit. An exploratory analysis indicated that pure-tone thresholds at EHFs are better predictors of speech recognition performance than low-frequency pure-tone thresholds. SAGE Publications 2020-12-21 /pmc/articles/PMC7756042/ /pubmed/33345755 http://dx.doi.org/10.1177/2331216520980299 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by-nc/4.0/ Creative Commons Non Commercial CC BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Article
Trine, Allison
Monson, Brian B.
Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition
title Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition
title_full Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition
title_fullStr Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition
title_full_unstemmed Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition
title_short Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition
title_sort extended high frequencies provide both spectral and temporal information to improve speech-in-speech recognition
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7756042/
https://www.ncbi.nlm.nih.gov/pubmed/33345755
http://dx.doi.org/10.1177/2331216520980299
work_keys_str_mv AT trineallison extendedhighfrequenciesprovidebothspectralandtemporalinformationtoimprovespeechinspeechrecognition
AT monsonbrianb extendedhighfrequenciesprovidebothspectralandtemporalinformationtoimprovespeechinspeechrecognition