Cargando…

The Modulation Transfer Function for Speech Intelligibility

We systematically determined which spectrotemporal modulations in speech are necessary for comprehension by human listeners. Speech comprehension has been shown to be robust to spectral and temporal degradations, but the specific relevance of particular degradations is arguable due to the complexity...

Descripción completa

Detalles Bibliográficos
Autores principales: Elliott, Taffeta M., Theunissen, Frédéric E.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2639724/
https://www.ncbi.nlm.nih.gov/pubmed/19266016
http://dx.doi.org/10.1371/journal.pcbi.1000302
_version_ 1782164499714801664
author Elliott, Taffeta M.
Theunissen, Frédéric E.
author_facet Elliott, Taffeta M.
Theunissen, Frédéric E.
author_sort Elliott, Taffeta M.
collection PubMed
description We systematically determined which spectrotemporal modulations in speech are necessary for comprehension by human listeners. Speech comprehension has been shown to be robust to spectral and temporal degradations, but the specific relevance of particular degradations is arguable due to the complexity of the joint spectral and temporal information in the speech signal. We applied a novel modulation filtering technique to recorded sentences to restrict acoustic information quantitatively and to obtain a joint spectrotemporal modulation transfer function for speech comprehension, the speech MTF. For American English, the speech MTF showed the criticality of low modulation frequencies in both time and frequency. Comprehension was significantly impaired when temporal modulations <12 Hz or spectral modulations <4 cycles/kHz were removed. More specifically, the MTF was bandpass in temporal modulations and low-pass in spectral modulations: temporal modulations from 1 to 7 Hz and spectral modulations <1 cycles/kHz were the most important. We evaluated the importance of spectrotemporal modulations for vocal gender identification and found a different region of interest: removing spectral modulations between 3 and 7 cycles/kHz significantly increases gender misidentifications of female speakers. The determination of the speech MTF furnishes an additional method for producing speech signals with reduced bandwidth but high intelligibility. Such compression could be used for audio applications such as file compression or noise removal and for clinical applications such as signal processing for cochlear implants.
format Text
id pubmed-2639724
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-26397242009-03-06 The Modulation Transfer Function for Speech Intelligibility Elliott, Taffeta M. Theunissen, Frédéric E. PLoS Comput Biol Research Article We systematically determined which spectrotemporal modulations in speech are necessary for comprehension by human listeners. Speech comprehension has been shown to be robust to spectral and temporal degradations, but the specific relevance of particular degradations is arguable due to the complexity of the joint spectral and temporal information in the speech signal. We applied a novel modulation filtering technique to recorded sentences to restrict acoustic information quantitatively and to obtain a joint spectrotemporal modulation transfer function for speech comprehension, the speech MTF. For American English, the speech MTF showed the criticality of low modulation frequencies in both time and frequency. Comprehension was significantly impaired when temporal modulations <12 Hz or spectral modulations <4 cycles/kHz were removed. More specifically, the MTF was bandpass in temporal modulations and low-pass in spectral modulations: temporal modulations from 1 to 7 Hz and spectral modulations <1 cycles/kHz were the most important. We evaluated the importance of spectrotemporal modulations for vocal gender identification and found a different region of interest: removing spectral modulations between 3 and 7 cycles/kHz significantly increases gender misidentifications of female speakers. The determination of the speech MTF furnishes an additional method for producing speech signals with reduced bandwidth but high intelligibility. Such compression could be used for audio applications such as file compression or noise removal and for clinical applications such as signal processing for cochlear implants. Public Library of Science 2009-03-06 /pmc/articles/PMC2639724/ /pubmed/19266016 http://dx.doi.org/10.1371/journal.pcbi.1000302 Text en Elliott, Theunissen. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Elliott, Taffeta M.
Theunissen, Frédéric E.
The Modulation Transfer Function for Speech Intelligibility
title The Modulation Transfer Function for Speech Intelligibility
title_full The Modulation Transfer Function for Speech Intelligibility
title_fullStr The Modulation Transfer Function for Speech Intelligibility
title_full_unstemmed The Modulation Transfer Function for Speech Intelligibility
title_short The Modulation Transfer Function for Speech Intelligibility
title_sort modulation transfer function for speech intelligibility
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2639724/
https://www.ncbi.nlm.nih.gov/pubmed/19266016
http://dx.doi.org/10.1371/journal.pcbi.1000302
work_keys_str_mv AT elliotttaffetam themodulationtransferfunctionforspeechintelligibility
AT theunissenfrederice themodulationtransferfunctionforspeechintelligibility
AT elliotttaffetam modulationtransferfunctionforspeechintelligibility
AT theunissenfrederice modulationtransferfunctionforspeechintelligibility