Cargando…

An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech

The peripheral auditory system functions like a frequency analyser, often modelled as a bank of non-overlapping band-pass filters called critical bands; 20 bands are necessary for simulating frequency resolution of the ear within an ordinary frequency range of speech (up to 7,000 Hz). A far smaller...

Descripción completa

Detalles Bibliográficos
Autores principales: Ueda, Kazuo, Nakajima, Yoshitaka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5309770/
https://www.ncbi.nlm.nih.gov/pubmed/28198405
http://dx.doi.org/10.1038/srep42468
_version_ 1782507763784482816
author Ueda, Kazuo
Nakajima, Yoshitaka
author_facet Ueda, Kazuo
Nakajima, Yoshitaka
author_sort Ueda, Kazuo
collection PubMed
description The peripheral auditory system functions like a frequency analyser, often modelled as a bank of non-overlapping band-pass filters called critical bands; 20 bands are necessary for simulating frequency resolution of the ear within an ordinary frequency range of speech (up to 7,000 Hz). A far smaller number of filters seemed sufficient, however, to re-synthesise intelligible speech sentences with power fluctuations of the speech signals passing through them; nevertheless, the number and frequency ranges of the frequency bands for efficient speech communication are yet unknown. We derived four common frequency bands—covering approximately 50–540, 540–1,700, 1,700–3,300, and above 3,300 Hz—from factor analyses of spectral fluctuations in eight different spoken languages/dialects. The analyses robustly led to three factors common to all languages investigated—the low & mid-high factor related to the two separate frequency ranges of 50–540 and 1,700–3,300 Hz, the mid-low factor the range of 540–1,700 Hz, and the high factor the range above 3,300 Hz—in these different languages/dialects, suggesting a language universal.
format Online
Article
Text
id pubmed-5309770
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53097702017-02-22 An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech Ueda, Kazuo Nakajima, Yoshitaka Sci Rep Article The peripheral auditory system functions like a frequency analyser, often modelled as a bank of non-overlapping band-pass filters called critical bands; 20 bands are necessary for simulating frequency resolution of the ear within an ordinary frequency range of speech (up to 7,000 Hz). A far smaller number of filters seemed sufficient, however, to re-synthesise intelligible speech sentences with power fluctuations of the speech signals passing through them; nevertheless, the number and frequency ranges of the frequency bands for efficient speech communication are yet unknown. We derived four common frequency bands—covering approximately 50–540, 540–1,700, 1,700–3,300, and above 3,300 Hz—from factor analyses of spectral fluctuations in eight different spoken languages/dialects. The analyses robustly led to three factors common to all languages investigated—the low & mid-high factor related to the two separate frequency ranges of 50–540 and 1,700–3,300 Hz, the mid-low factor the range of 540–1,700 Hz, and the high factor the range above 3,300 Hz—in these different languages/dialects, suggesting a language universal. Nature Publishing Group 2017-02-15 /pmc/articles/PMC5309770/ /pubmed/28198405 http://dx.doi.org/10.1038/srep42468 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Ueda, Kazuo
Nakajima, Yoshitaka
An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech
title An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech
title_full An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech
title_fullStr An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech
title_full_unstemmed An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech
title_short An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech
title_sort acoustic key to eight languages/dialects: factor analyses of critical-band-filtered speech
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5309770/
https://www.ncbi.nlm.nih.gov/pubmed/28198405
http://dx.doi.org/10.1038/srep42468
work_keys_str_mv AT uedakazuo anacoustickeytoeightlanguagesdialectsfactoranalysesofcriticalbandfilteredspeech
AT nakajimayoshitaka anacoustickeytoeightlanguagesdialectsfactoranalysesofcriticalbandfilteredspeech
AT uedakazuo acoustickeytoeightlanguagesdialectsfactoranalysesofcriticalbandfilteredspeech
AT nakajimayoshitaka acoustickeytoeightlanguagesdialectsfactoranalysesofcriticalbandfilteredspeech