Cargando…

On Short-Time Estimation of Vocal Tract Length from Formant Frequencies

Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalizat...

Descripción completa

Detalles Bibliográficos
Autores principales: Lammert, Adam C., Narayanan, Shrikanth S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4503663/
https://www.ncbi.nlm.nih.gov/pubmed/26177102
http://dx.doi.org/10.1371/journal.pone.0132193
_version_ 1782381335782883328
author Lammert, Adam C.
Narayanan, Shrikanth S.
author_facet Lammert, Adam C.
Narayanan, Shrikanth S.
author_sort Lammert, Adam C.
collection PubMed
description Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework for considering estimation methods is developed from the basic principles of vocal tract acoustics, and an estimation method is proposed that follows naturally from this framework. The proposed method is evaluated using acoustic characteristics of simulated vocal tracts ranging from 14 to 19 cm in length, as well as real-time magnetic resonance imaging data with synchronous audio from five speakers whose vocal tracts range from 14.5 to 18.0 cm in length. Evaluations show improvements in accuracy over previously proposed methods, with 0.631 and 1.277 cm root mean square error on simulated and human speech data, respectively. Empirical results show that the effectiveness of the proposed method is based on emphasizing higher formant frequencies, which seem less affected by speech articulation. Theoretical predictions of formant sensitivity reinforce this empirical finding. Moreover, theoretical insights are explained regarding the reason for differences in formant sensitivity.
format Online
Article
Text
id pubmed-4503663
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45036632015-07-17 On Short-Time Estimation of Vocal Tract Length from Formant Frequencies Lammert, Adam C. Narayanan, Shrikanth S. PLoS One Research Article Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework for considering estimation methods is developed from the basic principles of vocal tract acoustics, and an estimation method is proposed that follows naturally from this framework. The proposed method is evaluated using acoustic characteristics of simulated vocal tracts ranging from 14 to 19 cm in length, as well as real-time magnetic resonance imaging data with synchronous audio from five speakers whose vocal tracts range from 14.5 to 18.0 cm in length. Evaluations show improvements in accuracy over previously proposed methods, with 0.631 and 1.277 cm root mean square error on simulated and human speech data, respectively. Empirical results show that the effectiveness of the proposed method is based on emphasizing higher formant frequencies, which seem less affected by speech articulation. Theoretical predictions of formant sensitivity reinforce this empirical finding. Moreover, theoretical insights are explained regarding the reason for differences in formant sensitivity. Public Library of Science 2015-07-15 /pmc/articles/PMC4503663/ /pubmed/26177102 http://dx.doi.org/10.1371/journal.pone.0132193 Text en © 2015 Lammert, Narayanan http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lammert, Adam C.
Narayanan, Shrikanth S.
On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
title On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
title_full On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
title_fullStr On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
title_full_unstemmed On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
title_short On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
title_sort on short-time estimation of vocal tract length from formant frequencies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4503663/
https://www.ncbi.nlm.nih.gov/pubmed/26177102
http://dx.doi.org/10.1371/journal.pone.0132193
work_keys_str_mv AT lammertadamc onshorttimeestimationofvocaltractlengthfromformantfrequencies
AT narayananshrikanths onshorttimeestimationofvocaltractlengthfromformantfrequencies