Cargando…
On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalizat...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4503663/ https://www.ncbi.nlm.nih.gov/pubmed/26177102 http://dx.doi.org/10.1371/journal.pone.0132193 |
_version_ | 1782381335782883328 |
---|---|
author | Lammert, Adam C. Narayanan, Shrikanth S. |
author_facet | Lammert, Adam C. Narayanan, Shrikanth S. |
author_sort | Lammert, Adam C. |
collection | PubMed |
description | Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework for considering estimation methods is developed from the basic principles of vocal tract acoustics, and an estimation method is proposed that follows naturally from this framework. The proposed method is evaluated using acoustic characteristics of simulated vocal tracts ranging from 14 to 19 cm in length, as well as real-time magnetic resonance imaging data with synchronous audio from five speakers whose vocal tracts range from 14.5 to 18.0 cm in length. Evaluations show improvements in accuracy over previously proposed methods, with 0.631 and 1.277 cm root mean square error on simulated and human speech data, respectively. Empirical results show that the effectiveness of the proposed method is based on emphasizing higher formant frequencies, which seem less affected by speech articulation. Theoretical predictions of formant sensitivity reinforce this empirical finding. Moreover, theoretical insights are explained regarding the reason for differences in formant sensitivity. |
format | Online Article Text |
id | pubmed-4503663 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-45036632015-07-17 On Short-Time Estimation of Vocal Tract Length from Formant Frequencies Lammert, Adam C. Narayanan, Shrikanth S. PLoS One Research Article Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework for considering estimation methods is developed from the basic principles of vocal tract acoustics, and an estimation method is proposed that follows naturally from this framework. The proposed method is evaluated using acoustic characteristics of simulated vocal tracts ranging from 14 to 19 cm in length, as well as real-time magnetic resonance imaging data with synchronous audio from five speakers whose vocal tracts range from 14.5 to 18.0 cm in length. Evaluations show improvements in accuracy over previously proposed methods, with 0.631 and 1.277 cm root mean square error on simulated and human speech data, respectively. Empirical results show that the effectiveness of the proposed method is based on emphasizing higher formant frequencies, which seem less affected by speech articulation. Theoretical predictions of formant sensitivity reinforce this empirical finding. Moreover, theoretical insights are explained regarding the reason for differences in formant sensitivity. Public Library of Science 2015-07-15 /pmc/articles/PMC4503663/ /pubmed/26177102 http://dx.doi.org/10.1371/journal.pone.0132193 Text en © 2015 Lammert, Narayanan http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Lammert, Adam C. Narayanan, Shrikanth S. On Short-Time Estimation of Vocal Tract Length from Formant Frequencies |
title | On Short-Time Estimation of Vocal Tract Length from Formant Frequencies |
title_full | On Short-Time Estimation of Vocal Tract Length from Formant Frequencies |
title_fullStr | On Short-Time Estimation of Vocal Tract Length from Formant Frequencies |
title_full_unstemmed | On Short-Time Estimation of Vocal Tract Length from Formant Frequencies |
title_short | On Short-Time Estimation of Vocal Tract Length from Formant Frequencies |
title_sort | on short-time estimation of vocal tract length from formant frequencies |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4503663/ https://www.ncbi.nlm.nih.gov/pubmed/26177102 http://dx.doi.org/10.1371/journal.pone.0132193 |
work_keys_str_mv | AT lammertadamc onshorttimeestimationofvocaltractlengthfromformantfrequencies AT narayananshrikanths onshorttimeestimationofvocaltractlengthfromformantfrequencies |