Detrending the Waveforms of Steady-State Vowels

Steady-state vowels are vowels that are uttered with a momentarily fixed vocal tract configuration and with steady vibration of the vocal folds. In this steady-state, the vowel waveform appears as a quasi-periodic string of elementary units called pitch periods. Humans perceive this quasi-periodic r...

Bibliographic Details
Main Authors: Van Soom, Marnix, de Boer, Bart
Format: Online Article Text
Language: English
Published: MDPI 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516789/
https://www.ncbi.nlm.nih.gov/pubmed/33286105
http://dx.doi.org/10.3390/e22030331
author Van Soom, Marnix
de Boer, Bart
collection PubMed
description Steady-state vowels are vowels uttered with a momentarily fixed vocal tract configuration and with steady vibration of the vocal folds. In this steady state, the vowel waveform appears as a quasi-periodic string of elementary units called pitch periods. Humans perceive this quasi-periodic regularity as a definite pitch. Likewise, so-called pitch-synchronous methods exploit this regularity by using the duration of the pitch periods as a natural time scale for their analysis. In this work, we present a simple pitch-synchronous method for estimating formants using a Bayesian approach. It slightly generalizes the basic approach of modeling the pitch periods as a superposition of decaying sinusoids, one for each vowel formant, by explicitly taking into account the additional low-frequency content in the waveform, which arises not from formants but from the glottal pulse. We model this low-frequency content in the time domain as a polynomial trend function that is added to the decaying sinusoids. The problem then reduces to a rather familiar one in macroeconomics: estimate the cycles (our decaying sinusoids) independently from the trend (our polynomial trend function); in other words, detrend the waveforms of steady-state vowels. We show how to do this efficiently.
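The model structure described in the abstract (a polynomial trend added to one decaying sinusoid per formant) can be illustrated with a minimal sketch. This is not the authors' Bayesian method: it assumes the formant frequencies and bandwidths are already known and fits only the linear amplitudes by ordinary least squares. The function names, the decay convention exp(-pi * B * t), the normalized time axis for the trend, and all numeric values are assumptions chosen for illustration.

```python
import numpy as np

def design_matrix(t, freqs_hz, bandwidths_hz, poly_order):
    """Columns: polynomial trend terms, then a cos/sin pair per decaying sinusoid."""
    tau = t / t[-1]  # normalized time keeps the polynomial columns well scaled
    cols = [tau**k for k in range(poly_order + 1)]
    for f, b in zip(freqs_hz, bandwidths_hz):
        decay = np.exp(-np.pi * b * t)  # assumed convention: bandwidth b in Hz
        cols.append(decay * np.cos(2 * np.pi * f * t))
        cols.append(decay * np.sin(2 * np.pi * f * t))
    return np.column_stack(cols)

def detrend_pitch_period(y, t, freqs_hz, bandwidths_hz, poly_order=2):
    """Split one pitch period into a polynomial trend and decaying-sinusoid cycles."""
    G = design_matrix(t, freqs_hz, bandwidths_hz, poly_order)
    coef, *_ = np.linalg.lstsq(G, y, rcond=None)
    n_trend = poly_order + 1
    trend = G[:, :n_trend] @ coef[:n_trend]
    cycles = G[:, n_trend:] @ coef[n_trend:]
    return trend, cycles

# Synthetic, noise-free pitch period: 10 ms sampled at 16 kHz (hypothetical values).
fs = 16000
t = np.arange(160) / fs
tau = t / t[-1]
freqs_hz = [500.0, 1500.0]       # assumed formant frequencies
bandwidths_hz = [80.0, 120.0]    # assumed formant bandwidths

true_trend = 0.5 - 1.0 * tau + 0.8 * tau**2
true_cycles = (np.exp(-np.pi * 80.0 * t) * np.cos(2 * np.pi * 500.0 * t)
               + 0.5 * np.exp(-np.pi * 120.0 * t) * np.sin(2 * np.pi * 1500.0 * t))
y = true_trend + true_cycles

trend_est, cycles_est = detrend_pitch_period(y, t, freqs_hz, bandwidths_hz)
```

Because the synthetic signal lies exactly in the span of the design matrix, the least-squares fit separates trend and cycles up to numerical precision. With real speech the frequencies and bandwidths are unknown, which is where an estimation procedure such as the Bayesian approach mentioned in the abstract would come in.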
format Online
Article
Text
id pubmed-7516789
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-7516789 2020-11-09 Detrending the Waveforms of Steady-State Vowels Van Soom, Marnix de Boer, Bart Entropy (Basel) Article MDPI 2020-03-13 /pmc/articles/PMC7516789/ /pubmed/33286105 http://dx.doi.org/10.3390/e22030331 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title Detrending the Waveforms of Steady-State Vowels
topic Article