Cargando…

Sigma-Lognormal Modeling of Speech

Human movement studies and analyses have been fundamental in many scientific domains, ranging from neuroscience to education, pattern recognition to robotics, health care to sports, and beyond. Previous speech motor models were proposed to understand how speech movement is produced and how the resul...

Descripción completa

Detalles Bibliográficos
Autores principales: Carmona-Duarte, C., Ferrer, M. A., Plamondon, R., Gómez-Rodellar, A., Gómez-Vilda, P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7943521/
https://www.ncbi.nlm.nih.gov/pubmed/33786072
http://dx.doi.org/10.1007/s12559-020-09803-8
_version_ 1783662511663349760
author Carmona-Duarte, C.
Ferrer, M. A.
Plamondon, R.
Gómez-Rodellar, A.
Gómez-Vilda, P.
author_facet Carmona-Duarte, C.
Ferrer, M. A.
Plamondon, R.
Gómez-Rodellar, A.
Gómez-Vilda, P.
author_sort Carmona-Duarte, C.
collection PubMed
description Human movement studies and analyses have been fundamental in many scientific domains, ranging from neuroscience to education, pattern recognition to robotics, health care to sports, and beyond. Previous speech motor models were proposed to understand how speech movement is produced and how the resulting speech varies when some parameters are changed. However, the inverse approach, in which the muscular response parameters and the subject’s age are derived from real continuous speech, is not possible with such models. Instead, in the handwriting field, the kinematic theory of rapid human movements and its associated Sigma-lognormal model have been applied successfully to obtain the muscular response parameters. This work presents a speech kinematics-based model that can be used to study, analyze, and reconstruct complex speech kinematics in a simplified manner. A method based on the kinematic theory of rapid human movements and its associated Sigma-lognormal model are applied to describe and to parameterize the asymptotic impulse response of the neuromuscular networks involved in speech as a response to a neuromotor command. The method used to carry out transformations from formants to a movement observation is also presented. Experiments carried out with the (English) VTR-TIMIT database and the (German) Saarbrucken Voice Database, including people of different ages, with and without laryngeal pathologies, corroborate the link between the extracted parameters and aging, on the one hand, and the proportion between the first and second formants required in applying the kinematic theory of rapid human movements, on the other. The results should drive innovative developments in the modeling and understanding of speech kinematics.
format Online
Article
Text
id pubmed-7943521
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-79435212021-03-28 Sigma-Lognormal Modeling of Speech Carmona-Duarte, C. Ferrer, M. A. Plamondon, R. Gómez-Rodellar, A. Gómez-Vilda, P. Cognit Comput Article Human movement studies and analyses have been fundamental in many scientific domains, ranging from neuroscience to education, pattern recognition to robotics, health care to sports, and beyond. Previous speech motor models were proposed to understand how speech movement is produced and how the resulting speech varies when some parameters are changed. However, the inverse approach, in which the muscular response parameters and the subject’s age are derived from real continuous speech, is not possible with such models. Instead, in the handwriting field, the kinematic theory of rapid human movements and its associated Sigma-lognormal model have been applied successfully to obtain the muscular response parameters. This work presents a speech kinematics-based model that can be used to study, analyze, and reconstruct complex speech kinematics in a simplified manner. A method based on the kinematic theory of rapid human movements and its associated Sigma-lognormal model are applied to describe and to parameterize the asymptotic impulse response of the neuromuscular networks involved in speech as a response to a neuromotor command. The method used to carry out transformations from formants to a movement observation is also presented. Experiments carried out with the (English) VTR-TIMIT database and the (German) Saarbrucken Voice Database, including people of different ages, with and without laryngeal pathologies, corroborate the link between the extracted parameters and aging, on the one hand, and the proportion between the first and second formants required in applying the kinematic theory of rapid human movements, on the other. The results should drive innovative developments in the modeling and understanding of speech kinematics. Springer US 2021-02-07 2021 /pmc/articles/PMC7943521/ /pubmed/33786072 http://dx.doi.org/10.1007/s12559-020-09803-8 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Carmona-Duarte, C.
Ferrer, M. A.
Plamondon, R.
Gómez-Rodellar, A.
Gómez-Vilda, P.
Sigma-Lognormal Modeling of Speech
title Sigma-Lognormal Modeling of Speech
title_full Sigma-Lognormal Modeling of Speech
title_fullStr Sigma-Lognormal Modeling of Speech
title_full_unstemmed Sigma-Lognormal Modeling of Speech
title_short Sigma-Lognormal Modeling of Speech
title_sort sigma-lognormal modeling of speech
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7943521/
https://www.ncbi.nlm.nih.gov/pubmed/33786072
http://dx.doi.org/10.1007/s12559-020-09803-8
work_keys_str_mv AT carmonaduartec sigmalognormalmodelingofspeech
AT ferrerma sigmalognormalmodelingofspeech
AT plamondonr sigmalognormalmodelingofspeech
AT gomezrodellara sigmalognormalmodelingofspeech
AT gomezvildap sigmalognormalmodelingofspeech