Cargando…

Modeling Consonant-Vowel Coarticulation for Articulatory Speech Synthesis

A central challenge for articulatory speech synthesis is the simulation of realistic articulatory movements, which is critical for the generation of highly natural and intelligible speech. This includes modeling coarticulation, i.e., the context-dependent variation of the articulatory and acoustic r...

Descripción completa

Detalles Bibliográficos
Autor principal: Birkholz, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3628899/
https://www.ncbi.nlm.nih.gov/pubmed/23613734
http://dx.doi.org/10.1371/journal.pone.0060603
_version_ 1782266483144916992
author Birkholz, Peter
author_facet Birkholz, Peter
author_sort Birkholz, Peter
collection PubMed
description A central challenge for articulatory speech synthesis is the simulation of realistic articulatory movements, which is critical for the generation of highly natural and intelligible speech. This includes modeling coarticulation, i.e., the context-dependent variation of the articulatory and acoustic realization of phonemes, especially of consonants. Here we propose a method to simulate the context-sensitive articulation of consonants in consonant-vowel syllables. To achieve this, the vocal tract target shape of a consonant in the context of a given vowel is derived as the weighted average of three measured and acoustically-optimized reference vocal tract shapes for that consonant in the context of the corner vowels /a/, /i/, and /u/. The weights are determined by mapping the target shape of the given context vowel into the vowel subspace spanned by the corner vowels. The model was applied for the synthesis of consonant-vowel syllables with the consonants /b/, /d/, /g/, /l/, /r/, /m/, /n/ in all combinations with the eight long German vowels. In a perception test, the mean recognition rate for the consonants in the isolated syllables was 82.4%. This demonstrates the potential of the approach for highly intelligible articulatory speech synthesis.
format Online
Article
Text
id pubmed-3628899
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36288992013-04-23 Modeling Consonant-Vowel Coarticulation for Articulatory Speech Synthesis Birkholz, Peter PLoS One Research Article A central challenge for articulatory speech synthesis is the simulation of realistic articulatory movements, which is critical for the generation of highly natural and intelligible speech. This includes modeling coarticulation, i.e., the context-dependent variation of the articulatory and acoustic realization of phonemes, especially of consonants. Here we propose a method to simulate the context-sensitive articulation of consonants in consonant-vowel syllables. To achieve this, the vocal tract target shape of a consonant in the context of a given vowel is derived as the weighted average of three measured and acoustically-optimized reference vocal tract shapes for that consonant in the context of the corner vowels /a/, /i/, and /u/. The weights are determined by mapping the target shape of the given context vowel into the vowel subspace spanned by the corner vowels. The model was applied for the synthesis of consonant-vowel syllables with the consonants /b/, /d/, /g/, /l/, /r/, /m/, /n/ in all combinations with the eight long German vowels. In a perception test, the mean recognition rate for the consonants in the isolated syllables was 82.4%. This demonstrates the potential of the approach for highly intelligible articulatory speech synthesis. Public Library of Science 2013-04-16 /pmc/articles/PMC3628899/ /pubmed/23613734 http://dx.doi.org/10.1371/journal.pone.0060603 Text en © 2013 Peter Birkholz http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Birkholz, Peter
Modeling Consonant-Vowel Coarticulation for Articulatory Speech Synthesis
title Modeling Consonant-Vowel Coarticulation for Articulatory Speech Synthesis
title_full Modeling Consonant-Vowel Coarticulation for Articulatory Speech Synthesis
title_fullStr Modeling Consonant-Vowel Coarticulation for Articulatory Speech Synthesis
title_full_unstemmed Modeling Consonant-Vowel Coarticulation for Articulatory Speech Synthesis
title_short Modeling Consonant-Vowel Coarticulation for Articulatory Speech Synthesis
title_sort modeling consonant-vowel coarticulation for articulatory speech synthesis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3628899/
https://www.ncbi.nlm.nih.gov/pubmed/23613734
http://dx.doi.org/10.1371/journal.pone.0060603
work_keys_str_mv AT birkholzpeter modelingconsonantvowelcoarticulationforarticulatoryspeechsynthesis