Cargando…

3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant–Vowel Production from 2D Real Time MRI

In this work, we address the problem of creating a 3D dynamic atlas of the vocal tract that captures the dynamics of the articulators in all three dimensions in order to create a global speaker model independent of speaker-specific characteristics. The core steps of the proposed method are the tempo...

Descripción completa

Detalles Bibliográficos
Autores principales: Douros, Ioannis K., Xie, Yu, Dourou, Chrysanthi, Isaieva, Karyna, Vuissoz, Pierre-André, Felblinger, Jacques, Laprie, Yves
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9504642/
https://www.ncbi.nlm.nih.gov/pubmed/36135393
http://dx.doi.org/10.3390/jimaging8090227
_version_ 1784796267971346432
author Douros, Ioannis K.
Xie, Yu
Dourou, Chrysanthi
Isaieva, Karyna
Vuissoz, Pierre-André
Felblinger, Jacques
Laprie, Yves
author_facet Douros, Ioannis K.
Xie, Yu
Dourou, Chrysanthi
Isaieva, Karyna
Vuissoz, Pierre-André
Felblinger, Jacques
Laprie, Yves
author_sort Douros, Ioannis K.
collection PubMed
description In this work, we address the problem of creating a 3D dynamic atlas of the vocal tract that captures the dynamics of the articulators in all three dimensions in order to create a global speaker model independent of speaker-specific characteristics. The core steps of the proposed method are the temporal alignment of the real-time MR images acquired in several sagittal planes and their combination with adaptive kernel regression. As a preprocessing step, a reference space was created to be used in order to remove anatomical information of the speakers and keep only the variability in speech production for the construction of the atlas. The adaptive kernel regression makes the choice of atlas time points independently of the time points of the frames that are used as an input for the construction. The evaluation of this atlas construction method was made by mapping two new speakers to the atlas and by checking how similar the resulting mapped images are. The use of the atlas helps in reducing subject variability. The results show that the use of the proposed atlas can capture the dynamic behavior of the articulators and is able to generalize the speech production process by creating a universal-speaker reference space.
format Online
Article
Text
id pubmed-9504642
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95046422022-09-24 3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant–Vowel Production from 2D Real Time MRI Douros, Ioannis K. Xie, Yu Dourou, Chrysanthi Isaieva, Karyna Vuissoz, Pierre-André Felblinger, Jacques Laprie, Yves J Imaging Article In this work, we address the problem of creating a 3D dynamic atlas of the vocal tract that captures the dynamics of the articulators in all three dimensions in order to create a global speaker model independent of speaker-specific characteristics. The core steps of the proposed method are the temporal alignment of the real-time MR images acquired in several sagittal planes and their combination with adaptive kernel regression. As a preprocessing step, a reference space was created to be used in order to remove anatomical information of the speakers and keep only the variability in speech production for the construction of the atlas. The adaptive kernel regression makes the choice of atlas time points independently of the time points of the frames that are used as an input for the construction. The evaluation of this atlas construction method was made by mapping two new speakers to the atlas and by checking how similar the resulting mapped images are. The use of the atlas helps in reducing subject variability. The results show that the use of the proposed atlas can capture the dynamic behavior of the articulators and is able to generalize the speech production process by creating a universal-speaker reference space. MDPI 2022-08-25 /pmc/articles/PMC9504642/ /pubmed/36135393 http://dx.doi.org/10.3390/jimaging8090227 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Douros, Ioannis K.
Xie, Yu
Dourou, Chrysanthi
Isaieva, Karyna
Vuissoz, Pierre-André
Felblinger, Jacques
Laprie, Yves
3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant–Vowel Production from 2D Real Time MRI
title 3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant–Vowel Production from 2D Real Time MRI
title_full 3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant–Vowel Production from 2D Real Time MRI
title_fullStr 3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant–Vowel Production from 2D Real Time MRI
title_full_unstemmed 3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant–Vowel Production from 2D Real Time MRI
title_short 3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant–Vowel Production from 2D Real Time MRI
title_sort 3d dynamic spatiotemporal atlas of the vocal tract during consonant–vowel production from 2d real time mri
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9504642/
https://www.ncbi.nlm.nih.gov/pubmed/36135393
http://dx.doi.org/10.3390/jimaging8090227
work_keys_str_mv AT dourosioannisk 3ddynamicspatiotemporalatlasofthevocaltractduringconsonantvowelproductionfrom2drealtimemri
AT xieyu 3ddynamicspatiotemporalatlasofthevocaltractduringconsonantvowelproductionfrom2drealtimemri
AT dourouchrysanthi 3ddynamicspatiotemporalatlasofthevocaltractduringconsonantvowelproductionfrom2drealtimemri
AT isaievakaryna 3ddynamicspatiotemporalatlasofthevocaltractduringconsonantvowelproductionfrom2drealtimemri
AT vuissozpierreandre 3ddynamicspatiotemporalatlasofthevocaltractduringconsonantvowelproductionfrom2drealtimemri
AT felblingerjacques 3ddynamicspatiotemporalatlasofthevocaltractduringconsonantvowelproductionfrom2drealtimemri
AT laprieyves 3ddynamicspatiotemporalatlasofthevocaltractduringconsonantvowelproductionfrom2drealtimemri