Cargando…

Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech

MRI is the gold standard modality for speech imaging. However, it remains relatively slow, which complicates imaging of fast movements. Thus, an MRI of the vocal tract is often performed in 2D. While 3D MRI provides more information, the quality of such images is often insufficient. The goal of this...

Descripción completa

Detalles Bibliográficos
Autores principales: Isaieva, Karyna, Odille, Freddy, Laprie, Yves, Drouot, Guillaume, Felblinger, Jacques, Vuissoz, Pierre-André
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10607793/
https://www.ncbi.nlm.nih.gov/pubmed/37888339
http://dx.doi.org/10.3390/jimaging9100233
_version_ 1785127626762878976
author Isaieva, Karyna
Odille, Freddy
Laprie, Yves
Drouot, Guillaume
Felblinger, Jacques
Vuissoz, Pierre-André
author_facet Isaieva, Karyna
Odille, Freddy
Laprie, Yves
Drouot, Guillaume
Felblinger, Jacques
Vuissoz, Pierre-André
author_sort Isaieva, Karyna
collection PubMed
description MRI is the gold standard modality for speech imaging. However, it remains relatively slow, which complicates imaging of fast movements. Thus, an MRI of the vocal tract is often performed in 2D. While 3D MRI provides more information, the quality of such images is often insufficient. The goal of this study was to test the applicability of super-resolution algorithms for dynamic vocal tract MRI. In total, 25 sagittal slices of 8 mm with an in-plane resolution of 1.6 × 1.6 mm(2) were acquired consecutively using a highly-undersampled radial 2D FLASH sequence. The volunteers were reading a text in French with two different protocols. The slices were aligned using the simultaneously recorded sound. The super-resolution strategy was used to reconstruct 1.6 × 1.6 × 1.6 mm(3) isotropic volumes. The resulting images were less sharp than the native 2D images but demonstrated a higher signal-to-noise ratio. It was also shown that the super-resolution allows for eliminating inconsistencies leading to regular transitions between the slices. Additionally, it was demonstrated that using visual stimuli and shorter text fragments improves the inter-slice consistency and the super-resolved image sharpness. Therefore, with a correct speech task choice, the proposed method allows for the reconstruction of high-quality dynamic 3D volumes of the vocal tract during natural speech.
format Online
Article
Text
id pubmed-10607793
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106077932023-10-28 Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech Isaieva, Karyna Odille, Freddy Laprie, Yves Drouot, Guillaume Felblinger, Jacques Vuissoz, Pierre-André J Imaging Article MRI is the gold standard modality for speech imaging. However, it remains relatively slow, which complicates imaging of fast movements. Thus, an MRI of the vocal tract is often performed in 2D. While 3D MRI provides more information, the quality of such images is often insufficient. The goal of this study was to test the applicability of super-resolution algorithms for dynamic vocal tract MRI. In total, 25 sagittal slices of 8 mm with an in-plane resolution of 1.6 × 1.6 mm(2) were acquired consecutively using a highly-undersampled radial 2D FLASH sequence. The volunteers were reading a text in French with two different protocols. The slices were aligned using the simultaneously recorded sound. The super-resolution strategy was used to reconstruct 1.6 × 1.6 × 1.6 mm(3) isotropic volumes. The resulting images were less sharp than the native 2D images but demonstrated a higher signal-to-noise ratio. It was also shown that the super-resolution allows for eliminating inconsistencies leading to regular transitions between the slices. Additionally, it was demonstrated that using visual stimuli and shorter text fragments improves the inter-slice consistency and the super-resolved image sharpness. Therefore, with a correct speech task choice, the proposed method allows for the reconstruction of high-quality dynamic 3D volumes of the vocal tract during natural speech. MDPI 2023-10-20 /pmc/articles/PMC10607793/ /pubmed/37888339 http://dx.doi.org/10.3390/jimaging9100233 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Isaieva, Karyna
Odille, Freddy
Laprie, Yves
Drouot, Guillaume
Felblinger, Jacques
Vuissoz, Pierre-André
Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
title Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
title_full Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
title_fullStr Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
title_full_unstemmed Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
title_short Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
title_sort super-resolved dynamic 3d reconstruction of the vocal tract during natural speech
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10607793/
https://www.ncbi.nlm.nih.gov/pubmed/37888339
http://dx.doi.org/10.3390/jimaging9100233
work_keys_str_mv AT isaievakaryna superresolveddynamic3dreconstructionofthevocaltractduringnaturalspeech
AT odillefreddy superresolveddynamic3dreconstructionofthevocaltractduringnaturalspeech
AT laprieyves superresolveddynamic3dreconstructionofthevocaltractduringnaturalspeech
AT drouotguillaume superresolveddynamic3dreconstructionofthevocaltractduringnaturalspeech
AT felblingerjacques superresolveddynamic3dreconstructionofthevocaltractduringnaturalspeech
AT vuissozpierreandre superresolveddynamic3dreconstructionofthevocaltractduringnaturalspeech