Cargando…

Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings

Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accur...

Descripción completa

Detalles Bibliográficos
Autores principales: Kopczynski, Bartosz, Niebudek-Bogusz, Ewa, Pietruszewska, Wioletta, Strumillo, Pawel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915112/
https://www.ncbi.nlm.nih.gov/pubmed/35270897
http://dx.doi.org/10.3390/s22051751
_version_ 1784667936860930048
author Kopczynski, Bartosz
Niebudek-Bogusz, Ewa
Pietruszewska, Wioletta
Strumillo, Pawel
author_facet Kopczynski, Bartosz
Niebudek-Bogusz, Ewa
Pietruszewska, Wioletta
Strumillo, Pawel
author_sort Kopczynski, Bartosz
collection PubMed
description Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accurate detection of vocal folds edges. In our fully automatic method, we combine video and acoustic data that are synchronously recorded during the laryngeal endoscopy. We show that the image segmentation algorithm of the glottal area can be optimized by matching the Fourier spectra of the pre-processed video and the spectra of the acoustic recording during the phonation of sustained vowel /i:/. We verify our method on a set of LHSV recordings taken from subjects with normophonic voice and patients with voice disorders due to glottal insufficiency. We show that the computed geometric indices of the glottal area make it possible to discriminate between normal and pathologic voices. The median of the Open Quotient and Minimal Relative Glottal Area values for healthy subjects were 0.69 and 0.06, respectively, while for dysphonic subjects were 1 and 0.35, respectively. We also validate these results using independent phoniatrician experts.
format Online
Article
Text
id pubmed-8915112
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89151122022-03-12 Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings Kopczynski, Bartosz Niebudek-Bogusz, Ewa Pietruszewska, Wioletta Strumillo, Pawel Sensors (Basel) Article Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accurate detection of vocal folds edges. In our fully automatic method, we combine video and acoustic data that are synchronously recorded during the laryngeal endoscopy. We show that the image segmentation algorithm of the glottal area can be optimized by matching the Fourier spectra of the pre-processed video and the spectra of the acoustic recording during the phonation of sustained vowel /i:/. We verify our method on a set of LHSV recordings taken from subjects with normophonic voice and patients with voice disorders due to glottal insufficiency. We show that the computed geometric indices of the glottal area make it possible to discriminate between normal and pathologic voices. The median of the Open Quotient and Minimal Relative Glottal Area values for healthy subjects were 0.69 and 0.06, respectively, while for dysphonic subjects were 1 and 0.35, respectively. We also validate these results using independent phoniatrician experts. MDPI 2022-02-23 /pmc/articles/PMC8915112/ /pubmed/35270897 http://dx.doi.org/10.3390/s22051751 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kopczynski, Bartosz
Niebudek-Bogusz, Ewa
Pietruszewska, Wioletta
Strumillo, Pawel
Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_full Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_fullStr Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_full_unstemmed Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_short Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_sort segmentation of glottal images from high-speed videoendoscopy optimized by synchronous acoustic recordings
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915112/
https://www.ncbi.nlm.nih.gov/pubmed/35270897
http://dx.doi.org/10.3390/s22051751
work_keys_str_mv AT kopczynskibartosz segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings
AT niebudekboguszewa segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings
AT pietruszewskawioletta segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings
AT strumillopawel segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings