Cargando…
Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accur...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915112/ https://www.ncbi.nlm.nih.gov/pubmed/35270897 http://dx.doi.org/10.3390/s22051751 |
_version_ | 1784667936860930048 |
---|---|
author | Kopczynski, Bartosz Niebudek-Bogusz, Ewa Pietruszewska, Wioletta Strumillo, Pawel |
author_facet | Kopczynski, Bartosz Niebudek-Bogusz, Ewa Pietruszewska, Wioletta Strumillo, Pawel |
author_sort | Kopczynski, Bartosz |
collection | PubMed |
description | Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accurate detection of vocal folds edges. In our fully automatic method, we combine video and acoustic data that are synchronously recorded during the laryngeal endoscopy. We show that the image segmentation algorithm of the glottal area can be optimized by matching the Fourier spectra of the pre-processed video and the spectra of the acoustic recording during the phonation of sustained vowel /i:/. We verify our method on a set of LHSV recordings taken from subjects with normophonic voice and patients with voice disorders due to glottal insufficiency. We show that the computed geometric indices of the glottal area make it possible to discriminate between normal and pathologic voices. The median of the Open Quotient and Minimal Relative Glottal Area values for healthy subjects were 0.69 and 0.06, respectively, while for dysphonic subjects were 1 and 0.35, respectively. We also validate these results using independent phoniatrician experts. |
format | Online Article Text |
id | pubmed-8915112 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-89151122022-03-12 Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings Kopczynski, Bartosz Niebudek-Bogusz, Ewa Pietruszewska, Wioletta Strumillo, Pawel Sensors (Basel) Article Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accurate detection of vocal folds edges. In our fully automatic method, we combine video and acoustic data that are synchronously recorded during the laryngeal endoscopy. We show that the image segmentation algorithm of the glottal area can be optimized by matching the Fourier spectra of the pre-processed video and the spectra of the acoustic recording during the phonation of sustained vowel /i:/. We verify our method on a set of LHSV recordings taken from subjects with normophonic voice and patients with voice disorders due to glottal insufficiency. We show that the computed geometric indices of the glottal area make it possible to discriminate between normal and pathologic voices. The median of the Open Quotient and Minimal Relative Glottal Area values for healthy subjects were 0.69 and 0.06, respectively, while for dysphonic subjects were 1 and 0.35, respectively. We also validate these results using independent phoniatrician experts. MDPI 2022-02-23 /pmc/articles/PMC8915112/ /pubmed/35270897 http://dx.doi.org/10.3390/s22051751 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kopczynski, Bartosz Niebudek-Bogusz, Ewa Pietruszewska, Wioletta Strumillo, Pawel Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings |
title | Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings |
title_full | Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings |
title_fullStr | Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings |
title_full_unstemmed | Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings |
title_short | Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings |
title_sort | segmentation of glottal images from high-speed videoendoscopy optimized by synchronous acoustic recordings |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915112/ https://www.ncbi.nlm.nih.gov/pubmed/35270897 http://dx.doi.org/10.3390/s22051751 |
work_keys_str_mv | AT kopczynskibartosz segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings AT niebudekboguszewa segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings AT pietruszewskawioletta segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings AT strumillopawel segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings |