Cargando…

Application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies

Temporal parameters during swallowing are analyzed for objective and quantitative evaluation of videofluoroscopic swallowing studies (VFSS). Manual analysis by clinicians is time-consuming, complicated and prone to human error during interpretation; therefore, automated analysis using deep learning...

Descripción completa

Detalles Bibliográficos
Autores principales: Jeong, Seong Yun, Kim, Jeong Min, Park, Ji Eun, Baek, Seung Jun, Yang, Seung Nam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10579219/
https://www.ncbi.nlm.nih.gov/pubmed/37845272
http://dx.doi.org/10.1038/s41598-023-44802-3
_version_ 1785121678132510720
author Jeong, Seong Yun
Kim, Jeong Min
Park, Ji Eun
Baek, Seung Jun
Yang, Seung Nam
author_facet Jeong, Seong Yun
Kim, Jeong Min
Park, Ji Eun
Baek, Seung Jun
Yang, Seung Nam
author_sort Jeong, Seong Yun
collection PubMed
description Temporal parameters during swallowing are analyzed for objective and quantitative evaluation of videofluoroscopic swallowing studies (VFSS). Manual analysis by clinicians is time-consuming, complicated and prone to human error during interpretation; therefore, automated analysis using deep learning has been attempted. We aimed to develop a model for the automatic measurement of various temporal parameters of swallowing using deep learning. Overall, 547 VFSS video clips were included. Seven temporal parameters were manually measured by two physiatrists as ground-truth data: oral phase duration, pharyngeal delay time, pharyngeal response time, pharyngeal transit time, laryngeal vestibule closure reaction time, laryngeal vestibule closure duration, and upper esophageal sphincter opening duration. ResNet3D was selected as the base model for the deep learning of temporal parameters. The performances of ResNet3D variants were compared with those of the VGG and I3D models used previously. The average accuracy of the proposed ResNet3D variants was from 0.901 to 0.981. The F1 scores and average precision were 0.794 to 0.941 and 0.714 to 0.899, respectively. Compared to the VGG and I3D models, our model achieved the best results in terms of accuracy, F1 score, and average precision values. Through the clinical application of this automatic model, temporal analysis of VFSS will be easier and more accurate.
format Online
Article
Text
id pubmed-10579219
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-105792192023-10-18 Application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies Jeong, Seong Yun Kim, Jeong Min Park, Ji Eun Baek, Seung Jun Yang, Seung Nam Sci Rep Article Temporal parameters during swallowing are analyzed for objective and quantitative evaluation of videofluoroscopic swallowing studies (VFSS). Manual analysis by clinicians is time-consuming, complicated and prone to human error during interpretation; therefore, automated analysis using deep learning has been attempted. We aimed to develop a model for the automatic measurement of various temporal parameters of swallowing using deep learning. Overall, 547 VFSS video clips were included. Seven temporal parameters were manually measured by two physiatrists as ground-truth data: oral phase duration, pharyngeal delay time, pharyngeal response time, pharyngeal transit time, laryngeal vestibule closure reaction time, laryngeal vestibule closure duration, and upper esophageal sphincter opening duration. ResNet3D was selected as the base model for the deep learning of temporal parameters. The performances of ResNet3D variants were compared with those of the VGG and I3D models used previously. The average accuracy of the proposed ResNet3D variants was from 0.901 to 0.981. The F1 scores and average precision were 0.794 to 0.941 and 0.714 to 0.899, respectively. Compared to the VGG and I3D models, our model achieved the best results in terms of accuracy, F1 score, and average precision values. Through the clinical application of this automatic model, temporal analysis of VFSS will be easier and more accurate. Nature Publishing Group UK 2023-10-16 /pmc/articles/PMC10579219/ /pubmed/37845272 http://dx.doi.org/10.1038/s41598-023-44802-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Jeong, Seong Yun
Kim, Jeong Min
Park, Ji Eun
Baek, Seung Jun
Yang, Seung Nam
Application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies
title Application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies
title_full Application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies
title_fullStr Application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies
title_full_unstemmed Application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies
title_short Application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies
title_sort application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10579219/
https://www.ncbi.nlm.nih.gov/pubmed/37845272
http://dx.doi.org/10.1038/s41598-023-44802-3
work_keys_str_mv AT jeongseongyun applicationofdeeplearningtechnologyfortemporalanalysisofvideofluoroscopicswallowingstudies
AT kimjeongmin applicationofdeeplearningtechnologyfortemporalanalysisofvideofluoroscopicswallowingstudies
AT parkjieun applicationofdeeplearningtechnologyfortemporalanalysisofvideofluoroscopicswallowingstudies
AT baekseungjun applicationofdeeplearningtechnologyfortemporalanalysisofvideofluoroscopicswallowingstudies
AT yangseungnam applicationofdeeplearningtechnologyfortemporalanalysisofvideofluoroscopicswallowingstudies