Cargando…

Re-Training of Convolutional Neural Networks for Glottis Segmentation in Endoscopic High-Speed Videos

Endoscopic high-speed video (HSV) systems for visualization and assessment of vocal fold dynamics in the larynx are diverse and technically advancing. To consider resulting “concepts shifts” for neural network (NN)-based image processing, re-training of already trained and used NNs is necessary to a...

Descripción completa

Detalles Bibliográficos
Autores principales: Döllinger, Michael, Schraut, Tobias, Henrich, Lea A., Chhetri, Dinesh, Echternach, Matthias, Johnson, Aaron M., Kunduk, Melda, Maryn, Youri, Patel, Rita R., Samlan, Robin, Semmler, Marion, Schützenberger, Anne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10427138/
https://www.ncbi.nlm.nih.gov/pubmed/37583544
http://dx.doi.org/10.3390/app12199791
_version_ 1785090168468799488
author Döllinger, Michael
Schraut, Tobias
Henrich, Lea A.
Chhetri, Dinesh
Echternach, Matthias
Johnson, Aaron M.
Kunduk, Melda
Maryn, Youri
Patel, Rita R.
Samlan, Robin
Semmler, Marion
Schützenberger, Anne
author_facet Döllinger, Michael
Schraut, Tobias
Henrich, Lea A.
Chhetri, Dinesh
Echternach, Matthias
Johnson, Aaron M.
Kunduk, Melda
Maryn, Youri
Patel, Rita R.
Samlan, Robin
Semmler, Marion
Schützenberger, Anne
author_sort Döllinger, Michael
collection PubMed
description Endoscopic high-speed video (HSV) systems for visualization and assessment of vocal fold dynamics in the larynx are diverse and technically advancing. To consider resulting “concepts shifts” for neural network (NN)-based image processing, re-training of already trained and used NNs is necessary to allow for sufficiently accurate image processing for new recording modalities. We propose and discuss several re-training approaches for convolutional neural networks (CNN) being used for HSV image segmentation. Our baseline CNN was trained on the BAGLS data set (58,750 images). The new BAGLS-RT data set consists of additional 21,050 images from previously unused HSV systems, light sources, and different spatial resolutions. Results showed that increasing data diversity by means of preprocessing already improves the segmentation accuracy (mIoU + 6.35%). Subsequent re-training further increases segmentation performance (mIoU + 2.81%). For re-training, finetuning with dynamic knowledge distillation showed the most promising results. Data variety for training and additional re-training is a helpful tool to boost HSV image segmentation quality. However, when performing re-training, the phenomenon of catastrophic forgetting should be kept in mind, i.e., adaption to new data while forgetting already learned knowledge.
format Online
Article
Text
id pubmed-10427138
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-104271382023-08-15 Re-Training of Convolutional Neural Networks for Glottis Segmentation in Endoscopic High-Speed Videos Döllinger, Michael Schraut, Tobias Henrich, Lea A. Chhetri, Dinesh Echternach, Matthias Johnson, Aaron M. Kunduk, Melda Maryn, Youri Patel, Rita R. Samlan, Robin Semmler, Marion Schützenberger, Anne Appl Sci (Basel) Article Endoscopic high-speed video (HSV) systems for visualization and assessment of vocal fold dynamics in the larynx are diverse and technically advancing. To consider resulting “concepts shifts” for neural network (NN)-based image processing, re-training of already trained and used NNs is necessary to allow for sufficiently accurate image processing for new recording modalities. We propose and discuss several re-training approaches for convolutional neural networks (CNN) being used for HSV image segmentation. Our baseline CNN was trained on the BAGLS data set (58,750 images). The new BAGLS-RT data set consists of additional 21,050 images from previously unused HSV systems, light sources, and different spatial resolutions. Results showed that increasing data diversity by means of preprocessing already improves the segmentation accuracy (mIoU + 6.35%). Subsequent re-training further increases segmentation performance (mIoU + 2.81%). For re-training, finetuning with dynamic knowledge distillation showed the most promising results. Data variety for training and additional re-training is a helpful tool to boost HSV image segmentation quality. However, when performing re-training, the phenomenon of catastrophic forgetting should be kept in mind, i.e., adaption to new data while forgetting already learned knowledge. 2022-10 2022-09-28 /pmc/articles/PMC10427138/ /pubmed/37583544 http://dx.doi.org/10.3390/app12199791 Text en https://creativecommons.org/licenses/by/4.0/This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Döllinger, Michael
Schraut, Tobias
Henrich, Lea A.
Chhetri, Dinesh
Echternach, Matthias
Johnson, Aaron M.
Kunduk, Melda
Maryn, Youri
Patel, Rita R.
Samlan, Robin
Semmler, Marion
Schützenberger, Anne
Re-Training of Convolutional Neural Networks for Glottis Segmentation in Endoscopic High-Speed Videos
title Re-Training of Convolutional Neural Networks for Glottis Segmentation in Endoscopic High-Speed Videos
title_full Re-Training of Convolutional Neural Networks for Glottis Segmentation in Endoscopic High-Speed Videos
title_fullStr Re-Training of Convolutional Neural Networks for Glottis Segmentation in Endoscopic High-Speed Videos
title_full_unstemmed Re-Training of Convolutional Neural Networks for Glottis Segmentation in Endoscopic High-Speed Videos
title_short Re-Training of Convolutional Neural Networks for Glottis Segmentation in Endoscopic High-Speed Videos
title_sort re-training of convolutional neural networks for glottis segmentation in endoscopic high-speed videos
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10427138/
https://www.ncbi.nlm.nih.gov/pubmed/37583544
http://dx.doi.org/10.3390/app12199791
work_keys_str_mv AT dollingermichael retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT schrauttobias retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT henrichleaa retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT chhetridinesh retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT echternachmatthias retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT johnsonaaronm retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT kundukmelda retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT marynyouri retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT patelritar retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT samlanrobin retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT semmlermarion retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos
AT schutzenbergeranne retrainingofconvolutionalneuralnetworksforglottissegmentationinendoscopichighspeedvideos