Cargando…

A single latent channel is sufficient for biomedical glottis segmentation

Glottis segmentation is a crucial step to quantify endoscopic footage in laryngeal high-speed videoendoscopy. Recent advances in deep neural networks for glottis segmentation allow for a fully automatic workflow. However, exact knowledge of integral parts of these deep segmentation networks remains...

Descripción completa

Detalles Bibliográficos
Autores principales: Kist, Andreas M., Breininger, Katharina, Dörrich, Marion, Dürr, Stephan, Schützenberger, Anne, Semmler, Marion
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395348/
https://www.ncbi.nlm.nih.gov/pubmed/35995933
http://dx.doi.org/10.1038/s41598-022-17764-1
_version_ 1784771671768432640
author Kist, Andreas M.
Breininger, Katharina
Dörrich, Marion
Dürr, Stephan
Schützenberger, Anne
Semmler, Marion
author_facet Kist, Andreas M.
Breininger, Katharina
Dörrich, Marion
Dürr, Stephan
Schützenberger, Anne
Semmler, Marion
author_sort Kist, Andreas M.
collection PubMed
description Glottis segmentation is a crucial step to quantify endoscopic footage in laryngeal high-speed videoendoscopy. Recent advances in deep neural networks for glottis segmentation allow for a fully automatic workflow. However, exact knowledge of integral parts of these deep segmentation networks remains unknown, and understanding the inner workings is crucial for acceptance in clinical practice. Here, we show that a single latent channel as a bottleneck layer is sufficient for glottal area segmentation using systematic ablations. We further demonstrate that the latent space is an abstraction of the glottal area segmentation relying on three spatially defined pixel subtypes allowing for a transparent interpretation. We further provide evidence that the latent space is highly correlated with the glottal area waveform, can be encoded with four bits, and decoded using lean decoders while maintaining a high reconstruction accuracy. Our findings suggest that glottis segmentation is a task that can be highly optimized to gain very efficient and explainable deep neural networks, important for application in the clinic. In the future, we believe that online deep learning-assisted monitoring is a game-changer in laryngeal examinations.
format Online
Article
Text
id pubmed-9395348
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-93953482022-08-24 A single latent channel is sufficient for biomedical glottis segmentation Kist, Andreas M. Breininger, Katharina Dörrich, Marion Dürr, Stephan Schützenberger, Anne Semmler, Marion Sci Rep Article Glottis segmentation is a crucial step to quantify endoscopic footage in laryngeal high-speed videoendoscopy. Recent advances in deep neural networks for glottis segmentation allow for a fully automatic workflow. However, exact knowledge of integral parts of these deep segmentation networks remains unknown, and understanding the inner workings is crucial for acceptance in clinical practice. Here, we show that a single latent channel as a bottleneck layer is sufficient for glottal area segmentation using systematic ablations. We further demonstrate that the latent space is an abstraction of the glottal area segmentation relying on three spatially defined pixel subtypes allowing for a transparent interpretation. We further provide evidence that the latent space is highly correlated with the glottal area waveform, can be encoded with four bits, and decoded using lean decoders while maintaining a high reconstruction accuracy. Our findings suggest that glottis segmentation is a task that can be highly optimized to gain very efficient and explainable deep neural networks, important for application in the clinic. In the future, we believe that online deep learning-assisted monitoring is a game-changer in laryngeal examinations. Nature Publishing Group UK 2022-08-22 /pmc/articles/PMC9395348/ /pubmed/35995933 http://dx.doi.org/10.1038/s41598-022-17764-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Kist, Andreas M.
Breininger, Katharina
Dörrich, Marion
Dürr, Stephan
Schützenberger, Anne
Semmler, Marion
A single latent channel is sufficient for biomedical glottis segmentation
title A single latent channel is sufficient for biomedical glottis segmentation
title_full A single latent channel is sufficient for biomedical glottis segmentation
title_fullStr A single latent channel is sufficient for biomedical glottis segmentation
title_full_unstemmed A single latent channel is sufficient for biomedical glottis segmentation
title_short A single latent channel is sufficient for biomedical glottis segmentation
title_sort single latent channel is sufficient for biomedical glottis segmentation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395348/
https://www.ncbi.nlm.nih.gov/pubmed/35995933
http://dx.doi.org/10.1038/s41598-022-17764-1
work_keys_str_mv AT kistandreasm asinglelatentchannelissufficientforbiomedicalglottissegmentation
AT breiningerkatharina asinglelatentchannelissufficientforbiomedicalglottissegmentation
AT dorrichmarion asinglelatentchannelissufficientforbiomedicalglottissegmentation
AT durrstephan asinglelatentchannelissufficientforbiomedicalglottissegmentation
AT schutzenbergeranne asinglelatentchannelissufficientforbiomedicalglottissegmentation
AT semmlermarion asinglelatentchannelissufficientforbiomedicalglottissegmentation
AT kistandreasm singlelatentchannelissufficientforbiomedicalglottissegmentation
AT breiningerkatharina singlelatentchannelissufficientforbiomedicalglottissegmentation
AT dorrichmarion singlelatentchannelissufficientforbiomedicalglottissegmentation
AT durrstephan singlelatentchannelissufficientforbiomedicalglottissegmentation
AT schutzenbergeranne singlelatentchannelissufficientforbiomedicalglottissegmentation
AT semmlermarion singlelatentchannelissufficientforbiomedicalglottissegmentation