Cargando…
A single latent channel is sufficient for biomedical glottis segmentation
Glottis segmentation is a crucial step to quantify endoscopic footage in laryngeal high-speed videoendoscopy. Recent advances in deep neural networks for glottis segmentation allow for a fully automatic workflow. However, exact knowledge of integral parts of these deep segmentation networks remains...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395348/ https://www.ncbi.nlm.nih.gov/pubmed/35995933 http://dx.doi.org/10.1038/s41598-022-17764-1 |
_version_ | 1784771671768432640 |
---|---|
author | Kist, Andreas M. Breininger, Katharina Dörrich, Marion Dürr, Stephan Schützenberger, Anne Semmler, Marion |
author_facet | Kist, Andreas M. Breininger, Katharina Dörrich, Marion Dürr, Stephan Schützenberger, Anne Semmler, Marion |
author_sort | Kist, Andreas M. |
collection | PubMed |
description | Glottis segmentation is a crucial step to quantify endoscopic footage in laryngeal high-speed videoendoscopy. Recent advances in deep neural networks for glottis segmentation allow for a fully automatic workflow. However, exact knowledge of integral parts of these deep segmentation networks remains unknown, and understanding the inner workings is crucial for acceptance in clinical practice. Here, we show that a single latent channel as a bottleneck layer is sufficient for glottal area segmentation using systematic ablations. We further demonstrate that the latent space is an abstraction of the glottal area segmentation relying on three spatially defined pixel subtypes allowing for a transparent interpretation. We further provide evidence that the latent space is highly correlated with the glottal area waveform, can be encoded with four bits, and decoded using lean decoders while maintaining a high reconstruction accuracy. Our findings suggest that glottis segmentation is a task that can be highly optimized to gain very efficient and explainable deep neural networks, important for application in the clinic. In the future, we believe that online deep learning-assisted monitoring is a game-changer in laryngeal examinations. |
format | Online Article Text |
id | pubmed-9395348 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-93953482022-08-24 A single latent channel is sufficient for biomedical glottis segmentation Kist, Andreas M. Breininger, Katharina Dörrich, Marion Dürr, Stephan Schützenberger, Anne Semmler, Marion Sci Rep Article Glottis segmentation is a crucial step to quantify endoscopic footage in laryngeal high-speed videoendoscopy. Recent advances in deep neural networks for glottis segmentation allow for a fully automatic workflow. However, exact knowledge of integral parts of these deep segmentation networks remains unknown, and understanding the inner workings is crucial for acceptance in clinical practice. Here, we show that a single latent channel as a bottleneck layer is sufficient for glottal area segmentation using systematic ablations. We further demonstrate that the latent space is an abstraction of the glottal area segmentation relying on three spatially defined pixel subtypes allowing for a transparent interpretation. We further provide evidence that the latent space is highly correlated with the glottal area waveform, can be encoded with four bits, and decoded using lean decoders while maintaining a high reconstruction accuracy. Our findings suggest that glottis segmentation is a task that can be highly optimized to gain very efficient and explainable deep neural networks, important for application in the clinic. In the future, we believe that online deep learning-assisted monitoring is a game-changer in laryngeal examinations. Nature Publishing Group UK 2022-08-22 /pmc/articles/PMC9395348/ /pubmed/35995933 http://dx.doi.org/10.1038/s41598-022-17764-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Kist, Andreas M. Breininger, Katharina Dörrich, Marion Dürr, Stephan Schützenberger, Anne Semmler, Marion A single latent channel is sufficient for biomedical glottis segmentation |
title | A single latent channel is sufficient for biomedical glottis segmentation |
title_full | A single latent channel is sufficient for biomedical glottis segmentation |
title_fullStr | A single latent channel is sufficient for biomedical glottis segmentation |
title_full_unstemmed | A single latent channel is sufficient for biomedical glottis segmentation |
title_short | A single latent channel is sufficient for biomedical glottis segmentation |
title_sort | single latent channel is sufficient for biomedical glottis segmentation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395348/ https://www.ncbi.nlm.nih.gov/pubmed/35995933 http://dx.doi.org/10.1038/s41598-022-17764-1 |
work_keys_str_mv | AT kistandreasm asinglelatentchannelissufficientforbiomedicalglottissegmentation AT breiningerkatharina asinglelatentchannelissufficientforbiomedicalglottissegmentation AT dorrichmarion asinglelatentchannelissufficientforbiomedicalglottissegmentation AT durrstephan asinglelatentchannelissufficientforbiomedicalglottissegmentation AT schutzenbergeranne asinglelatentchannelissufficientforbiomedicalglottissegmentation AT semmlermarion asinglelatentchannelissufficientforbiomedicalglottissegmentation AT kistandreasm singlelatentchannelissufficientforbiomedicalglottissegmentation AT breiningerkatharina singlelatentchannelissufficientforbiomedicalglottissegmentation AT dorrichmarion singlelatentchannelissufficientforbiomedicalglottissegmentation AT durrstephan singlelatentchannelissufficientforbiomedicalglottissegmentation AT schutzenbergeranne singlelatentchannelissufficientforbiomedicalglottissegmentation AT semmlermarion singlelatentchannelissufficientforbiomedicalglottissegmentation |