Cargando…

Using spatial-temporal ensembles of convolutional neural networks for lumen segmentation in ureteroscopy

PURPOSE: Ureteroscopy is an efficient endoscopic minimally invasive technique for the diagnosis and treatment of upper tract urothelial carcinoma. During ureteroscopy, the automatic segmentation of the hollow lumen is of primary importance, since it indicates the path that the endoscope should follo...

Descripción completa

Detalles Bibliográficos
Autores principales: Lazo, Jorge F., Marzullo, Aldo, Moccia, Sara, Catellani, Michele, Rosa, Benoit, de Mathelin, Michel, De Momi, Elena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166718/
https://www.ncbi.nlm.nih.gov/pubmed/33909264
http://dx.doi.org/10.1007/s11548-021-02376-3
_version_ 1783701554545557504
author Lazo, Jorge F.
Marzullo, Aldo
Moccia, Sara
Catellani, Michele
Rosa, Benoit
de Mathelin, Michel
De Momi, Elena
author_facet Lazo, Jorge F.
Marzullo, Aldo
Moccia, Sara
Catellani, Michele
Rosa, Benoit
de Mathelin, Michel
De Momi, Elena
author_sort Lazo, Jorge F.
collection PubMed
description PURPOSE: Ureteroscopy is an efficient endoscopic minimally invasive technique for the diagnosis and treatment of upper tract urothelial carcinoma. During ureteroscopy, the automatic segmentation of the hollow lumen is of primary importance, since it indicates the path that the endoscope should follow. In order to obtain an accurate segmentation of the hollow lumen, this paper presents an automatic method based on convolutional neural networks (CNNs). METHODS: The proposed method is based on an ensemble of 4 parallel CNNs to simultaneously process single and multi-frame information. Of these, two architectures are taken as core-models, namely U-Net based in residual blocks ([Formula: see text] ) and Mask-RCNN ([Formula: see text] ), which are fed with single still-frames I(t). The other two models ([Formula: see text] , [Formula: see text] ) are modifications of the former ones consisting on the addition of a stage which makes use of 3D convolutions to process temporal information. [Formula: see text] , [Formula: see text] are fed with triplets of frames ([Formula: see text] , I(t), [Formula: see text] ) to produce the segmentation for I(t). RESULTS: The proposed method was evaluated using a custom dataset of 11 videos (2673 frames) which were collected and manually annotated from 6 patients. We obtain a Dice similarity coefficient of 0.80, outperforming previous state-of-the-art methods. CONCLUSION: The obtained results show that spatial-temporal information can be effectively exploited by the ensemble model to improve hollow lumen segmentation in ureteroscopic images. The method is effective also in the presence of poor visibility, occasional bleeding, or specular reflections. SUPPLEMENTARY INFORMATION: The online version supplementary material available at 10.1007/s11548-021-02376-3.
format Online
Article
Text
id pubmed-8166718
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-81667182021-06-03 Using spatial-temporal ensembles of convolutional neural networks for lumen segmentation in ureteroscopy Lazo, Jorge F. Marzullo, Aldo Moccia, Sara Catellani, Michele Rosa, Benoit de Mathelin, Michel De Momi, Elena Int J Comput Assist Radiol Surg Original Article PURPOSE: Ureteroscopy is an efficient endoscopic minimally invasive technique for the diagnosis and treatment of upper tract urothelial carcinoma. During ureteroscopy, the automatic segmentation of the hollow lumen is of primary importance, since it indicates the path that the endoscope should follow. In order to obtain an accurate segmentation of the hollow lumen, this paper presents an automatic method based on convolutional neural networks (CNNs). METHODS: The proposed method is based on an ensemble of 4 parallel CNNs to simultaneously process single and multi-frame information. Of these, two architectures are taken as core-models, namely U-Net based in residual blocks ([Formula: see text] ) and Mask-RCNN ([Formula: see text] ), which are fed with single still-frames I(t). The other two models ([Formula: see text] , [Formula: see text] ) are modifications of the former ones consisting on the addition of a stage which makes use of 3D convolutions to process temporal information. [Formula: see text] , [Formula: see text] are fed with triplets of frames ([Formula: see text] , I(t), [Formula: see text] ) to produce the segmentation for I(t). RESULTS: The proposed method was evaluated using a custom dataset of 11 videos (2673 frames) which were collected and manually annotated from 6 patients. We obtain a Dice similarity coefficient of 0.80, outperforming previous state-of-the-art methods. CONCLUSION: The obtained results show that spatial-temporal information can be effectively exploited by the ensemble model to improve hollow lumen segmentation in ureteroscopic images. The method is effective also in the presence of poor visibility, occasional bleeding, or specular reflections. SUPPLEMENTARY INFORMATION: The online version supplementary material available at 10.1007/s11548-021-02376-3. Springer International Publishing 2021-04-28 2021 /pmc/articles/PMC8166718/ /pubmed/33909264 http://dx.doi.org/10.1007/s11548-021-02376-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Lazo, Jorge F.
Marzullo, Aldo
Moccia, Sara
Catellani, Michele
Rosa, Benoit
de Mathelin, Michel
De Momi, Elena
Using spatial-temporal ensembles of convolutional neural networks for lumen segmentation in ureteroscopy
title Using spatial-temporal ensembles of convolutional neural networks for lumen segmentation in ureteroscopy
title_full Using spatial-temporal ensembles of convolutional neural networks for lumen segmentation in ureteroscopy
title_fullStr Using spatial-temporal ensembles of convolutional neural networks for lumen segmentation in ureteroscopy
title_full_unstemmed Using spatial-temporal ensembles of convolutional neural networks for lumen segmentation in ureteroscopy
title_short Using spatial-temporal ensembles of convolutional neural networks for lumen segmentation in ureteroscopy
title_sort using spatial-temporal ensembles of convolutional neural networks for lumen segmentation in ureteroscopy
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166718/
https://www.ncbi.nlm.nih.gov/pubmed/33909264
http://dx.doi.org/10.1007/s11548-021-02376-3
work_keys_str_mv AT lazojorgef usingspatialtemporalensemblesofconvolutionalneuralnetworksforlumensegmentationinureteroscopy
AT marzulloaldo usingspatialtemporalensemblesofconvolutionalneuralnetworksforlumensegmentationinureteroscopy
AT mocciasara usingspatialtemporalensemblesofconvolutionalneuralnetworksforlumensegmentationinureteroscopy
AT catellanimichele usingspatialtemporalensemblesofconvolutionalneuralnetworksforlumensegmentationinureteroscopy
AT rosabenoit usingspatialtemporalensemblesofconvolutionalneuralnetworksforlumensegmentationinureteroscopy
AT demathelinmichel usingspatialtemporalensemblesofconvolutionalneuralnetworksforlumensegmentationinureteroscopy
AT demomielena usingspatialtemporalensemblesofconvolutionalneuralnetworksforlumensegmentationinureteroscopy