Cargando…
Deep Ensembles Are Robust to Occasional Catastrophic Failures of Individual DNNs for Organs Segmentations in CT Images
Deep neural networks (DNNs) have recently showed remarkable performance in various computer vision tasks, including classification and segmentation of medical images. Deep ensembles (an aggregated prediction of multiple DNNs) were shown to improve a DNN’s performance in various classification tasks....
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10502003/ https://www.ncbi.nlm.nih.gov/pubmed/37291384 http://dx.doi.org/10.1007/s10278-023-00857-2 |
_version_ | 1785106223857664000 |
---|---|
author | Petrov, Yury Malik, Bilal Fredrickson, Jill Jemaa, Skander Carano, Richard A. D. |
author_facet | Petrov, Yury Malik, Bilal Fredrickson, Jill Jemaa, Skander Carano, Richard A. D. |
author_sort | Petrov, Yury |
collection | PubMed |
description | Deep neural networks (DNNs) have recently showed remarkable performance in various computer vision tasks, including classification and segmentation of medical images. Deep ensembles (an aggregated prediction of multiple DNNs) were shown to improve a DNN’s performance in various classification tasks. Here we explore how deep ensembles perform in the image segmentation task, in particular, organ segmentations in CT (Computed Tomography) images. Ensembles of V-Nets were trained to segment multiple organs using several in-house and publicly available clinical studies. The ensembles segmentations were tested on images from a different set of studies, and the effects of ensemble size as well as other ensemble parameters were explored for various organs. Compared to single models, Deep Ensembles significantly improved the average segmentation accuracy, especially for those organs where the accuracy was lower. More importantly, Deep Ensembles strongly reduced occasional “catastrophic” segmentation failures characteristic of single models and variability of the segmentation accuracy from image to image. To quantify this we defined the “high risk images”: images for which at least one model produced an outlier metric (performed in the lower 5% percentile). These images comprised about 12% of the test images across all organs. Ensembles performed without outliers for 68%–100% of the “high risk images” depending on the performance metric used. |
format | Online Article Text |
id | pubmed-10502003 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-105020032023-09-16 Deep Ensembles Are Robust to Occasional Catastrophic Failures of Individual DNNs for Organs Segmentations in CT Images Petrov, Yury Malik, Bilal Fredrickson, Jill Jemaa, Skander Carano, Richard A. D. J Digit Imaging Article Deep neural networks (DNNs) have recently showed remarkable performance in various computer vision tasks, including classification and segmentation of medical images. Deep ensembles (an aggregated prediction of multiple DNNs) were shown to improve a DNN’s performance in various classification tasks. Here we explore how deep ensembles perform in the image segmentation task, in particular, organ segmentations in CT (Computed Tomography) images. Ensembles of V-Nets were trained to segment multiple organs using several in-house and publicly available clinical studies. The ensembles segmentations were tested on images from a different set of studies, and the effects of ensemble size as well as other ensemble parameters were explored for various organs. Compared to single models, Deep Ensembles significantly improved the average segmentation accuracy, especially for those organs where the accuracy was lower. More importantly, Deep Ensembles strongly reduced occasional “catastrophic” segmentation failures characteristic of single models and variability of the segmentation accuracy from image to image. To quantify this we defined the “high risk images”: images for which at least one model produced an outlier metric (performed in the lower 5% percentile). These images comprised about 12% of the test images across all organs. Ensembles performed without outliers for 68%–100% of the “high risk images” depending on the performance metric used. Springer International Publishing 2023-06-08 2023-10 /pmc/articles/PMC10502003/ /pubmed/37291384 http://dx.doi.org/10.1007/s10278-023-00857-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Petrov, Yury Malik, Bilal Fredrickson, Jill Jemaa, Skander Carano, Richard A. D. Deep Ensembles Are Robust to Occasional Catastrophic Failures of Individual DNNs for Organs Segmentations in CT Images |
title | Deep Ensembles Are Robust to Occasional Catastrophic Failures of Individual DNNs for Organs Segmentations in CT Images |
title_full | Deep Ensembles Are Robust to Occasional Catastrophic Failures of Individual DNNs for Organs Segmentations in CT Images |
title_fullStr | Deep Ensembles Are Robust to Occasional Catastrophic Failures of Individual DNNs for Organs Segmentations in CT Images |
title_full_unstemmed | Deep Ensembles Are Robust to Occasional Catastrophic Failures of Individual DNNs for Organs Segmentations in CT Images |
title_short | Deep Ensembles Are Robust to Occasional Catastrophic Failures of Individual DNNs for Organs Segmentations in CT Images |
title_sort | deep ensembles are robust to occasional catastrophic failures of individual dnns for organs segmentations in ct images |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10502003/ https://www.ncbi.nlm.nih.gov/pubmed/37291384 http://dx.doi.org/10.1007/s10278-023-00857-2 |
work_keys_str_mv | AT petrovyury deepensemblesarerobusttooccasionalcatastrophicfailuresofindividualdnnsfororganssegmentationsinctimages AT malikbilal deepensemblesarerobusttooccasionalcatastrophicfailuresofindividualdnnsfororganssegmentationsinctimages AT fredricksonjill deepensemblesarerobusttooccasionalcatastrophicfailuresofindividualdnnsfororganssegmentationsinctimages AT jemaaskander deepensemblesarerobusttooccasionalcatastrophicfailuresofindividualdnnsfororganssegmentationsinctimages AT caranorichardad deepensemblesarerobusttooccasionalcatastrophicfailuresofindividualdnnsfororganssegmentationsinctimages |