Cargando…

Limitations of Out-of-Distribution Detection in 3D Medical Image Segmentation

Deep learning models perform unreliably when the data come from a distribution different from the training one. In critical applications such as medical imaging, out-of-distribution (OOD) detection methods help to identify such data samples, preventing erroneous predictions. In this paper, we furthe...

Descripción completa

Detalles Bibliográficos
Autores principales: Vasiliuk, Anton, Frolova, Daria, Belyaev, Mikhail, Shirokikh, Boris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10532230/
https://www.ncbi.nlm.nih.gov/pubmed/37754955
http://dx.doi.org/10.3390/jimaging9090191
_version_ 1785111907026337792
author Vasiliuk, Anton
Frolova, Daria
Belyaev, Mikhail
Shirokikh, Boris
author_facet Vasiliuk, Anton
Frolova, Daria
Belyaev, Mikhail
Shirokikh, Boris
author_sort Vasiliuk, Anton
collection PubMed
description Deep learning models perform unreliably when the data come from a distribution different from the training one. In critical applications such as medical imaging, out-of-distribution (OOD) detection methods help to identify such data samples, preventing erroneous predictions. In this paper, we further investigate OOD detection effectiveness when applied to 3D medical image segmentation. We designed several OOD challenges representing clinically occurring cases and found that none of the methods achieved acceptable performance. Methods not dedicated to segmentation severely failed to perform in the designed setups; the best mean false-positive rate at a 95% true-positive rate (FPR) was 0.59. Segmentation-dedicated methods still achieved suboptimal performance, with the best mean FPR being 0.31 (lower is better). To indicate this suboptimality, we developed a simple method called Intensity Histogram Features (IHF), which performed comparably or better in the same challenges, with a mean FPR of 0.25. Our findings highlight the limitations of the existing OOD detection methods with 3D medical images and present a promising avenue for improving them. To facilitate research in this area, we release the designed challenges as a publicly available benchmark and formulate practical criteria to test the generalization of OOD detection beyond the suggested benchmark. We also propose IHF as a solid baseline to contest emerging methods.
format Online
Article
Text
id pubmed-10532230
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-105322302023-09-28 Limitations of Out-of-Distribution Detection in 3D Medical Image Segmentation Vasiliuk, Anton Frolova, Daria Belyaev, Mikhail Shirokikh, Boris J Imaging Article Deep learning models perform unreliably when the data come from a distribution different from the training one. In critical applications such as medical imaging, out-of-distribution (OOD) detection methods help to identify such data samples, preventing erroneous predictions. In this paper, we further investigate OOD detection effectiveness when applied to 3D medical image segmentation. We designed several OOD challenges representing clinically occurring cases and found that none of the methods achieved acceptable performance. Methods not dedicated to segmentation severely failed to perform in the designed setups; the best mean false-positive rate at a 95% true-positive rate (FPR) was 0.59. Segmentation-dedicated methods still achieved suboptimal performance, with the best mean FPR being 0.31 (lower is better). To indicate this suboptimality, we developed a simple method called Intensity Histogram Features (IHF), which performed comparably or better in the same challenges, with a mean FPR of 0.25. Our findings highlight the limitations of the existing OOD detection methods with 3D medical images and present a promising avenue for improving them. To facilitate research in this area, we release the designed challenges as a publicly available benchmark and formulate practical criteria to test the generalization of OOD detection beyond the suggested benchmark. We also propose IHF as a solid baseline to contest emerging methods. MDPI 2023-09-18 /pmc/articles/PMC10532230/ /pubmed/37754955 http://dx.doi.org/10.3390/jimaging9090191 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Vasiliuk, Anton
Frolova, Daria
Belyaev, Mikhail
Shirokikh, Boris
Limitations of Out-of-Distribution Detection in 3D Medical Image Segmentation
title Limitations of Out-of-Distribution Detection in 3D Medical Image Segmentation
title_full Limitations of Out-of-Distribution Detection in 3D Medical Image Segmentation
title_fullStr Limitations of Out-of-Distribution Detection in 3D Medical Image Segmentation
title_full_unstemmed Limitations of Out-of-Distribution Detection in 3D Medical Image Segmentation
title_short Limitations of Out-of-Distribution Detection in 3D Medical Image Segmentation
title_sort limitations of out-of-distribution detection in 3d medical image segmentation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10532230/
https://www.ncbi.nlm.nih.gov/pubmed/37754955
http://dx.doi.org/10.3390/jimaging9090191
work_keys_str_mv AT vasiliukanton limitationsofoutofdistributiondetectionin3dmedicalimagesegmentation
AT frolovadaria limitationsofoutofdistributiondetectionin3dmedicalimagesegmentation
AT belyaevmikhail limitationsofoutofdistributiondetectionin3dmedicalimagesegmentation
AT shirokikhboris limitationsofoutofdistributiondetectionin3dmedicalimagesegmentation