Cargando…

Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability

OBJECTIVES: To evaluate a fully automatic deep learning system to detect and segment clinically significant prostate cancer (csPCa) on same-vendor prostate MRI from two different institutions not contributing to training of the system. MATERIALS AND METHODS: In this retrospective study, a previously...

Descripción completa

Detalles Bibliográficos
Autores principales: Netzer, Nils, Eith, Carolin, Bethge, Oliver, Hielscher, Thomas, Schwab, Constantin, Stenzinger, Albrecht, Gnirs, Regula, Schlemmer, Heinz-Peter, Maier-Hein, Klaus H., Schimmöller, Lars, Bonekamp, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598076/
https://www.ncbi.nlm.nih.gov/pubmed/37507610
http://dx.doi.org/10.1007/s00330-023-09882-9
_version_ 1785125474516598784
author Netzer, Nils
Eith, Carolin
Bethge, Oliver
Hielscher, Thomas
Schwab, Constantin
Stenzinger, Albrecht
Gnirs, Regula
Schlemmer, Heinz-Peter
Maier-Hein, Klaus H.
Schimmöller, Lars
Bonekamp, David
author_facet Netzer, Nils
Eith, Carolin
Bethge, Oliver
Hielscher, Thomas
Schwab, Constantin
Stenzinger, Albrecht
Gnirs, Regula
Schlemmer, Heinz-Peter
Maier-Hein, Klaus H.
Schimmöller, Lars
Bonekamp, David
author_sort Netzer, Nils
collection PubMed
description OBJECTIVES: To evaluate a fully automatic deep learning system to detect and segment clinically significant prostate cancer (csPCa) on same-vendor prostate MRI from two different institutions not contributing to training of the system. MATERIALS AND METHODS: In this retrospective study, a previously bi-institutionally validated deep learning system (UNETM) was applied to bi-parametric prostate MRI data from one external institution (A), a PI-RADS distribution-matched internal cohort (B), and a csPCa stratified subset of single-institution external public challenge data (C). csPCa was defined as ISUP Grade Group ≥ 2 determined from combined targeted and extended systematic MRI/transrectal US-fusion biopsy. Performance of UNETM was evaluated by comparing ROC AUC and specificity at typical PI-RADS sensitivity levels. Lesion-level analysis between UNETM segmentations and radiologist-delineated segmentations was performed using Dice coefficient, free-response operating characteristic (FROC), and weighted alternative (waFROC). The influence of using different diffusion sequences was analyzed in cohort A. RESULTS: In 250/250/140 exams in cohorts A/B/C, differences in ROC AUC were insignificant with 0.80 (95% CI: 0.74–0.85)/0.87 (95% CI: 0.83–0.92)/0.82 (95% CI: 0.75–0.89). At sensitivities of 95% and 90%, UNETM achieved specificity of 30%/50% in A, 44%/71% in B, and 43%/49% in C, respectively. Dice coefficient of UNETM and radiologist-delineated lesions was 0.36 in A and 0.49 in B. The waFROC AUC was 0.67 (95% CI: 0.60–0.83) in A and 0.7 (95% CI: 0.64–0.78) in B. UNETM performed marginally better on readout-segmented than on single-shot echo-planar-imaging. CONCLUSION: For same-vendor examinations, deep learning provided comparable discrimination of csPCa and non-csPCa lesions and examinations between local and two independent external data sets, demonstrating the applicability of the system to institutions not participating in model training. CLINICAL RELEVANCE STATEMENT: A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets, indicating the potential of deploying AI models without retraining or fine-tuning, and corroborating evidence that AI models extract a substantial amount of transferable domain knowledge about MRI-based prostate cancer assessment. KEY POINTS: • A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets. • Lesion detection performance and segmentation congruence was similar on the institutional and an external data set, as measured by the weighted alternative FROC AUC and Dice coefficient. • Although the system generalized to two external institutions without re-training, achieving expected sensitivity and specificity levels using the deep learning system requires probability thresholds to be adjusted, underlining the importance of institution-specific calibration and quality control.
format Online
Article
Text
id pubmed-10598076
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-105980762023-10-26 Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability Netzer, Nils Eith, Carolin Bethge, Oliver Hielscher, Thomas Schwab, Constantin Stenzinger, Albrecht Gnirs, Regula Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Schimmöller, Lars Bonekamp, David Eur Radiol Imaging Informatics and Artificial Intelligence OBJECTIVES: To evaluate a fully automatic deep learning system to detect and segment clinically significant prostate cancer (csPCa) on same-vendor prostate MRI from two different institutions not contributing to training of the system. MATERIALS AND METHODS: In this retrospective study, a previously bi-institutionally validated deep learning system (UNETM) was applied to bi-parametric prostate MRI data from one external institution (A), a PI-RADS distribution-matched internal cohort (B), and a csPCa stratified subset of single-institution external public challenge data (C). csPCa was defined as ISUP Grade Group ≥ 2 determined from combined targeted and extended systematic MRI/transrectal US-fusion biopsy. Performance of UNETM was evaluated by comparing ROC AUC and specificity at typical PI-RADS sensitivity levels. Lesion-level analysis between UNETM segmentations and radiologist-delineated segmentations was performed using Dice coefficient, free-response operating characteristic (FROC), and weighted alternative (waFROC). The influence of using different diffusion sequences was analyzed in cohort A. RESULTS: In 250/250/140 exams in cohorts A/B/C, differences in ROC AUC were insignificant with 0.80 (95% CI: 0.74–0.85)/0.87 (95% CI: 0.83–0.92)/0.82 (95% CI: 0.75–0.89). At sensitivities of 95% and 90%, UNETM achieved specificity of 30%/50% in A, 44%/71% in B, and 43%/49% in C, respectively. Dice coefficient of UNETM and radiologist-delineated lesions was 0.36 in A and 0.49 in B. The waFROC AUC was 0.67 (95% CI: 0.60–0.83) in A and 0.7 (95% CI: 0.64–0.78) in B. UNETM performed marginally better on readout-segmented than on single-shot echo-planar-imaging. CONCLUSION: For same-vendor examinations, deep learning provided comparable discrimination of csPCa and non-csPCa lesions and examinations between local and two independent external data sets, demonstrating the applicability of the system to institutions not participating in model training. CLINICAL RELEVANCE STATEMENT: A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets, indicating the potential of deploying AI models without retraining or fine-tuning, and corroborating evidence that AI models extract a substantial amount of transferable domain knowledge about MRI-based prostate cancer assessment. KEY POINTS: • A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets. • Lesion detection performance and segmentation congruence was similar on the institutional and an external data set, as measured by the weighted alternative FROC AUC and Dice coefficient. • Although the system generalized to two external institutions without re-training, achieving expected sensitivity and specificity levels using the deep learning system requires probability thresholds to be adjusted, underlining the importance of institution-specific calibration and quality control. Springer Berlin Heidelberg 2023-07-28 2023 /pmc/articles/PMC10598076/ /pubmed/37507610 http://dx.doi.org/10.1007/s00330-023-09882-9 Text en © German Cancer Research Center 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Imaging Informatics and Artificial Intelligence
Netzer, Nils
Eith, Carolin
Bethge, Oliver
Hielscher, Thomas
Schwab, Constantin
Stenzinger, Albrecht
Gnirs, Regula
Schlemmer, Heinz-Peter
Maier-Hein, Klaus H.
Schimmöller, Lars
Bonekamp, David
Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability
title Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability
title_full Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability
title_fullStr Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability
title_full_unstemmed Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability
title_short Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability
title_sort application of a validated prostate mri deep learning system to independent same-vendor multi-institutional data: demonstration of transferability
topic Imaging Informatics and Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598076/
https://www.ncbi.nlm.nih.gov/pubmed/37507610
http://dx.doi.org/10.1007/s00330-023-09882-9
work_keys_str_mv AT netzernils applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT eithcarolin applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT bethgeoliver applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT hielscherthomas applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT schwabconstantin applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT stenzingeralbrecht applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT gnirsregula applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT schlemmerheinzpeter applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT maierheinklaush applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT schimmollerlars applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability
AT bonekampdavid applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability