Cargando…
Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability
OBJECTIVES: To evaluate a fully automatic deep learning system to detect and segment clinically significant prostate cancer (csPCa) on same-vendor prostate MRI from two different institutions not contributing to training of the system. MATERIALS AND METHODS: In this retrospective study, a previously...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598076/ https://www.ncbi.nlm.nih.gov/pubmed/37507610 http://dx.doi.org/10.1007/s00330-023-09882-9 |
_version_ | 1785125474516598784 |
---|---|
author | Netzer, Nils Eith, Carolin Bethge, Oliver Hielscher, Thomas Schwab, Constantin Stenzinger, Albrecht Gnirs, Regula Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Schimmöller, Lars Bonekamp, David |
author_facet | Netzer, Nils Eith, Carolin Bethge, Oliver Hielscher, Thomas Schwab, Constantin Stenzinger, Albrecht Gnirs, Regula Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Schimmöller, Lars Bonekamp, David |
author_sort | Netzer, Nils |
collection | PubMed |
description | OBJECTIVES: To evaluate a fully automatic deep learning system to detect and segment clinically significant prostate cancer (csPCa) on same-vendor prostate MRI from two different institutions not contributing to training of the system. MATERIALS AND METHODS: In this retrospective study, a previously bi-institutionally validated deep learning system (UNETM) was applied to bi-parametric prostate MRI data from one external institution (A), a PI-RADS distribution-matched internal cohort (B), and a csPCa stratified subset of single-institution external public challenge data (C). csPCa was defined as ISUP Grade Group ≥ 2 determined from combined targeted and extended systematic MRI/transrectal US-fusion biopsy. Performance of UNETM was evaluated by comparing ROC AUC and specificity at typical PI-RADS sensitivity levels. Lesion-level analysis between UNETM segmentations and radiologist-delineated segmentations was performed using Dice coefficient, free-response operating characteristic (FROC), and weighted alternative (waFROC). The influence of using different diffusion sequences was analyzed in cohort A. RESULTS: In 250/250/140 exams in cohorts A/B/C, differences in ROC AUC were insignificant with 0.80 (95% CI: 0.74–0.85)/0.87 (95% CI: 0.83–0.92)/0.82 (95% CI: 0.75–0.89). At sensitivities of 95% and 90%, UNETM achieved specificity of 30%/50% in A, 44%/71% in B, and 43%/49% in C, respectively. Dice coefficient of UNETM and radiologist-delineated lesions was 0.36 in A and 0.49 in B. The waFROC AUC was 0.67 (95% CI: 0.60–0.83) in A and 0.7 (95% CI: 0.64–0.78) in B. UNETM performed marginally better on readout-segmented than on single-shot echo-planar-imaging. CONCLUSION: For same-vendor examinations, deep learning provided comparable discrimination of csPCa and non-csPCa lesions and examinations between local and two independent external data sets, demonstrating the applicability of the system to institutions not participating in model training. CLINICAL RELEVANCE STATEMENT: A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets, indicating the potential of deploying AI models without retraining or fine-tuning, and corroborating evidence that AI models extract a substantial amount of transferable domain knowledge about MRI-based prostate cancer assessment. KEY POINTS: • A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets. • Lesion detection performance and segmentation congruence was similar on the institutional and an external data set, as measured by the weighted alternative FROC AUC and Dice coefficient. • Although the system generalized to two external institutions without re-training, achieving expected sensitivity and specificity levels using the deep learning system requires probability thresholds to be adjusted, underlining the importance of institution-specific calibration and quality control. |
format | Online Article Text |
id | pubmed-10598076 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-105980762023-10-26 Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability Netzer, Nils Eith, Carolin Bethge, Oliver Hielscher, Thomas Schwab, Constantin Stenzinger, Albrecht Gnirs, Regula Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Schimmöller, Lars Bonekamp, David Eur Radiol Imaging Informatics and Artificial Intelligence OBJECTIVES: To evaluate a fully automatic deep learning system to detect and segment clinically significant prostate cancer (csPCa) on same-vendor prostate MRI from two different institutions not contributing to training of the system. MATERIALS AND METHODS: In this retrospective study, a previously bi-institutionally validated deep learning system (UNETM) was applied to bi-parametric prostate MRI data from one external institution (A), a PI-RADS distribution-matched internal cohort (B), and a csPCa stratified subset of single-institution external public challenge data (C). csPCa was defined as ISUP Grade Group ≥ 2 determined from combined targeted and extended systematic MRI/transrectal US-fusion biopsy. Performance of UNETM was evaluated by comparing ROC AUC and specificity at typical PI-RADS sensitivity levels. Lesion-level analysis between UNETM segmentations and radiologist-delineated segmentations was performed using Dice coefficient, free-response operating characteristic (FROC), and weighted alternative (waFROC). The influence of using different diffusion sequences was analyzed in cohort A. RESULTS: In 250/250/140 exams in cohorts A/B/C, differences in ROC AUC were insignificant with 0.80 (95% CI: 0.74–0.85)/0.87 (95% CI: 0.83–0.92)/0.82 (95% CI: 0.75–0.89). At sensitivities of 95% and 90%, UNETM achieved specificity of 30%/50% in A, 44%/71% in B, and 43%/49% in C, respectively. Dice coefficient of UNETM and radiologist-delineated lesions was 0.36 in A and 0.49 in B. The waFROC AUC was 0.67 (95% CI: 0.60–0.83) in A and 0.7 (95% CI: 0.64–0.78) in B. UNETM performed marginally better on readout-segmented than on single-shot echo-planar-imaging. CONCLUSION: For same-vendor examinations, deep learning provided comparable discrimination of csPCa and non-csPCa lesions and examinations between local and two independent external data sets, demonstrating the applicability of the system to institutions not participating in model training. CLINICAL RELEVANCE STATEMENT: A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets, indicating the potential of deploying AI models without retraining or fine-tuning, and corroborating evidence that AI models extract a substantial amount of transferable domain knowledge about MRI-based prostate cancer assessment. KEY POINTS: • A previously bi-institutionally validated fully automatic deep learning system maintained acceptable exam-level diagnostic performance in two independent external data sets. • Lesion detection performance and segmentation congruence was similar on the institutional and an external data set, as measured by the weighted alternative FROC AUC and Dice coefficient. • Although the system generalized to two external institutions without re-training, achieving expected sensitivity and specificity levels using the deep learning system requires probability thresholds to be adjusted, underlining the importance of institution-specific calibration and quality control. Springer Berlin Heidelberg 2023-07-28 2023 /pmc/articles/PMC10598076/ /pubmed/37507610 http://dx.doi.org/10.1007/s00330-023-09882-9 Text en © German Cancer Research Center 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Imaging Informatics and Artificial Intelligence Netzer, Nils Eith, Carolin Bethge, Oliver Hielscher, Thomas Schwab, Constantin Stenzinger, Albrecht Gnirs, Regula Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Schimmöller, Lars Bonekamp, David Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability |
title | Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability |
title_full | Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability |
title_fullStr | Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability |
title_full_unstemmed | Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability |
title_short | Application of a validated prostate MRI deep learning system to independent same-vendor multi-institutional data: demonstration of transferability |
title_sort | application of a validated prostate mri deep learning system to independent same-vendor multi-institutional data: demonstration of transferability |
topic | Imaging Informatics and Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598076/ https://www.ncbi.nlm.nih.gov/pubmed/37507610 http://dx.doi.org/10.1007/s00330-023-09882-9 |
work_keys_str_mv | AT netzernils applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT eithcarolin applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT bethgeoliver applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT hielscherthomas applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT schwabconstantin applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT stenzingeralbrecht applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT gnirsregula applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT schlemmerheinzpeter applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT maierheinklaush applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT schimmollerlars applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability AT bonekampdavid applicationofavalidatedprostatemrideeplearningsystemtoindependentsamevendormultiinstitutionaldatademonstrationoftransferability |