Cargando…
Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
OBJECTIVES: To simulate clinical deployment, evaluate performance, and establish quality assurance of a deep learning algorithm (U-Net) for detection, localization, and segmentation of clinically significant prostate cancer (sPC), ISUP grade group ≥ 2, using bi-parametric MRI. METHODS: In 2017, 284...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755653/ https://www.ncbi.nlm.nih.gov/pubmed/32767102 http://dx.doi.org/10.1007/s00330-020-07086-z |
_version_ | 1783626394788429824 |
---|---|
author | Schelb, Patrick Wang, Xianfeng Radtke, Jan Philipp Wiesenfarth, Manuel Kickingereder, Philipp Stenzinger, Albrecht Hohenfellner, Markus Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Bonekamp, David |
author_facet | Schelb, Patrick Wang, Xianfeng Radtke, Jan Philipp Wiesenfarth, Manuel Kickingereder, Philipp Stenzinger, Albrecht Hohenfellner, Markus Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Bonekamp, David |
author_sort | Schelb, Patrick |
collection | PubMed |
description | OBJECTIVES: To simulate clinical deployment, evaluate performance, and establish quality assurance of a deep learning algorithm (U-Net) for detection, localization, and segmentation of clinically significant prostate cancer (sPC), ISUP grade group ≥ 2, using bi-parametric MRI. METHODS: In 2017, 284 consecutive men in active surveillance, biopsy-naïve or pre-biopsied, received targeted and extended systematic MRI/transrectal US-fusion biopsy, after examination on a single MRI scanner (3 T). A prospective adjustment scheme was evaluated comparing the performance of the Prostate Imaging Reporting and Data System (PI-RADS) and U-Net using sensitivity, specificity, predictive values, and the Dice coefficient. RESULTS: In the 259 eligible men (median 64 [IQR 61–72] years), PI-RADS had a sensitivity of 98% [106/108]/84% [91/108] with a specificity of 17% [25/151]/58% [88/151], for thresholds at ≥ 3/≥ 4 respectively. U-Net using dynamic threshold adjustment had a sensitivity of 99% [107/108]/83% [90/108] (p > 0.99/> 0.99) with a specificity of 24% [36/151]/55% [83/151] (p > 0.99/> 0.99) for probability thresholds d3 and d4 emulating PI-RADS ≥ 3 and ≥ 4 decisions respectively, not statistically different from PI-RADS. Co-occurrence of a radiological PI-RADS ≥ 4 examination and U-Net ≥ d3 assessment significantly improved the positive predictive value from 59 to 63% (p = 0.03), on a per-patient basis. CONCLUSIONS: U-Net has similar performance to PI-RADS in simulated continued clinical use. Regular quality assurance should be implemented to ensure desired performance. KEY POINTS: • U-Net maintained similar diagnostic performance compared to radiological assessment of PI-RADS ≥ 4 when applied in a simulated clinical deployment. • Application of our proposed prospective dynamic calibration method successfully adjusted U-Net performance within acceptable limits of the PI-RADS reference over time, while not being limited to PI-RADS as a reference. • Simultaneous detection by U-Net and radiological assessment significantly improved the positive predictive value on a per-patient and per-lesion basis, while the negative predictive value remained unchanged. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00330-020-07086-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-7755653 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-77556532020-12-28 Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment Schelb, Patrick Wang, Xianfeng Radtke, Jan Philipp Wiesenfarth, Manuel Kickingereder, Philipp Stenzinger, Albrecht Hohenfellner, Markus Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Bonekamp, David Eur Radiol Urogenital OBJECTIVES: To simulate clinical deployment, evaluate performance, and establish quality assurance of a deep learning algorithm (U-Net) for detection, localization, and segmentation of clinically significant prostate cancer (sPC), ISUP grade group ≥ 2, using bi-parametric MRI. METHODS: In 2017, 284 consecutive men in active surveillance, biopsy-naïve or pre-biopsied, received targeted and extended systematic MRI/transrectal US-fusion biopsy, after examination on a single MRI scanner (3 T). A prospective adjustment scheme was evaluated comparing the performance of the Prostate Imaging Reporting and Data System (PI-RADS) and U-Net using sensitivity, specificity, predictive values, and the Dice coefficient. RESULTS: In the 259 eligible men (median 64 [IQR 61–72] years), PI-RADS had a sensitivity of 98% [106/108]/84% [91/108] with a specificity of 17% [25/151]/58% [88/151], for thresholds at ≥ 3/≥ 4 respectively. U-Net using dynamic threshold adjustment had a sensitivity of 99% [107/108]/83% [90/108] (p > 0.99/> 0.99) with a specificity of 24% [36/151]/55% [83/151] (p > 0.99/> 0.99) for probability thresholds d3 and d4 emulating PI-RADS ≥ 3 and ≥ 4 decisions respectively, not statistically different from PI-RADS. Co-occurrence of a radiological PI-RADS ≥ 4 examination and U-Net ≥ d3 assessment significantly improved the positive predictive value from 59 to 63% (p = 0.03), on a per-patient basis. CONCLUSIONS: U-Net has similar performance to PI-RADS in simulated continued clinical use. Regular quality assurance should be implemented to ensure desired performance. KEY POINTS: • U-Net maintained similar diagnostic performance compared to radiological assessment of PI-RADS ≥ 4 when applied in a simulated clinical deployment. • Application of our proposed prospective dynamic calibration method successfully adjusted U-Net performance within acceptable limits of the PI-RADS reference over time, while not being limited to PI-RADS as a reference. • Simultaneous detection by U-Net and radiological assessment significantly improved the positive predictive value on a per-patient and per-lesion basis, while the negative predictive value remained unchanged. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00330-020-07086-z) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2020-08-07 2021 /pmc/articles/PMC7755653/ /pubmed/32767102 http://dx.doi.org/10.1007/s00330-020-07086-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Urogenital Schelb, Patrick Wang, Xianfeng Radtke, Jan Philipp Wiesenfarth, Manuel Kickingereder, Philipp Stenzinger, Albrecht Hohenfellner, Markus Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Bonekamp, David Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment |
title | Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment |
title_full | Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment |
title_fullStr | Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment |
title_full_unstemmed | Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment |
title_short | Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment |
title_sort | simulated clinical deployment of fully automatic deep learning for clinical prostate mri assessment |
topic | Urogenital |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755653/ https://www.ncbi.nlm.nih.gov/pubmed/32767102 http://dx.doi.org/10.1007/s00330-020-07086-z |
work_keys_str_mv | AT schelbpatrick simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT wangxianfeng simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT radtkejanphilipp simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT wiesenfarthmanuel simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT kickingerederphilipp simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT stenzingeralbrecht simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT hohenfellnermarkus simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT schlemmerheinzpeter simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT maierheinklaush simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT bonekampdavid simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment |