Cargando…

Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment

OBJECTIVES: To simulate clinical deployment, evaluate performance, and establish quality assurance of a deep learning algorithm (U-Net) for detection, localization, and segmentation of clinically significant prostate cancer (sPC), ISUP grade group ≥ 2, using bi-parametric MRI. METHODS: In 2017, 284...

Descripción completa

Detalles Bibliográficos
Autores principales: Schelb, Patrick, Wang, Xianfeng, Radtke, Jan Philipp, Wiesenfarth, Manuel, Kickingereder, Philipp, Stenzinger, Albrecht, Hohenfellner, Markus, Schlemmer, Heinz-Peter, Maier-Hein, Klaus H., Bonekamp, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755653/
https://www.ncbi.nlm.nih.gov/pubmed/32767102
http://dx.doi.org/10.1007/s00330-020-07086-z
_version_ 1783626394788429824
author Schelb, Patrick
Wang, Xianfeng
Radtke, Jan Philipp
Wiesenfarth, Manuel
Kickingereder, Philipp
Stenzinger, Albrecht
Hohenfellner, Markus
Schlemmer, Heinz-Peter
Maier-Hein, Klaus H.
Bonekamp, David
author_facet Schelb, Patrick
Wang, Xianfeng
Radtke, Jan Philipp
Wiesenfarth, Manuel
Kickingereder, Philipp
Stenzinger, Albrecht
Hohenfellner, Markus
Schlemmer, Heinz-Peter
Maier-Hein, Klaus H.
Bonekamp, David
author_sort Schelb, Patrick
collection PubMed
description OBJECTIVES: To simulate clinical deployment, evaluate performance, and establish quality assurance of a deep learning algorithm (U-Net) for detection, localization, and segmentation of clinically significant prostate cancer (sPC), ISUP grade group ≥ 2, using bi-parametric MRI. METHODS: In 2017, 284 consecutive men in active surveillance, biopsy-naïve or pre-biopsied, received targeted and extended systematic MRI/transrectal US-fusion biopsy, after examination on a single MRI scanner (3 T). A prospective adjustment scheme was evaluated comparing the performance of the Prostate Imaging Reporting and Data System (PI-RADS) and U-Net using sensitivity, specificity, predictive values, and the Dice coefficient. RESULTS: In the 259 eligible men (median 64 [IQR 61–72] years), PI-RADS had a sensitivity of 98% [106/108]/84% [91/108] with a specificity of 17% [25/151]/58% [88/151], for thresholds at ≥ 3/≥ 4 respectively. U-Net using dynamic threshold adjustment had a sensitivity of 99% [107/108]/83% [90/108] (p > 0.99/> 0.99) with a specificity of 24% [36/151]/55% [83/151] (p > 0.99/> 0.99) for probability thresholds d3 and d4 emulating PI-RADS ≥ 3 and ≥ 4 decisions respectively, not statistically different from PI-RADS. Co-occurrence of a radiological PI-RADS ≥ 4 examination and U-Net ≥ d3 assessment significantly improved the positive predictive value from 59 to 63% (p = 0.03), on a per-patient basis. CONCLUSIONS: U-Net has similar performance to PI-RADS in simulated continued clinical use. Regular quality assurance should be implemented to ensure desired performance. KEY POINTS: • U-Net maintained similar diagnostic performance compared to radiological assessment of PI-RADS ≥ 4 when applied in a simulated clinical deployment. • Application of our proposed prospective dynamic calibration method successfully adjusted U-Net performance within acceptable limits of the PI-RADS reference over time, while not being limited to PI-RADS as a reference. • Simultaneous detection by U-Net and radiological assessment significantly improved the positive predictive value on a per-patient and per-lesion basis, while the negative predictive value remained unchanged. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00330-020-07086-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-7755653
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-77556532020-12-28 Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment Schelb, Patrick Wang, Xianfeng Radtke, Jan Philipp Wiesenfarth, Manuel Kickingereder, Philipp Stenzinger, Albrecht Hohenfellner, Markus Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Bonekamp, David Eur Radiol Urogenital OBJECTIVES: To simulate clinical deployment, evaluate performance, and establish quality assurance of a deep learning algorithm (U-Net) for detection, localization, and segmentation of clinically significant prostate cancer (sPC), ISUP grade group ≥ 2, using bi-parametric MRI. METHODS: In 2017, 284 consecutive men in active surveillance, biopsy-naïve or pre-biopsied, received targeted and extended systematic MRI/transrectal US-fusion biopsy, after examination on a single MRI scanner (3 T). A prospective adjustment scheme was evaluated comparing the performance of the Prostate Imaging Reporting and Data System (PI-RADS) and U-Net using sensitivity, specificity, predictive values, and the Dice coefficient. RESULTS: In the 259 eligible men (median 64 [IQR 61–72] years), PI-RADS had a sensitivity of 98% [106/108]/84% [91/108] with a specificity of 17% [25/151]/58% [88/151], for thresholds at ≥ 3/≥ 4 respectively. U-Net using dynamic threshold adjustment had a sensitivity of 99% [107/108]/83% [90/108] (p > 0.99/> 0.99) with a specificity of 24% [36/151]/55% [83/151] (p > 0.99/> 0.99) for probability thresholds d3 and d4 emulating PI-RADS ≥ 3 and ≥ 4 decisions respectively, not statistically different from PI-RADS. Co-occurrence of a radiological PI-RADS ≥ 4 examination and U-Net ≥ d3 assessment significantly improved the positive predictive value from 59 to 63% (p = 0.03), on a per-patient basis. CONCLUSIONS: U-Net has similar performance to PI-RADS in simulated continued clinical use. Regular quality assurance should be implemented to ensure desired performance. KEY POINTS: • U-Net maintained similar diagnostic performance compared to radiological assessment of PI-RADS ≥ 4 when applied in a simulated clinical deployment. • Application of our proposed prospective dynamic calibration method successfully adjusted U-Net performance within acceptable limits of the PI-RADS reference over time, while not being limited to PI-RADS as a reference. • Simultaneous detection by U-Net and radiological assessment significantly improved the positive predictive value on a per-patient and per-lesion basis, while the negative predictive value remained unchanged. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00330-020-07086-z) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2020-08-07 2021 /pmc/articles/PMC7755653/ /pubmed/32767102 http://dx.doi.org/10.1007/s00330-020-07086-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Urogenital
Schelb, Patrick
Wang, Xianfeng
Radtke, Jan Philipp
Wiesenfarth, Manuel
Kickingereder, Philipp
Stenzinger, Albrecht
Hohenfellner, Markus
Schlemmer, Heinz-Peter
Maier-Hein, Klaus H.
Bonekamp, David
Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_full Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_fullStr Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_full_unstemmed Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_short Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_sort simulated clinical deployment of fully automatic deep learning for clinical prostate mri assessment
topic Urogenital
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755653/
https://www.ncbi.nlm.nih.gov/pubmed/32767102
http://dx.doi.org/10.1007/s00330-020-07086-z
work_keys_str_mv AT schelbpatrick simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment
AT wangxianfeng simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment
AT radtkejanphilipp simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment
AT wiesenfarthmanuel simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment
AT kickingerederphilipp simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment
AT stenzingeralbrecht simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment
AT hohenfellnermarkus simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment
AT schlemmerheinzpeter simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment
AT maierheinklaush simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment
AT bonekampdavid simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment