Cargando…

Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment

OBJECTIVES: To simulate clinical deployment, evaluate performance, and establish quality assurance of a deep learning algorithm (U-Net) for detection, localization, and segmentation of clinically significant prostate cancer (sPC), ISUP grade group ≥ 2, using bi-parametric MRI. METHODS: In 2017, 284...

Descripción completa

Detalles Bibliográficos
Autores principales:	Schelb, Patrick, Wang, Xianfeng, Radtke, Jan Philipp, Wiesenfarth, Manuel, Kickingereder, Philipp, Stenzinger, Albrecht, Hohenfellner, Markus, Schlemmer, Heinz-Peter, Maier-Hein, Klaus H., Bonekamp, David
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer Berlin Heidelberg 2020
Materias:	Urogenital
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755653/ https://www.ncbi.nlm.nih.gov/pubmed/32767102 http://dx.doi.org/10.1007/s00330-020-07086-z

_version_	1783626394788429824
author	Schelb, Patrick Wang, Xianfeng Radtke, Jan Philipp Wiesenfarth, Manuel Kickingereder, Philipp Stenzinger, Albrecht Hohenfellner, Markus Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Bonekamp, David
author_facet	Schelb, Patrick Wang, Xianfeng Radtke, Jan Philipp Wiesenfarth, Manuel Kickingereder, Philipp Stenzinger, Albrecht Hohenfellner, Markus Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Bonekamp, David
author_sort	Schelb, Patrick
collection	PubMed
description	OBJECTIVES: To simulate clinical deployment, evaluate performance, and establish quality assurance of a deep learning algorithm (U-Net) for detection, localization, and segmentation of clinically significant prostate cancer (sPC), ISUP grade group ≥ 2, using bi-parametric MRI. METHODS: In 2017, 284 consecutive men in active surveillance, biopsy-naïve or pre-biopsied, received targeted and extended systematic MRI/transrectal US-fusion biopsy, after examination on a single MRI scanner (3 T). A prospective adjustment scheme was evaluated comparing the performance of the Prostate Imaging Reporting and Data System (PI-RADS) and U-Net using sensitivity, specificity, predictive values, and the Dice coefficient. RESULTS: In the 259 eligible men (median 64 [IQR 61–72] years), PI-RADS had a sensitivity of 98% [106/108]/84% [91/108] with a specificity of 17% [25/151]/58% [88/151], for thresholds at ≥ 3/≥ 4 respectively. U-Net using dynamic threshold adjustment had a sensitivity of 99% [107/108]/83% [90/108] (p > 0.99/> 0.99) with a specificity of 24% [36/151]/55% [83/151] (p > 0.99/> 0.99) for probability thresholds d3 and d4 emulating PI-RADS ≥ 3 and ≥ 4 decisions respectively, not statistically different from PI-RADS. Co-occurrence of a radiological PI-RADS ≥ 4 examination and U-Net ≥ d3 assessment significantly improved the positive predictive value from 59 to 63% (p = 0.03), on a per-patient basis. CONCLUSIONS: U-Net has similar performance to PI-RADS in simulated continued clinical use. Regular quality assurance should be implemented to ensure desired performance. KEY POINTS: • U-Net maintained similar diagnostic performance compared to radiological assessment of PI-RADS ≥ 4 when applied in a simulated clinical deployment. • Application of our proposed prospective dynamic calibration method successfully adjusted U-Net performance within acceptable limits of the PI-RADS reference over time, while not being limited to PI-RADS as a reference. • Simultaneous detection by U-Net and radiological assessment significantly improved the positive predictive value on a per-patient and per-lesion basis, while the negative predictive value remained unchanged. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00330-020-07086-z) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-7755653
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Springer Berlin Heidelberg
record_format	MEDLINE/PubMed
spelling	pubmed-77556532020-12-28 Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment Schelb, Patrick Wang, Xianfeng Radtke, Jan Philipp Wiesenfarth, Manuel Kickingereder, Philipp Stenzinger, Albrecht Hohenfellner, Markus Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Bonekamp, David Eur Radiol Urogenital OBJECTIVES: To simulate clinical deployment, evaluate performance, and establish quality assurance of a deep learning algorithm (U-Net) for detection, localization, and segmentation of clinically significant prostate cancer (sPC), ISUP grade group ≥ 2, using bi-parametric MRI. METHODS: In 2017, 284 consecutive men in active surveillance, biopsy-naïve or pre-biopsied, received targeted and extended systematic MRI/transrectal US-fusion biopsy, after examination on a single MRI scanner (3 T). A prospective adjustment scheme was evaluated comparing the performance of the Prostate Imaging Reporting and Data System (PI-RADS) and U-Net using sensitivity, specificity, predictive values, and the Dice coefficient. RESULTS: In the 259 eligible men (median 64 [IQR 61–72] years), PI-RADS had a sensitivity of 98% [106/108]/84% [91/108] with a specificity of 17% [25/151]/58% [88/151], for thresholds at ≥ 3/≥ 4 respectively. U-Net using dynamic threshold adjustment had a sensitivity of 99% [107/108]/83% [90/108] (p > 0.99/> 0.99) with a specificity of 24% [36/151]/55% [83/151] (p > 0.99/> 0.99) for probability thresholds d3 and d4 emulating PI-RADS ≥ 3 and ≥ 4 decisions respectively, not statistically different from PI-RADS. Co-occurrence of a radiological PI-RADS ≥ 4 examination and U-Net ≥ d3 assessment significantly improved the positive predictive value from 59 to 63% (p = 0.03), on a per-patient basis. CONCLUSIONS: U-Net has similar performance to PI-RADS in simulated continued clinical use. Regular quality assurance should be implemented to ensure desired performance. KEY POINTS: • U-Net maintained similar diagnostic performance compared to radiological assessment of PI-RADS ≥ 4 when applied in a simulated clinical deployment. • Application of our proposed prospective dynamic calibration method successfully adjusted U-Net performance within acceptable limits of the PI-RADS reference over time, while not being limited to PI-RADS as a reference. • Simultaneous detection by U-Net and radiological assessment significantly improved the positive predictive value on a per-patient and per-lesion basis, while the negative predictive value remained unchanged. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00330-020-07086-z) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2020-08-07 2021 /pmc/articles/PMC7755653/ /pubmed/32767102 http://dx.doi.org/10.1007/s00330-020-07086-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Urogenital Schelb, Patrick Wang, Xianfeng Radtke, Jan Philipp Wiesenfarth, Manuel Kickingereder, Philipp Stenzinger, Albrecht Hohenfellner, Markus Schlemmer, Heinz-Peter Maier-Hein, Klaus H. Bonekamp, David Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title	Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_full	Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_fullStr	Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_full_unstemmed	Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_short	Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment
title_sort	simulated clinical deployment of fully automatic deep learning for clinical prostate mri assessment
topic	Urogenital
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755653/ https://www.ncbi.nlm.nih.gov/pubmed/32767102 http://dx.doi.org/10.1007/s00330-020-07086-z
work_keys_str_mv	AT schelbpatrick simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT wangxianfeng simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT radtkejanphilipp simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT wiesenfarthmanuel simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT kickingerederphilipp simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT stenzingeralbrecht simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT hohenfellnermarkus simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT schlemmerheinzpeter simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT maierheinklaush simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment AT bonekampdavid simulatedclinicaldeploymentoffullyautomaticdeeplearningforclinicalprostatemriassessment

Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment

Ejemplares similares