Cargando…

Deep Convolutional Neural Network–Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts With Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths

OBJECTIVES: The aim of this study was to clinically validate a Deep Convolutional Neural Network (DCNN) for the detection of surgically proven anterior cruciate ligament (ACL) tears in a large patient cohort and to analyze the effect of magnetic resonance examinations from different institutions, va...

Descripción completa

Detalles Bibliográficos
Autores principales: Germann, Christoph, Marbach, Giuseppe, Civardi, Francesco, Fucentese, Sandro F., Fritz, Jan, Sutter, Reto, Pfirrmann, Christian W.A., Fritz, Benjamin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Lippincott Williams & Wilkins 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7343178/
https://www.ncbi.nlm.nih.gov/pubmed/32168039
http://dx.doi.org/10.1097/RLI.0000000000000664
_version_ 1783555717767102464
author Germann, Christoph
Marbach, Giuseppe
Civardi, Francesco
Fucentese, Sandro F.
Fritz, Jan
Sutter, Reto
Pfirrmann, Christian W.A.
Fritz, Benjamin
author_facet Germann, Christoph
Marbach, Giuseppe
Civardi, Francesco
Fucentese, Sandro F.
Fritz, Jan
Sutter, Reto
Pfirrmann, Christian W.A.
Fritz, Benjamin
author_sort Germann, Christoph
collection PubMed
description OBJECTIVES: The aim of this study was to clinically validate a Deep Convolutional Neural Network (DCNN) for the detection of surgically proven anterior cruciate ligament (ACL) tears in a large patient cohort and to analyze the effect of magnetic resonance examinations from different institutions, varying protocols, and field strengths. MATERIALS AND METHODS: After ethics committee approval, this retrospective analysis of prospectively collected data was performed on 512 consecutive subjects, who underwent knee magnetic resonance imaging (MRI) in a total of 59 different institutions followed by arthroscopic knee surgery at our institution. The DCNN and 3 fellowship-trained full-time academic musculoskeletal radiologists evaluated the MRI examinations for full-thickness ACL tears independently. Surgical reports served as the reference standard. Statistics included diagnostic performance metrics, including sensitivity, specificity, area under the receiver operating curve (“AUC ROC”), and kappa statistics. P values less than 0.05 were considered to represent statistical significance. RESULTS: Anterior cruciate ligament tears were present in 45.7% (234/512) and absent in 54.3% (278/512) of the subjects. The DCNN had a sensitivity of 96.1%, which was not significantly different from the readers (97.5%–97.9%; all P ≥ 0.118), but significantly lower specificity of 93.1% (readers, 99.6%–100%; all P < 0.001) and “AUC ROC” of 0.935 (readers, 0.989–0.991; all P < 0.001) for the entire cohort. Subgroup analysis showed a significantly lower sensitivity, specificity, and “AUC ROC” of the DCNN for outside MRI (92.5%, 87.1%, and 0.898, respectively) than in-house MRI (99.0%, 94.4%, and 0.967, respectively) examinations (P = 0.026, P = 0.043, and P < 0.05, respectively). There were no significant differences in DCNN performance for 1.5-T and 3-T MRI examinations (all P ≥ 0.753, respectively). CONCLUSIONS: Deep Convolutional Neural Network performance of ACL tear diagnosis can approach performance levels similar to fellowship-trained full-time academic musculoskeletal radiologists at 1.5 T and 3 T; however, the performance may decrease with increasing MRI examination heterogeneity.
format Online
Article
Text
id pubmed-7343178
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Lippincott Williams & Wilkins
record_format MEDLINE/PubMed
spelling pubmed-73431782020-08-05 Deep Convolutional Neural Network–Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts With Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths Germann, Christoph Marbach, Giuseppe Civardi, Francesco Fucentese, Sandro F. Fritz, Jan Sutter, Reto Pfirrmann, Christian W.A. Fritz, Benjamin Invest Radiol Original Articles OBJECTIVES: The aim of this study was to clinically validate a Deep Convolutional Neural Network (DCNN) for the detection of surgically proven anterior cruciate ligament (ACL) tears in a large patient cohort and to analyze the effect of magnetic resonance examinations from different institutions, varying protocols, and field strengths. MATERIALS AND METHODS: After ethics committee approval, this retrospective analysis of prospectively collected data was performed on 512 consecutive subjects, who underwent knee magnetic resonance imaging (MRI) in a total of 59 different institutions followed by arthroscopic knee surgery at our institution. The DCNN and 3 fellowship-trained full-time academic musculoskeletal radiologists evaluated the MRI examinations for full-thickness ACL tears independently. Surgical reports served as the reference standard. Statistics included diagnostic performance metrics, including sensitivity, specificity, area under the receiver operating curve (“AUC ROC”), and kappa statistics. P values less than 0.05 were considered to represent statistical significance. RESULTS: Anterior cruciate ligament tears were present in 45.7% (234/512) and absent in 54.3% (278/512) of the subjects. The DCNN had a sensitivity of 96.1%, which was not significantly different from the readers (97.5%–97.9%; all P ≥ 0.118), but significantly lower specificity of 93.1% (readers, 99.6%–100%; all P < 0.001) and “AUC ROC” of 0.935 (readers, 0.989–0.991; all P < 0.001) for the entire cohort. Subgroup analysis showed a significantly lower sensitivity, specificity, and “AUC ROC” of the DCNN for outside MRI (92.5%, 87.1%, and 0.898, respectively) than in-house MRI (99.0%, 94.4%, and 0.967, respectively) examinations (P = 0.026, P = 0.043, and P < 0.05, respectively). There were no significant differences in DCNN performance for 1.5-T and 3-T MRI examinations (all P ≥ 0.753, respectively). CONCLUSIONS: Deep Convolutional Neural Network performance of ACL tear diagnosis can approach performance levels similar to fellowship-trained full-time academic musculoskeletal radiologists at 1.5 T and 3 T; however, the performance may decrease with increasing MRI examination heterogeneity. Lippincott Williams & Wilkins 2020-08 2020-03-11 /pmc/articles/PMC7343178/ /pubmed/32168039 http://dx.doi.org/10.1097/RLI.0000000000000664 Text en Copyright © 2020 The Author(s). Published by Wolters Kluwer Health, Inc. This is an open access article distributed under the Creative Commons Attribution License 4.0 (CCBY) (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article is made available via the PMC Open Access Subset for unrestricted re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the COVID-19 pandemic or until permissions are revoked in writing. Upon expiration of these permissions, PMC is granted a perpetual license to make this article available via PMC and Europe PMC, consistent with existing copyright protections.
spellingShingle Original Articles
Germann, Christoph
Marbach, Giuseppe
Civardi, Francesco
Fucentese, Sandro F.
Fritz, Jan
Sutter, Reto
Pfirrmann, Christian W.A.
Fritz, Benjamin
Deep Convolutional Neural Network–Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts With Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths
title Deep Convolutional Neural Network–Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts With Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths
title_full Deep Convolutional Neural Network–Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts With Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths
title_fullStr Deep Convolutional Neural Network–Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts With Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths
title_full_unstemmed Deep Convolutional Neural Network–Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts With Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths
title_short Deep Convolutional Neural Network–Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts With Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths
title_sort deep convolutional neural network–based diagnosis of anterior cruciate ligament tears: performance comparison of homogenous versus heterogeneous knee mri cohorts with different pulse sequence protocols and 1.5-t and 3-t magnetic field strengths
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7343178/
https://www.ncbi.nlm.nih.gov/pubmed/32168039
http://dx.doi.org/10.1097/RLI.0000000000000664
work_keys_str_mv AT germannchristoph deepconvolutionalneuralnetworkbaseddiagnosisofanteriorcruciateligamenttearsperformancecomparisonofhomogenousversusheterogeneouskneemricohortswithdifferentpulsesequenceprotocolsand15tand3tmagneticfieldstrengths
AT marbachgiuseppe deepconvolutionalneuralnetworkbaseddiagnosisofanteriorcruciateligamenttearsperformancecomparisonofhomogenousversusheterogeneouskneemricohortswithdifferentpulsesequenceprotocolsand15tand3tmagneticfieldstrengths
AT civardifrancesco deepconvolutionalneuralnetworkbaseddiagnosisofanteriorcruciateligamenttearsperformancecomparisonofhomogenousversusheterogeneouskneemricohortswithdifferentpulsesequenceprotocolsand15tand3tmagneticfieldstrengths
AT fucentesesandrof deepconvolutionalneuralnetworkbaseddiagnosisofanteriorcruciateligamenttearsperformancecomparisonofhomogenousversusheterogeneouskneemricohortswithdifferentpulsesequenceprotocolsand15tand3tmagneticfieldstrengths
AT fritzjan deepconvolutionalneuralnetworkbaseddiagnosisofanteriorcruciateligamenttearsperformancecomparisonofhomogenousversusheterogeneouskneemricohortswithdifferentpulsesequenceprotocolsand15tand3tmagneticfieldstrengths
AT sutterreto deepconvolutionalneuralnetworkbaseddiagnosisofanteriorcruciateligamenttearsperformancecomparisonofhomogenousversusheterogeneouskneemricohortswithdifferentpulsesequenceprotocolsand15tand3tmagneticfieldstrengths
AT pfirrmannchristianwa deepconvolutionalneuralnetworkbaseddiagnosisofanteriorcruciateligamenttearsperformancecomparisonofhomogenousversusheterogeneouskneemricohortswithdifferentpulsesequenceprotocolsand15tand3tmagneticfieldstrengths
AT fritzbenjamin deepconvolutionalneuralnetworkbaseddiagnosisofanteriorcruciateligamenttearsperformancecomparisonofhomogenousversusheterogeneouskneemricohortswithdifferentpulsesequenceprotocolsand15tand3tmagneticfieldstrengths