Cargando…

Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules?

BACKGROUND: In differentiating indeterminate pulmonary nodules, multiple studies indicated the superiority of deep learning–based computer-assisted diagnosis system (DL-CADx) over conventional double reading by radiologists, which needs external validation. Therefore, our aim was to externally valid...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zhou, Li, Li, Li, Tianran, Luo, Douqiang, Wang, Xiaoliang, Luo, Dehong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581733/
https://www.ncbi.nlm.nih.gov/pubmed/33163395
http://dx.doi.org/10.3389/fonc.2020.545862
_version_ 1783599035068710912
author Liu, Zhou
Li, Li
Li, Tianran
Luo, Douqiang
Wang, Xiaoliang
Luo, Dehong
author_facet Liu, Zhou
Li, Li
Li, Tianran
Luo, Douqiang
Wang, Xiaoliang
Luo, Dehong
author_sort Liu, Zhou
collection PubMed
description BACKGROUND: In differentiating indeterminate pulmonary nodules, multiple studies indicated the superiority of deep learning–based computer-assisted diagnosis system (DL-CADx) over conventional double reading by radiologists, which needs external validation. Therefore, our aim was to externally validate the performance of a commercial DL-CADx in differentiating benign and malignant lung nodules. METHODS: In this retrospective study, 233 patients with 261 pathologically confirmed lung nodules were enrolled. Double reading was used to rate each nodule using a four-scale malignancy score system, including unlikely (0–25%), malignancy cannot be completely excluded (25–50%), highly likely (50–75%), and considered as malignant (75–100%), with any disagreement resolved through discussion. DL-CADx automatically rated each nodule with a malignancy likelihood ranging from 0 to 100%, which was then quadrichotomized accordingly. Intraclass correlation coefficient (ICC) was used to evaluate the agreement in malignancy risk rating between DL-CADx and double reading, with ICC value of <0.5, 0.5 to 0.75, 0.75 to 0.9, and >0.9 indicating poor, moderate, good, and perfect agreement, respectively. With malignancy likelihood >50% as cut-off value for malignancy and pathological results as gold standard, sensitivity, specificity, and accuracy were calculated for double reading and DL-CADx, separately. RESULTS: Among the 261 nodules, 247 nodules were successfully detected by DL-CADx with detection rate of 94.7%. Regarding malignancy rating, DL-CADx was in moderate agreement with double reading (ICC = 0.555, 95% CI 0.424 to 0.655). DL-CADx misdiagnosed 40 true malignant nodules as benign nodules and 30 true benign nodules as malignant nodules with sensitivity, specificity, and accuracy of 79.2, 45.5, and 71.7%, respectively. In contrast, double reading achieved better performance with 16 true malignant nodules misdiagnosed as benign nodules and 26 true benign nodules as malignant nodules with sensitivity, specificity, and accuracy of 91.7, 52.7, and 83.0%, respectively. CONCLUSION: Compared with double reading, DL-CADx we used still shows inferior performance in differentiating malignant and benign nodules.
format Online
Article
Text
id pubmed-7581733
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-75817332020-11-05 Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules? Liu, Zhou Li, Li Li, Tianran Luo, Douqiang Wang, Xiaoliang Luo, Dehong Front Oncol Oncology BACKGROUND: In differentiating indeterminate pulmonary nodules, multiple studies indicated the superiority of deep learning–based computer-assisted diagnosis system (DL-CADx) over conventional double reading by radiologists, which needs external validation. Therefore, our aim was to externally validate the performance of a commercial DL-CADx in differentiating benign and malignant lung nodules. METHODS: In this retrospective study, 233 patients with 261 pathologically confirmed lung nodules were enrolled. Double reading was used to rate each nodule using a four-scale malignancy score system, including unlikely (0–25%), malignancy cannot be completely excluded (25–50%), highly likely (50–75%), and considered as malignant (75–100%), with any disagreement resolved through discussion. DL-CADx automatically rated each nodule with a malignancy likelihood ranging from 0 to 100%, which was then quadrichotomized accordingly. Intraclass correlation coefficient (ICC) was used to evaluate the agreement in malignancy risk rating between DL-CADx and double reading, with ICC value of <0.5, 0.5 to 0.75, 0.75 to 0.9, and >0.9 indicating poor, moderate, good, and perfect agreement, respectively. With malignancy likelihood >50% as cut-off value for malignancy and pathological results as gold standard, sensitivity, specificity, and accuracy were calculated for double reading and DL-CADx, separately. RESULTS: Among the 261 nodules, 247 nodules were successfully detected by DL-CADx with detection rate of 94.7%. Regarding malignancy rating, DL-CADx was in moderate agreement with double reading (ICC = 0.555, 95% CI 0.424 to 0.655). DL-CADx misdiagnosed 40 true malignant nodules as benign nodules and 30 true benign nodules as malignant nodules with sensitivity, specificity, and accuracy of 79.2, 45.5, and 71.7%, respectively. In contrast, double reading achieved better performance with 16 true malignant nodules misdiagnosed as benign nodules and 26 true benign nodules as malignant nodules with sensitivity, specificity, and accuracy of 91.7, 52.7, and 83.0%, respectively. CONCLUSION: Compared with double reading, DL-CADx we used still shows inferior performance in differentiating malignant and benign nodules. Frontiers Media S.A. 2020-10-09 /pmc/articles/PMC7581733/ /pubmed/33163395 http://dx.doi.org/10.3389/fonc.2020.545862 Text en Copyright © 2020 Liu, Li, Li, Luo, Wang and Luo. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Liu, Zhou
Li, Li
Li, Tianran
Luo, Douqiang
Wang, Xiaoliang
Luo, Dehong
Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules?
title Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules?
title_full Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules?
title_fullStr Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules?
title_full_unstemmed Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules?
title_short Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules?
title_sort does a deep learning–based computer-assisted diagnosis system outperform conventional double reading by radiologists in distinguishing benign and malignant lung nodules?
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581733/
https://www.ncbi.nlm.nih.gov/pubmed/33163395
http://dx.doi.org/10.3389/fonc.2020.545862
work_keys_str_mv AT liuzhou doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules
AT lili doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules
AT litianran doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules
AT luodouqiang doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules
AT wangxiaoliang doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules
AT luodehong doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules