Cargando…
Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules?
BACKGROUND: In differentiating indeterminate pulmonary nodules, multiple studies indicated the superiority of deep learning–based computer-assisted diagnosis system (DL-CADx) over conventional double reading by radiologists, which needs external validation. Therefore, our aim was to externally valid...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581733/ https://www.ncbi.nlm.nih.gov/pubmed/33163395 http://dx.doi.org/10.3389/fonc.2020.545862 |
_version_ | 1783599035068710912 |
---|---|
author | Liu, Zhou Li, Li Li, Tianran Luo, Douqiang Wang, Xiaoliang Luo, Dehong |
author_facet | Liu, Zhou Li, Li Li, Tianran Luo, Douqiang Wang, Xiaoliang Luo, Dehong |
author_sort | Liu, Zhou |
collection | PubMed |
description | BACKGROUND: In differentiating indeterminate pulmonary nodules, multiple studies indicated the superiority of deep learning–based computer-assisted diagnosis system (DL-CADx) over conventional double reading by radiologists, which needs external validation. Therefore, our aim was to externally validate the performance of a commercial DL-CADx in differentiating benign and malignant lung nodules. METHODS: In this retrospective study, 233 patients with 261 pathologically confirmed lung nodules were enrolled. Double reading was used to rate each nodule using a four-scale malignancy score system, including unlikely (0–25%), malignancy cannot be completely excluded (25–50%), highly likely (50–75%), and considered as malignant (75–100%), with any disagreement resolved through discussion. DL-CADx automatically rated each nodule with a malignancy likelihood ranging from 0 to 100%, which was then quadrichotomized accordingly. Intraclass correlation coefficient (ICC) was used to evaluate the agreement in malignancy risk rating between DL-CADx and double reading, with ICC value of <0.5, 0.5 to 0.75, 0.75 to 0.9, and >0.9 indicating poor, moderate, good, and perfect agreement, respectively. With malignancy likelihood >50% as cut-off value for malignancy and pathological results as gold standard, sensitivity, specificity, and accuracy were calculated for double reading and DL-CADx, separately. RESULTS: Among the 261 nodules, 247 nodules were successfully detected by DL-CADx with detection rate of 94.7%. Regarding malignancy rating, DL-CADx was in moderate agreement with double reading (ICC = 0.555, 95% CI 0.424 to 0.655). DL-CADx misdiagnosed 40 true malignant nodules as benign nodules and 30 true benign nodules as malignant nodules with sensitivity, specificity, and accuracy of 79.2, 45.5, and 71.7%, respectively. In contrast, double reading achieved better performance with 16 true malignant nodules misdiagnosed as benign nodules and 26 true benign nodules as malignant nodules with sensitivity, specificity, and accuracy of 91.7, 52.7, and 83.0%, respectively. CONCLUSION: Compared with double reading, DL-CADx we used still shows inferior performance in differentiating malignant and benign nodules. |
format | Online Article Text |
id | pubmed-7581733 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-75817332020-11-05 Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules? Liu, Zhou Li, Li Li, Tianran Luo, Douqiang Wang, Xiaoliang Luo, Dehong Front Oncol Oncology BACKGROUND: In differentiating indeterminate pulmonary nodules, multiple studies indicated the superiority of deep learning–based computer-assisted diagnosis system (DL-CADx) over conventional double reading by radiologists, which needs external validation. Therefore, our aim was to externally validate the performance of a commercial DL-CADx in differentiating benign and malignant lung nodules. METHODS: In this retrospective study, 233 patients with 261 pathologically confirmed lung nodules were enrolled. Double reading was used to rate each nodule using a four-scale malignancy score system, including unlikely (0–25%), malignancy cannot be completely excluded (25–50%), highly likely (50–75%), and considered as malignant (75–100%), with any disagreement resolved through discussion. DL-CADx automatically rated each nodule with a malignancy likelihood ranging from 0 to 100%, which was then quadrichotomized accordingly. Intraclass correlation coefficient (ICC) was used to evaluate the agreement in malignancy risk rating between DL-CADx and double reading, with ICC value of <0.5, 0.5 to 0.75, 0.75 to 0.9, and >0.9 indicating poor, moderate, good, and perfect agreement, respectively. With malignancy likelihood >50% as cut-off value for malignancy and pathological results as gold standard, sensitivity, specificity, and accuracy were calculated for double reading and DL-CADx, separately. RESULTS: Among the 261 nodules, 247 nodules were successfully detected by DL-CADx with detection rate of 94.7%. Regarding malignancy rating, DL-CADx was in moderate agreement with double reading (ICC = 0.555, 95% CI 0.424 to 0.655). DL-CADx misdiagnosed 40 true malignant nodules as benign nodules and 30 true benign nodules as malignant nodules with sensitivity, specificity, and accuracy of 79.2, 45.5, and 71.7%, respectively. In contrast, double reading achieved better performance with 16 true malignant nodules misdiagnosed as benign nodules and 26 true benign nodules as malignant nodules with sensitivity, specificity, and accuracy of 91.7, 52.7, and 83.0%, respectively. CONCLUSION: Compared with double reading, DL-CADx we used still shows inferior performance in differentiating malignant and benign nodules. Frontiers Media S.A. 2020-10-09 /pmc/articles/PMC7581733/ /pubmed/33163395 http://dx.doi.org/10.3389/fonc.2020.545862 Text en Copyright © 2020 Liu, Li, Li, Luo, Wang and Luo. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Liu, Zhou Li, Li Li, Tianran Luo, Douqiang Wang, Xiaoliang Luo, Dehong Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules? |
title | Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules? |
title_full | Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules? |
title_fullStr | Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules? |
title_full_unstemmed | Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules? |
title_short | Does a Deep Learning–Based Computer-Assisted Diagnosis System Outperform Conventional Double Reading by Radiologists in Distinguishing Benign and Malignant Lung Nodules? |
title_sort | does a deep learning–based computer-assisted diagnosis system outperform conventional double reading by radiologists in distinguishing benign and malignant lung nodules? |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581733/ https://www.ncbi.nlm.nih.gov/pubmed/33163395 http://dx.doi.org/10.3389/fonc.2020.545862 |
work_keys_str_mv | AT liuzhou doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules AT lili doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules AT litianran doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules AT luodouqiang doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules AT wangxiaoliang doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules AT luodehong doesadeeplearningbasedcomputerassisteddiagnosissystemoutperformconventionaldoublereadingbyradiologistsindistinguishingbenignandmalignantlungnodules |