Cargando…

Diagnostic Test Accuracy of Deep Learning Detection of COVID-19: A Systematic Review and Meta-Analysis

RATIONALE AND OBJECTIVE: To perform a meta-analysis to compare the diagnostic test accuracy (DTA) of deep learning (DL) in detecting coronavirus disease 2019 (COVID-19), and to investigate how network architecture and type of datasets affect DL performance. MATERIALS AND METHODS: We searched PubMed,...

Descripción completa

Detalles Bibliográficos
Autores principales: Komolafe, Temitope Emmanuel, Cao, Yuzhu, Nguchu, Benedictor Alexander, Monkam, Patrice, Olaniyi, Ebenezer Obaloluwa, Sun, Haotian, Zheng, Jian, Yang, Xiaodong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Association of University Radiologists. Published by Elsevier Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8445811/
https://www.ncbi.nlm.nih.gov/pubmed/34649779
http://dx.doi.org/10.1016/j.acra.2021.08.008
Descripción
Sumario:RATIONALE AND OBJECTIVE: To perform a meta-analysis to compare the diagnostic test accuracy (DTA) of deep learning (DL) in detecting coronavirus disease 2019 (COVID-19), and to investigate how network architecture and type of datasets affect DL performance. MATERIALS AND METHODS: We searched PubMed, Web of Science and Inspec from January 1, 2020, to December 3, 2020, for retrospective and prospective studies on deep learning detection with at least reported sensitivity and specificity. Pooled DTA was obtained using random-effect models. Sub-group analysis between studies was also carried out for data source and network architectures. RESULTS: The pooled sensitivity and specificity were 91% (95% confidence interval [CI]: 88%, 93%; [Formula: see text]  = 69%) and 92% (95% CI: 88%, 94%; [Formula: see text]  = 88%), respectively for 19 studies. The pooled AUC and diagnostic odds ratio (DOR) were 0.95 (95% CI: 0.88, 0.92) and 112.5 (95% CI: 57.7, 219.3; [Formula: see text]  = 90%) respectively. The overall accuracy, recall, F1-score, LR(+) and LR(−) are 89.5%, 89.5%, 89.7%, 23.13 and 0.13. Sub-group analysis shows that the sensitivity and DOR significantly vary with the type of network architectures and sources of data with low heterogeneity are ([Formula: see text]  = 0%) and ([Formula: see text]  = 18%) for ResNet architecture and single-source datasets, respectively. CONCLUSION: The diagnosis of COVID-19 via deep learning has achieved incredible performance, and the source of datasets, as well as network architectures, strongly affect DL performance.