Cargando…

External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count

PURPOSE: The rRT-PCR for COVID-19 diagnosis is affected by long turnaround time, potential shortage of reagents, high false-negative rates and high costs. Routine hematochemical tests are a faster and less expensive alternative for diagnosis. Thus, Machine Learning (ML) has been applied to hematolog...

Descripción completa

Detalles Bibliográficos
Autores principales: Campagner, Andrea, Carobene, Anna, Cabitza, Federico
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8540880/
https://www.ncbi.nlm.nih.gov/pubmed/34721844
http://dx.doi.org/10.1007/s13755-021-00167-3
_version_ 1784589093540200448
author Campagner, Andrea
Carobene, Anna
Cabitza, Federico
author_facet Campagner, Andrea
Carobene, Anna
Cabitza, Federico
author_sort Campagner, Andrea
collection PubMed
description PURPOSE: The rRT-PCR for COVID-19 diagnosis is affected by long turnaround time, potential shortage of reagents, high false-negative rates and high costs. Routine hematochemical tests are a faster and less expensive alternative for diagnosis. Thus, Machine Learning (ML) has been applied to hematological parameters to develop diagnostic tools and help clinicians in promptly managing positive patients. However, few ML models have been externally validated, making their real-world applicability unclear. METHODS: We externally validate 6 state-of-the-art diagnostic ML models, based on Complete Blood Count (CBC) and trained on a dataset encompassing 816 COVID-19 positive cases. The external validation was performed based on two datasets, collected at two different hospitals in northern Italy and encompassing 163 and 104 COVID-19 positive cases, in terms of both error rate and calibration. RESULTS AND CONCLUSION: We report an average AUC of 95% and average Brier score of 0.11, out-performing existing ML methods, and showing good cross-site transportability. The best performing model (SVM) reported an average AUC of 97.5% (Sensitivity: 87.5%, Specificity: 94%), comparable with the performance of RT-PCR, and was also the best calibrated. The validated models can be useful in the early identification of potential COVID-19 patients, due to the rapid availability of CBC exams, and in multiple test settings.
format Online
Article
Text
id pubmed-8540880
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-85408802021-10-25 External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count Campagner, Andrea Carobene, Anna Cabitza, Federico Health Inf Sci Syst Research PURPOSE: The rRT-PCR for COVID-19 diagnosis is affected by long turnaround time, potential shortage of reagents, high false-negative rates and high costs. Routine hematochemical tests are a faster and less expensive alternative for diagnosis. Thus, Machine Learning (ML) has been applied to hematological parameters to develop diagnostic tools and help clinicians in promptly managing positive patients. However, few ML models have been externally validated, making their real-world applicability unclear. METHODS: We externally validate 6 state-of-the-art diagnostic ML models, based on Complete Blood Count (CBC) and trained on a dataset encompassing 816 COVID-19 positive cases. The external validation was performed based on two datasets, collected at two different hospitals in northern Italy and encompassing 163 and 104 COVID-19 positive cases, in terms of both error rate and calibration. RESULTS AND CONCLUSION: We report an average AUC of 95% and average Brier score of 0.11, out-performing existing ML methods, and showing good cross-site transportability. The best performing model (SVM) reported an average AUC of 97.5% (Sensitivity: 87.5%, Specificity: 94%), comparable with the performance of RT-PCR, and was also the best calibrated. The validated models can be useful in the early identification of potential COVID-19 patients, due to the rapid availability of CBC exams, and in multiple test settings. Springer International Publishing 2021-10-23 /pmc/articles/PMC8540880/ /pubmed/34721844 http://dx.doi.org/10.1007/s13755-021-00167-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research
Campagner, Andrea
Carobene, Anna
Cabitza, Federico
External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count
title External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count
title_full External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count
title_fullStr External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count
title_full_unstemmed External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count
title_short External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count
title_sort external validation of machine learning models for covid-19 detection based on complete blood count
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8540880/
https://www.ncbi.nlm.nih.gov/pubmed/34721844
http://dx.doi.org/10.1007/s13755-021-00167-3
work_keys_str_mv AT campagnerandrea externalvalidationofmachinelearningmodelsforcovid19detectionbasedoncompletebloodcount
AT carobeneanna externalvalidationofmachinelearningmodelsforcovid19detectionbasedoncompletebloodcount
AT cabitzafederico externalvalidationofmachinelearningmodelsforcovid19detectionbasedoncompletebloodcount