Cargando…

QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model

The reverse transcription-polymerase chain reaction (RT-PCR) test is considered the current gold standard for the detection of coronavirus disease (COVID-19), although it suffers from some shortcomings, namely comparatively longer turnaround time, higher false-negative rates around 20–25%, and highe...

Descripción completa

Detalles Bibliográficos
Autores principales: Rahman, Tawsifur, Khandakar, Amith, Abir, Farhan Fuad, Faisal, Md Ahasan Atick, Hossain, Md Shafayet, Podder, Kanchon Kanti, Abbas, Tariq O., Alam, Mohammed Fasihul, Kashem, Saad Bin, Islam, Mohammad Tariqul, Zughaier, Susu M., Chowdhury, Muhammad E.H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8839805/
https://www.ncbi.nlm.nih.gov/pubmed/35180500
http://dx.doi.org/10.1016/j.compbiomed.2022.105284
_version_ 1784650461479960576
author Rahman, Tawsifur
Khandakar, Amith
Abir, Farhan Fuad
Faisal, Md Ahasan Atick
Hossain, Md Shafayet
Podder, Kanchon Kanti
Abbas, Tariq O.
Alam, Mohammed Fasihul
Kashem, Saad Bin
Islam, Mohammad Tariqul
Zughaier, Susu M.
Chowdhury, Muhammad E.H.
author_facet Rahman, Tawsifur
Khandakar, Amith
Abir, Farhan Fuad
Faisal, Md Ahasan Atick
Hossain, Md Shafayet
Podder, Kanchon Kanti
Abbas, Tariq O.
Alam, Mohammed Fasihul
Kashem, Saad Bin
Islam, Mohammad Tariqul
Zughaier, Susu M.
Chowdhury, Muhammad E.H.
author_sort Rahman, Tawsifur
collection PubMed
description The reverse transcription-polymerase chain reaction (RT-PCR) test is considered the current gold standard for the detection of coronavirus disease (COVID-19), although it suffers from some shortcomings, namely comparatively longer turnaround time, higher false-negative rates around 20–25%, and higher cost equipment. Therefore, finding an efficient, robust, accurate, and widely available, and accessible alternative to RT-PCR for COVID-19 diagnosis is a matter of utmost importance. This study proposes a complete blood count (CBC) biomarkers-based COVID-19 detection system using a stacking machine learning (SML) model, which could be a fast and less expensive alternative. This study used seven different publicly available datasets, where the largest one consisting of fifteen CBC biomarkers collected from 1624 patients (52% COVID-19 positive) admitted at San Raphael Hospital, Italy from February to May 2020 was used to train and validate the proposed model. White blood cell count, monocytes (%), lymphocyte (%), and age parameters collected from the patients during hospital admission were found to be important biomarkers for COVID-19 disease prediction using five different feature selection techniques. Our stacking model produced the best performance with weighted precision, sensitivity, specificity, overall accuracy, and F1-score of 91.44%, 91.44%, 91.44%, 91.45%, and 91.45%, respectively. The stacking machine learning model improved the performance in comparison to other state-of-the-art machine learning classifiers. Finally, a nomogram-based scoring system (QCovSML) was constructed using this stacking approach to predict the COVID-19 patients. The cut-off value of the QCovSML system for classifying COVID-19 and Non-COVID patients was 4.8. Six datasets from three different countries were used to externally validate the proposed model to evaluate its generalizability and robustness. The nomogram demonstrated good calibration and discrimination with the area under the curve (AUC) of 0.961 for the internal cohort and average AUC of 0.967 for all external validation cohort, respectively. The external validation shows an average weighted precision, sensitivity, F1-score, specificity, and overall accuracy of 92.02%, 95.59%, 93.73%, 90.54%, and 93.34%, respectively.
format Online
Article
Text
id pubmed-8839805
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-88398052022-02-14 QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model Rahman, Tawsifur Khandakar, Amith Abir, Farhan Fuad Faisal, Md Ahasan Atick Hossain, Md Shafayet Podder, Kanchon Kanti Abbas, Tariq O. Alam, Mohammed Fasihul Kashem, Saad Bin Islam, Mohammad Tariqul Zughaier, Susu M. Chowdhury, Muhammad E.H. Comput Biol Med Article The reverse transcription-polymerase chain reaction (RT-PCR) test is considered the current gold standard for the detection of coronavirus disease (COVID-19), although it suffers from some shortcomings, namely comparatively longer turnaround time, higher false-negative rates around 20–25%, and higher cost equipment. Therefore, finding an efficient, robust, accurate, and widely available, and accessible alternative to RT-PCR for COVID-19 diagnosis is a matter of utmost importance. This study proposes a complete blood count (CBC) biomarkers-based COVID-19 detection system using a stacking machine learning (SML) model, which could be a fast and less expensive alternative. This study used seven different publicly available datasets, where the largest one consisting of fifteen CBC biomarkers collected from 1624 patients (52% COVID-19 positive) admitted at San Raphael Hospital, Italy from February to May 2020 was used to train and validate the proposed model. White blood cell count, monocytes (%), lymphocyte (%), and age parameters collected from the patients during hospital admission were found to be important biomarkers for COVID-19 disease prediction using five different feature selection techniques. Our stacking model produced the best performance with weighted precision, sensitivity, specificity, overall accuracy, and F1-score of 91.44%, 91.44%, 91.44%, 91.45%, and 91.45%, respectively. The stacking machine learning model improved the performance in comparison to other state-of-the-art machine learning classifiers. Finally, a nomogram-based scoring system (QCovSML) was constructed using this stacking approach to predict the COVID-19 patients. The cut-off value of the QCovSML system for classifying COVID-19 and Non-COVID patients was 4.8. Six datasets from three different countries were used to externally validate the proposed model to evaluate its generalizability and robustness. The nomogram demonstrated good calibration and discrimination with the area under the curve (AUC) of 0.961 for the internal cohort and average AUC of 0.967 for all external validation cohort, respectively. The external validation shows an average weighted precision, sensitivity, F1-score, specificity, and overall accuracy of 92.02%, 95.59%, 93.73%, 90.54%, and 93.34%, respectively. Elsevier Ltd. 2022-04 2022-02-12 /pmc/articles/PMC8839805/ /pubmed/35180500 http://dx.doi.org/10.1016/j.compbiomed.2022.105284 Text en © 2022 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Rahman, Tawsifur
Khandakar, Amith
Abir, Farhan Fuad
Faisal, Md Ahasan Atick
Hossain, Md Shafayet
Podder, Kanchon Kanti
Abbas, Tariq O.
Alam, Mohammed Fasihul
Kashem, Saad Bin
Islam, Mohammad Tariqul
Zughaier, Susu M.
Chowdhury, Muhammad E.H.
QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model
title QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model
title_full QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model
title_fullStr QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model
title_full_unstemmed QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model
title_short QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model
title_sort qcovsml: a reliable covid-19 detection system using cbc biomarkers by a stacking machine learning model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8839805/
https://www.ncbi.nlm.nih.gov/pubmed/35180500
http://dx.doi.org/10.1016/j.compbiomed.2022.105284
work_keys_str_mv AT rahmantawsifur qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT khandakaramith qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT abirfarhanfuad qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT faisalmdahasanatick qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT hossainmdshafayet qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT podderkanchonkanti qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT abbastariqo qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT alammohammedfasihul qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT kashemsaadbin qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT islammohammadtariqul qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT zughaiersusum qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel
AT chowdhurymuhammadeh qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel