Cargando…
QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model
The reverse transcription-polymerase chain reaction (RT-PCR) test is considered the current gold standard for the detection of coronavirus disease (COVID-19), although it suffers from some shortcomings, namely comparatively longer turnaround time, higher false-negative rates around 20–25%, and highe...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8839805/ https://www.ncbi.nlm.nih.gov/pubmed/35180500 http://dx.doi.org/10.1016/j.compbiomed.2022.105284 |
_version_ | 1784650461479960576 |
---|---|
author | Rahman, Tawsifur Khandakar, Amith Abir, Farhan Fuad Faisal, Md Ahasan Atick Hossain, Md Shafayet Podder, Kanchon Kanti Abbas, Tariq O. Alam, Mohammed Fasihul Kashem, Saad Bin Islam, Mohammad Tariqul Zughaier, Susu M. Chowdhury, Muhammad E.H. |
author_facet | Rahman, Tawsifur Khandakar, Amith Abir, Farhan Fuad Faisal, Md Ahasan Atick Hossain, Md Shafayet Podder, Kanchon Kanti Abbas, Tariq O. Alam, Mohammed Fasihul Kashem, Saad Bin Islam, Mohammad Tariqul Zughaier, Susu M. Chowdhury, Muhammad E.H. |
author_sort | Rahman, Tawsifur |
collection | PubMed |
description | The reverse transcription-polymerase chain reaction (RT-PCR) test is considered the current gold standard for the detection of coronavirus disease (COVID-19), although it suffers from some shortcomings, namely comparatively longer turnaround time, higher false-negative rates around 20–25%, and higher cost equipment. Therefore, finding an efficient, robust, accurate, and widely available, and accessible alternative to RT-PCR for COVID-19 diagnosis is a matter of utmost importance. This study proposes a complete blood count (CBC) biomarkers-based COVID-19 detection system using a stacking machine learning (SML) model, which could be a fast and less expensive alternative. This study used seven different publicly available datasets, where the largest one consisting of fifteen CBC biomarkers collected from 1624 patients (52% COVID-19 positive) admitted at San Raphael Hospital, Italy from February to May 2020 was used to train and validate the proposed model. White blood cell count, monocytes (%), lymphocyte (%), and age parameters collected from the patients during hospital admission were found to be important biomarkers for COVID-19 disease prediction using five different feature selection techniques. Our stacking model produced the best performance with weighted precision, sensitivity, specificity, overall accuracy, and F1-score of 91.44%, 91.44%, 91.44%, 91.45%, and 91.45%, respectively. The stacking machine learning model improved the performance in comparison to other state-of-the-art machine learning classifiers. Finally, a nomogram-based scoring system (QCovSML) was constructed using this stacking approach to predict the COVID-19 patients. The cut-off value of the QCovSML system for classifying COVID-19 and Non-COVID patients was 4.8. Six datasets from three different countries were used to externally validate the proposed model to evaluate its generalizability and robustness. The nomogram demonstrated good calibration and discrimination with the area under the curve (AUC) of 0.961 for the internal cohort and average AUC of 0.967 for all external validation cohort, respectively. The external validation shows an average weighted precision, sensitivity, F1-score, specificity, and overall accuracy of 92.02%, 95.59%, 93.73%, 90.54%, and 93.34%, respectively. |
format | Online Article Text |
id | pubmed-8839805 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88398052022-02-14 QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model Rahman, Tawsifur Khandakar, Amith Abir, Farhan Fuad Faisal, Md Ahasan Atick Hossain, Md Shafayet Podder, Kanchon Kanti Abbas, Tariq O. Alam, Mohammed Fasihul Kashem, Saad Bin Islam, Mohammad Tariqul Zughaier, Susu M. Chowdhury, Muhammad E.H. Comput Biol Med Article The reverse transcription-polymerase chain reaction (RT-PCR) test is considered the current gold standard for the detection of coronavirus disease (COVID-19), although it suffers from some shortcomings, namely comparatively longer turnaround time, higher false-negative rates around 20–25%, and higher cost equipment. Therefore, finding an efficient, robust, accurate, and widely available, and accessible alternative to RT-PCR for COVID-19 diagnosis is a matter of utmost importance. This study proposes a complete blood count (CBC) biomarkers-based COVID-19 detection system using a stacking machine learning (SML) model, which could be a fast and less expensive alternative. This study used seven different publicly available datasets, where the largest one consisting of fifteen CBC biomarkers collected from 1624 patients (52% COVID-19 positive) admitted at San Raphael Hospital, Italy from February to May 2020 was used to train and validate the proposed model. White blood cell count, monocytes (%), lymphocyte (%), and age parameters collected from the patients during hospital admission were found to be important biomarkers for COVID-19 disease prediction using five different feature selection techniques. Our stacking model produced the best performance with weighted precision, sensitivity, specificity, overall accuracy, and F1-score of 91.44%, 91.44%, 91.44%, 91.45%, and 91.45%, respectively. The stacking machine learning model improved the performance in comparison to other state-of-the-art machine learning classifiers. Finally, a nomogram-based scoring system (QCovSML) was constructed using this stacking approach to predict the COVID-19 patients. The cut-off value of the QCovSML system for classifying COVID-19 and Non-COVID patients was 4.8. Six datasets from three different countries were used to externally validate the proposed model to evaluate its generalizability and robustness. The nomogram demonstrated good calibration and discrimination with the area under the curve (AUC) of 0.961 for the internal cohort and average AUC of 0.967 for all external validation cohort, respectively. The external validation shows an average weighted precision, sensitivity, F1-score, specificity, and overall accuracy of 92.02%, 95.59%, 93.73%, 90.54%, and 93.34%, respectively. Elsevier Ltd. 2022-04 2022-02-12 /pmc/articles/PMC8839805/ /pubmed/35180500 http://dx.doi.org/10.1016/j.compbiomed.2022.105284 Text en © 2022 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Rahman, Tawsifur Khandakar, Amith Abir, Farhan Fuad Faisal, Md Ahasan Atick Hossain, Md Shafayet Podder, Kanchon Kanti Abbas, Tariq O. Alam, Mohammed Fasihul Kashem, Saad Bin Islam, Mohammad Tariqul Zughaier, Susu M. Chowdhury, Muhammad E.H. QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model |
title | QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model |
title_full | QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model |
title_fullStr | QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model |
title_full_unstemmed | QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model |
title_short | QCovSML: A reliable COVID-19 detection system using CBC biomarkers by a stacking machine learning model |
title_sort | qcovsml: a reliable covid-19 detection system using cbc biomarkers by a stacking machine learning model |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8839805/ https://www.ncbi.nlm.nih.gov/pubmed/35180500 http://dx.doi.org/10.1016/j.compbiomed.2022.105284 |
work_keys_str_mv | AT rahmantawsifur qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT khandakaramith qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT abirfarhanfuad qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT faisalmdahasanatick qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT hossainmdshafayet qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT podderkanchonkanti qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT abbastariqo qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT alammohammedfasihul qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT kashemsaadbin qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT islammohammadtariqul qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT zughaiersusum qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel AT chowdhurymuhammadeh qcovsmlareliablecovid19detectionsystemusingcbcbiomarkersbyastackingmachinelearningmodel |