Cargando…

Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study

Coronavirus disease-2019, also known as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was a disaster in 2020. Accurate and early diagnosis of coronavirus disease-2019 (COVID-19) is still essential for health policymaking. Reverse transcriptase-polymerase chain reaction (RT-PCR) has b...

Descripción completa

Detalles Bibliográficos
Autores principales: Marateb, Hamid Reza, Ziaie Nezhad, Farzad, Mohebian, Mohammad Reza, Sami, Ramin, Haghjooy Javanmard, Shaghayegh, Dehghan Niri, Fatemeh, Akafzadeh-Savari, Mahsa, Mansourian, Marjan, Mañanas, Miquel Angel, Wolkewitz, Martin, Binder, Harald
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8640954/
https://www.ncbi.nlm.nih.gov/pubmed/34869483
http://dx.doi.org/10.3389/fmed.2021.768467
_version_ 1784609413571543040
author Marateb, Hamid Reza
Ziaie Nezhad, Farzad
Mohebian, Mohammad Reza
Sami, Ramin
Haghjooy Javanmard, Shaghayegh
Dehghan Niri, Fatemeh
Akafzadeh-Savari, Mahsa
Mansourian, Marjan
Mañanas, Miquel Angel
Wolkewitz, Martin
Binder, Harald
author_facet Marateb, Hamid Reza
Ziaie Nezhad, Farzad
Mohebian, Mohammad Reza
Sami, Ramin
Haghjooy Javanmard, Shaghayegh
Dehghan Niri, Fatemeh
Akafzadeh-Savari, Mahsa
Mansourian, Marjan
Mañanas, Miquel Angel
Wolkewitz, Martin
Binder, Harald
author_sort Marateb, Hamid Reza
collection PubMed
description Coronavirus disease-2019, also known as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was a disaster in 2020. Accurate and early diagnosis of coronavirus disease-2019 (COVID-19) is still essential for health policymaking. Reverse transcriptase-polymerase chain reaction (RT-PCR) has been performed as the operational gold standard for COVID-19 diagnosis. We aimed to design and implement a reliable COVID-19 diagnosis method to provide the risk of infection using demographics, symptoms and signs, blood markers, and family history of diseases to have excellent agreement with the results obtained by the RT-PCR and CT-scan. Our study primarily used sample data from a 1-year hospital-based prospective COVID-19 open-cohort, the Khorshid COVID Cohort (KCC) study. A sample of 634 patients with COVID-19 and 118 patients with pneumonia with similar characteristics whose RT-PCR and chest CT scan were negative (as the control group) (dataset 1) was used to design the system and for internal validation. Two other online datasets, namely, some symptoms (dataset 2) and blood tests (dataset 3), were also analyzed. A combination of one-hot encoding, stability feature selection, over-sampling, and an ensemble classifier was used. Ten-fold stratified cross-validation was performed. In addition to gender and symptom duration, signs and symptoms, blood biomarkers, and comorbidities were selected. Performance indices of the cross-validated confusion matrix for dataset 1 were as follows: sensitivity of 96% [confidence interval, CI, 95%: 94–98], specificity of 95% [90–99], positive predictive value (PPV) of 99% [98–100], negative predictive value (NPV) of 82% [76–89], diagnostic odds ratio (DOR) of 496 [198–1,245], area under the ROC (AUC) of 0.96 [0.94–0.97], Matthews Correlation Coefficient (MCC) of 0.87 [0.85–0.88], accuracy of 96% [94–98], and Cohen's Kappa of 0.86 [0.81–0.91]. The proposed algorithm showed excellent diagnosis accuracy and class-labeling agreement, and fair discriminant power. The AUC on the datasets 2 and 3 was 0.97 [0.96–0.98] and 0.92 [0.91–0.94], respectively. The most important feature was white blood cell count, shortness of breath, and C-reactive protein for datasets 1, 2, and 3, respectively. The proposed algorithm is, thus, a promising COVID-19 diagnosis method, which could be an amendment to simple blood tests and screening of symptoms. However, the RT-PCR and chest CT-scan, performed as the gold standard, are not 100% accurate.
format Online
Article
Text
id pubmed-8640954
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86409542021-12-04 Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study Marateb, Hamid Reza Ziaie Nezhad, Farzad Mohebian, Mohammad Reza Sami, Ramin Haghjooy Javanmard, Shaghayegh Dehghan Niri, Fatemeh Akafzadeh-Savari, Mahsa Mansourian, Marjan Mañanas, Miquel Angel Wolkewitz, Martin Binder, Harald Front Med (Lausanne) Medicine Coronavirus disease-2019, also known as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was a disaster in 2020. Accurate and early diagnosis of coronavirus disease-2019 (COVID-19) is still essential for health policymaking. Reverse transcriptase-polymerase chain reaction (RT-PCR) has been performed as the operational gold standard for COVID-19 diagnosis. We aimed to design and implement a reliable COVID-19 diagnosis method to provide the risk of infection using demographics, symptoms and signs, blood markers, and family history of diseases to have excellent agreement with the results obtained by the RT-PCR and CT-scan. Our study primarily used sample data from a 1-year hospital-based prospective COVID-19 open-cohort, the Khorshid COVID Cohort (KCC) study. A sample of 634 patients with COVID-19 and 118 patients with pneumonia with similar characteristics whose RT-PCR and chest CT scan were negative (as the control group) (dataset 1) was used to design the system and for internal validation. Two other online datasets, namely, some symptoms (dataset 2) and blood tests (dataset 3), were also analyzed. A combination of one-hot encoding, stability feature selection, over-sampling, and an ensemble classifier was used. Ten-fold stratified cross-validation was performed. In addition to gender and symptom duration, signs and symptoms, blood biomarkers, and comorbidities were selected. Performance indices of the cross-validated confusion matrix for dataset 1 were as follows: sensitivity of 96% [confidence interval, CI, 95%: 94–98], specificity of 95% [90–99], positive predictive value (PPV) of 99% [98–100], negative predictive value (NPV) of 82% [76–89], diagnostic odds ratio (DOR) of 496 [198–1,245], area under the ROC (AUC) of 0.96 [0.94–0.97], Matthews Correlation Coefficient (MCC) of 0.87 [0.85–0.88], accuracy of 96% [94–98], and Cohen's Kappa of 0.86 [0.81–0.91]. The proposed algorithm showed excellent diagnosis accuracy and class-labeling agreement, and fair discriminant power. The AUC on the datasets 2 and 3 was 0.97 [0.96–0.98] and 0.92 [0.91–0.94], respectively. The most important feature was white blood cell count, shortness of breath, and C-reactive protein for datasets 1, 2, and 3, respectively. The proposed algorithm is, thus, a promising COVID-19 diagnosis method, which could be an amendment to simple blood tests and screening of symptoms. However, the RT-PCR and chest CT-scan, performed as the gold standard, are not 100% accurate. Frontiers Media S.A. 2021-11-18 /pmc/articles/PMC8640954/ /pubmed/34869483 http://dx.doi.org/10.3389/fmed.2021.768467 Text en Copyright © 2021 Marateb, Ziaie Nezhad, Mohebian, Sami, Haghjooy Javanmard, Dehghan Niri, Akafzadeh-Savari, Mansourian, Mañanas, Wolkewitz and Binder. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Marateb, Hamid Reza
Ziaie Nezhad, Farzad
Mohebian, Mohammad Reza
Sami, Ramin
Haghjooy Javanmard, Shaghayegh
Dehghan Niri, Fatemeh
Akafzadeh-Savari, Mahsa
Mansourian, Marjan
Mañanas, Miquel Angel
Wolkewitz, Martin
Binder, Harald
Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study
title Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study
title_full Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study
title_fullStr Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study
title_full_unstemmed Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study
title_short Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study
title_sort automatic classification between covid-19 and non-covid-19 pneumonia using symptoms, comorbidities, and laboratory findings: the khorshid covid cohort study
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8640954/
https://www.ncbi.nlm.nih.gov/pubmed/34869483
http://dx.doi.org/10.3389/fmed.2021.768467
work_keys_str_mv AT maratebhamidreza automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT ziaienezhadfarzad automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT mohebianmohammadreza automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT samiramin automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT haghjooyjavanmardshaghayegh automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT dehghannirifatemeh automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT akafzadehsavarimahsa automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT mansourianmarjan automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT mananasmiquelangel automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT wolkewitzmartin automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy
AT binderharald automaticclassificationbetweencovid19andnoncovid19pneumoniausingsymptomscomorbiditiesandlaboratoryfindingsthekhorshidcovidcohortstudy