Cargando…
Automatic Detection of Severely and Mildly Infected COVID-19 Patients with Supervised Machine Learning Models
OBJECTIVES: When the prognosis of COVID-19 disease can be detected early, the intense-pressure and loss of workforce in health-services can be partially reduced. The primary-purpose of this article is to determine the feature-dataset consisting of the routine-blood-values (RBV) and demographic-data...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AGBM. Published by Elsevier Masson SAS.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9158375/ https://www.ncbi.nlm.nih.gov/pubmed/35673548 http://dx.doi.org/10.1016/j.irbm.2022.05.006 |
_version_ | 1784718824627503104 |
---|---|
author | Huyut, M.T. |
author_facet | Huyut, M.T. |
author_sort | Huyut, M.T. |
collection | PubMed |
description | OBJECTIVES: When the prognosis of COVID-19 disease can be detected early, the intense-pressure and loss of workforce in health-services can be partially reduced. The primary-purpose of this article is to determine the feature-dataset consisting of the routine-blood-values (RBV) and demographic-data that affect the prognosis of COVID-19. Second, by applying the feature-dataset to the supervised machine-learning (ML) models, it is to identify severely and mildly infected COVID-19 patients at the time of admission. MATERIAL AND METHODS: The sample of this study consists of severely (n = 192) and mildly (n = 4010) infected-patients hospitalized with the diagnosis of COVID-19 between March-September, 2021. The RBV-data measured at the time of admission and age-gender characteristics of these patients were analyzed retrospectively. For the selection of the features, the minimum-redundancy-maximum-relevance (MRMR) method, principal-components-analysis and forward-multiple-logistics-regression analyzes were used. The features set were statistically compared between mild and severe infected-patients. Then, the performances of various supervised-ML-models were compared in identifying severely and mildly infected-patients using the feature set. RESULTS: In this study, 28 RBV-parameters and age-variable were found as the feature-dataset. The effect of features on the prognosis of the disease has been clinically proven. The ML-models with the highest overall-accuracy in identifying patient-groups were found respectively, as follows: local-weighted-learning (LWL)-97.86%, K-star (K*)-96.31%, Naive-Bayes (NB)-95.36% and k-nearest-neighbor (KNN)-94.05%. Also, the most successful models with the highest area-under-the-receiver-operating-characteristic-curve (AUC) values in identifying patient groups were found respectively, as follows: LWL-0.95%, K*-0.91%, NB-0.85% and KNN-0.75%. CONCLUSION: The findings in this article have significant a motivation for the healthcare professionals to detect at admission severely and mildly infected COVID-19 patients. |
format | Online Article Text |
id | pubmed-9158375 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | AGBM. Published by Elsevier Masson SAS. |
record_format | MEDLINE/PubMed |
spelling | pubmed-91583752022-06-02 Automatic Detection of Severely and Mildly Infected COVID-19 Patients with Supervised Machine Learning Models Huyut, M.T. Ing Rech Biomed Original Article OBJECTIVES: When the prognosis of COVID-19 disease can be detected early, the intense-pressure and loss of workforce in health-services can be partially reduced. The primary-purpose of this article is to determine the feature-dataset consisting of the routine-blood-values (RBV) and demographic-data that affect the prognosis of COVID-19. Second, by applying the feature-dataset to the supervised machine-learning (ML) models, it is to identify severely and mildly infected COVID-19 patients at the time of admission. MATERIAL AND METHODS: The sample of this study consists of severely (n = 192) and mildly (n = 4010) infected-patients hospitalized with the diagnosis of COVID-19 between March-September, 2021. The RBV-data measured at the time of admission and age-gender characteristics of these patients were analyzed retrospectively. For the selection of the features, the minimum-redundancy-maximum-relevance (MRMR) method, principal-components-analysis and forward-multiple-logistics-regression analyzes were used. The features set were statistically compared between mild and severe infected-patients. Then, the performances of various supervised-ML-models were compared in identifying severely and mildly infected-patients using the feature set. RESULTS: In this study, 28 RBV-parameters and age-variable were found as the feature-dataset. The effect of features on the prognosis of the disease has been clinically proven. The ML-models with the highest overall-accuracy in identifying patient-groups were found respectively, as follows: local-weighted-learning (LWL)-97.86%, K-star (K*)-96.31%, Naive-Bayes (NB)-95.36% and k-nearest-neighbor (KNN)-94.05%. Also, the most successful models with the highest area-under-the-receiver-operating-characteristic-curve (AUC) values in identifying patient groups were found respectively, as follows: LWL-0.95%, K*-0.91%, NB-0.85% and KNN-0.75%. CONCLUSION: The findings in this article have significant a motivation for the healthcare professionals to detect at admission severely and mildly infected COVID-19 patients. AGBM. Published by Elsevier Masson SAS. 2023-02 2022-06-01 /pmc/articles/PMC9158375/ /pubmed/35673548 http://dx.doi.org/10.1016/j.irbm.2022.05.006 Text en © 2022 AGBM. Published by Elsevier Masson SAS. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Original Article Huyut, M.T. Automatic Detection of Severely and Mildly Infected COVID-19 Patients with Supervised Machine Learning Models |
title | Automatic Detection of Severely and Mildly Infected COVID-19 Patients with Supervised Machine Learning Models |
title_full | Automatic Detection of Severely and Mildly Infected COVID-19 Patients with Supervised Machine Learning Models |
title_fullStr | Automatic Detection of Severely and Mildly Infected COVID-19 Patients with Supervised Machine Learning Models |
title_full_unstemmed | Automatic Detection of Severely and Mildly Infected COVID-19 Patients with Supervised Machine Learning Models |
title_short | Automatic Detection of Severely and Mildly Infected COVID-19 Patients with Supervised Machine Learning Models |
title_sort | automatic detection of severely and mildly infected covid-19 patients with supervised machine learning models |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9158375/ https://www.ncbi.nlm.nih.gov/pubmed/35673548 http://dx.doi.org/10.1016/j.irbm.2022.05.006 |
work_keys_str_mv | AT huyutmt automaticdetectionofseverelyandmildlyinfectedcovid19patientswithsupervisedmachinelearningmodels |