Cargando…
Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data
Since the beginning of the COVID-19 pandemic, new and non-invasive digital technologies such as artificial intelligence (AI) had been introduced for mortality prediction of COVID-19 patients. The prognostic performances of the machine learning (ML)-based models for predicting clinical outcomes of CO...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10345104/ https://www.ncbi.nlm.nih.gov/pubmed/37443373 http://dx.doi.org/10.1038/s41598-023-38133-6 |
_version_ | 1785073011499466752 |
---|---|
author | Zakariaee, Seyed Salman Naderi, Negar Ebrahimi, Mahdi Kazemi-Arpanahi, Hadi |
author_facet | Zakariaee, Seyed Salman Naderi, Negar Ebrahimi, Mahdi Kazemi-Arpanahi, Hadi |
author_sort | Zakariaee, Seyed Salman |
collection | PubMed |
description | Since the beginning of the COVID-19 pandemic, new and non-invasive digital technologies such as artificial intelligence (AI) had been introduced for mortality prediction of COVID-19 patients. The prognostic performances of the machine learning (ML)-based models for predicting clinical outcomes of COVID-19 patients had been mainly evaluated using demographics, risk factors, clinical manifestations, and laboratory results. There is a lack of information about the prognostic role of imaging manifestations in combination with demographics, clinical manifestations, and laboratory predictors. The purpose of the present study is to develop an efficient ML prognostic model based on a more comprehensive dataset including chest CT severity score (CT-SS). Fifty-five primary features in six main classes were retrospectively reviewed for 6854 suspected cases. The independence test of Chi-square was used to determine the most important features in the mortality prediction of COVID-19 patients. The most relevant predictors were used to train and test ML algorithms. The predictive models were developed using eight ML algorithms including the J48 decision tree (J48), support vector machine (SVM), multi-layer perceptron (MLP), k-nearest neighbourhood (k-NN), Naïve Bayes (NB), logistic regression (LR), random forest (RF), and eXtreme gradient boosting (XGBoost). The performances of the predictive models were evaluated using accuracy, precision, sensitivity, specificity, and area under the ROC curve (AUC) metrics. After applying the exclusion criteria, a total of 815 positive RT-PCR patients were the final sample size, where 54.85% of the patients were male and the mean age of the study population was 57.22 ± 16.76 years. The RF algorithm with an accuracy of 97.2%, the sensitivity of 100%, a precision of 94.8%, specificity of 94.5%, F1-score of 97.3%, and AUC of 99.9% had the best performance. Other ML algorithms with AUC ranging from 81.2 to 93.9% had also good prediction performances in predicting COVID-19 mortality. Results showed that timely and accurate risk stratification of COVID-19 patients could be performed using ML-based predictive models fed by routine data. The proposed algorithm with the more comprehensive dataset including CT-SS could efficiently predict the mortality of COVID-19 patients. This could lead to promptly targeting high-risk patients on admission, the optimal use of hospital resources, and an increased probability of survival of patients. |
format | Online Article Text |
id | pubmed-10345104 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-103451042023-07-15 Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data Zakariaee, Seyed Salman Naderi, Negar Ebrahimi, Mahdi Kazemi-Arpanahi, Hadi Sci Rep Article Since the beginning of the COVID-19 pandemic, new and non-invasive digital technologies such as artificial intelligence (AI) had been introduced for mortality prediction of COVID-19 patients. The prognostic performances of the machine learning (ML)-based models for predicting clinical outcomes of COVID-19 patients had been mainly evaluated using demographics, risk factors, clinical manifestations, and laboratory results. There is a lack of information about the prognostic role of imaging manifestations in combination with demographics, clinical manifestations, and laboratory predictors. The purpose of the present study is to develop an efficient ML prognostic model based on a more comprehensive dataset including chest CT severity score (CT-SS). Fifty-five primary features in six main classes were retrospectively reviewed for 6854 suspected cases. The independence test of Chi-square was used to determine the most important features in the mortality prediction of COVID-19 patients. The most relevant predictors were used to train and test ML algorithms. The predictive models were developed using eight ML algorithms including the J48 decision tree (J48), support vector machine (SVM), multi-layer perceptron (MLP), k-nearest neighbourhood (k-NN), Naïve Bayes (NB), logistic regression (LR), random forest (RF), and eXtreme gradient boosting (XGBoost). The performances of the predictive models were evaluated using accuracy, precision, sensitivity, specificity, and area under the ROC curve (AUC) metrics. After applying the exclusion criteria, a total of 815 positive RT-PCR patients were the final sample size, where 54.85% of the patients were male and the mean age of the study population was 57.22 ± 16.76 years. The RF algorithm with an accuracy of 97.2%, the sensitivity of 100%, a precision of 94.8%, specificity of 94.5%, F1-score of 97.3%, and AUC of 99.9% had the best performance. Other ML algorithms with AUC ranging from 81.2 to 93.9% had also good prediction performances in predicting COVID-19 mortality. Results showed that timely and accurate risk stratification of COVID-19 patients could be performed using ML-based predictive models fed by routine data. The proposed algorithm with the more comprehensive dataset including CT-SS could efficiently predict the mortality of COVID-19 patients. This could lead to promptly targeting high-risk patients on admission, the optimal use of hospital resources, and an increased probability of survival of patients. Nature Publishing Group UK 2023-07-13 /pmc/articles/PMC10345104/ /pubmed/37443373 http://dx.doi.org/10.1038/s41598-023-38133-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zakariaee, Seyed Salman Naderi, Negar Ebrahimi, Mahdi Kazemi-Arpanahi, Hadi Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data |
title | Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data |
title_full | Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data |
title_fullStr | Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data |
title_full_unstemmed | Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data |
title_short | Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data |
title_sort | comparing machine learning algorithms to predict covid‑19 mortality using a dataset including chest computed tomography severity score data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10345104/ https://www.ncbi.nlm.nih.gov/pubmed/37443373 http://dx.doi.org/10.1038/s41598-023-38133-6 |
work_keys_str_mv | AT zakariaeeseyedsalman comparingmachinelearningalgorithmstopredictcovid19mortalityusingadatasetincludingchestcomputedtomographyseverityscoredata AT naderinegar comparingmachinelearningalgorithmstopredictcovid19mortalityusingadatasetincludingchestcomputedtomographyseverityscoredata AT ebrahimimahdi comparingmachinelearningalgorithmstopredictcovid19mortalityusingadatasetincludingchestcomputedtomographyseverityscoredata AT kazemiarpanahihadi comparingmachinelearningalgorithmstopredictcovid19mortalityusingadatasetincludingchestcomputedtomographyseverityscoredata |