Cargando…

Comparing different machine learning techniques for predicting COVID-19 severity

BACKGROUND: Coronavirus disease 2019 (COVID-19) is still ongoing spreading globally, machine learning techniques were used in disease diagnosis and to predict treatment outcomes, which showed favorable performance. The present study aims to predict COVID-19 severity at admission by different machine...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Yibai, Ma, Yan, Ruan, Lianguo, Li, Dan, Lu, Cheng, Huang, Luqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8851750/
https://www.ncbi.nlm.nih.gov/pubmed/35177120
http://dx.doi.org/10.1186/s40249-022-00946-4
_version_ 1784652886549987328
author Xiong, Yibai
Ma, Yan
Ruan, Lianguo
Li, Dan
Lu, Cheng
Huang, Luqi
author_facet Xiong, Yibai
Ma, Yan
Ruan, Lianguo
Li, Dan
Lu, Cheng
Huang, Luqi
author_sort Xiong, Yibai
collection PubMed
description BACKGROUND: Coronavirus disease 2019 (COVID-19) is still ongoing spreading globally, machine learning techniques were used in disease diagnosis and to predict treatment outcomes, which showed favorable performance. The present study aims to predict COVID-19 severity at admission by different machine learning techniques including random forest (RF), support vector machine (SVM), and logistic regression (LR). Feature importance to COVID-19 severity were further identified. METHODS: A retrospective design was adopted in the JinYinTan Hospital from January 26 to March 28, 2020, eighty-six demographic, clinical, and laboratory features were selected with LassoCV method, Spearman’s rank correlation, experts’ opinions, and literature evaluation. RF, SVM, and LR were performed to predict severe COVID-19, the performance of the models was compared by the area under curve (AUC). Additionally, feature importance to COVID-19 severity were analyzed by the best performance model. RESULTS: A total of 287 patients were enrolled with 36.6% severe cases and 63.4% non-severe cases. The median age was 60.0 years (interquartile range: 49.0–68.0 years). Three models were established using 23 features including 1 clinical, 1 chest computed tomography (CT) and 21 laboratory features. Among three models, RF yielded better overall performance with the highest AUC of 0.970 than SVM of 0.948 and LR of 0.928, RF also achieved a favorable sensitivity of 96.7%, specificity of 69.5%, and accuracy of 84.5%. SVM had sensitivity of 93.9%, specificity of 79.0%, and accuracy of 88.5%. LR also achieved a favorable sensitivity of 92.3%, specificity of 72.3%, and accuracy of 85.2%. Additionally, chest-CT had highest importance to illness severity, and the following features were neutrophil to lymphocyte ratio, lactate dehydrogenase, and D-dimer, respectively. CONCLUSIONS: Our results indicated that RF could be a useful predictive tool to identify patients with severe COVID-19, which may facilitate effective care and further optimize resources. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40249-022-00946-4.
format Online
Article
Text
id pubmed-8851750
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-88517502022-02-18 Comparing different machine learning techniques for predicting COVID-19 severity Xiong, Yibai Ma, Yan Ruan, Lianguo Li, Dan Lu, Cheng Huang, Luqi Infect Dis Poverty Research Article BACKGROUND: Coronavirus disease 2019 (COVID-19) is still ongoing spreading globally, machine learning techniques were used in disease diagnosis and to predict treatment outcomes, which showed favorable performance. The present study aims to predict COVID-19 severity at admission by different machine learning techniques including random forest (RF), support vector machine (SVM), and logistic regression (LR). Feature importance to COVID-19 severity were further identified. METHODS: A retrospective design was adopted in the JinYinTan Hospital from January 26 to March 28, 2020, eighty-six demographic, clinical, and laboratory features were selected with LassoCV method, Spearman’s rank correlation, experts’ opinions, and literature evaluation. RF, SVM, and LR were performed to predict severe COVID-19, the performance of the models was compared by the area under curve (AUC). Additionally, feature importance to COVID-19 severity were analyzed by the best performance model. RESULTS: A total of 287 patients were enrolled with 36.6% severe cases and 63.4% non-severe cases. The median age was 60.0 years (interquartile range: 49.0–68.0 years). Three models were established using 23 features including 1 clinical, 1 chest computed tomography (CT) and 21 laboratory features. Among three models, RF yielded better overall performance with the highest AUC of 0.970 than SVM of 0.948 and LR of 0.928, RF also achieved a favorable sensitivity of 96.7%, specificity of 69.5%, and accuracy of 84.5%. SVM had sensitivity of 93.9%, specificity of 79.0%, and accuracy of 88.5%. LR also achieved a favorable sensitivity of 92.3%, specificity of 72.3%, and accuracy of 85.2%. Additionally, chest-CT had highest importance to illness severity, and the following features were neutrophil to lymphocyte ratio, lactate dehydrogenase, and D-dimer, respectively. CONCLUSIONS: Our results indicated that RF could be a useful predictive tool to identify patients with severe COVID-19, which may facilitate effective care and further optimize resources. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40249-022-00946-4. BioMed Central 2022-02-17 /pmc/articles/PMC8851750/ /pubmed/35177120 http://dx.doi.org/10.1186/s40249-022-00946-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Xiong, Yibai
Ma, Yan
Ruan, Lianguo
Li, Dan
Lu, Cheng
Huang, Luqi
Comparing different machine learning techniques for predicting COVID-19 severity
title Comparing different machine learning techniques for predicting COVID-19 severity
title_full Comparing different machine learning techniques for predicting COVID-19 severity
title_fullStr Comparing different machine learning techniques for predicting COVID-19 severity
title_full_unstemmed Comparing different machine learning techniques for predicting COVID-19 severity
title_short Comparing different machine learning techniques for predicting COVID-19 severity
title_sort comparing different machine learning techniques for predicting covid-19 severity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8851750/
https://www.ncbi.nlm.nih.gov/pubmed/35177120
http://dx.doi.org/10.1186/s40249-022-00946-4
work_keys_str_mv AT xiongyibai comparingdifferentmachinelearningtechniquesforpredictingcovid19severity
AT mayan comparingdifferentmachinelearningtechniquesforpredictingcovid19severity
AT ruanlianguo comparingdifferentmachinelearningtechniquesforpredictingcovid19severity
AT lidan comparingdifferentmachinelearningtechniquesforpredictingcovid19severity
AT lucheng comparingdifferentmachinelearningtechniquesforpredictingcovid19severity
AT huangluqi comparingdifferentmachinelearningtechniquesforpredictingcovid19severity
AT comparingdifferentmachinelearningtechniquesforpredictingcovid19severity