Cargando…

Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method

To predict the mortality of patients with coronavirus disease 2019 (COVID-19). We collected clinical data of COVID-19 patients between January 18 and March 29 2020 in Wuhan, China . Gradient boosting decision tree (GBDT), logistic regression (LR) model, and simplified LR were built to predict the mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Simin, Lin, Yulan, Zhu, Tong, Fan, Mengjie, Xu, Shicheng, Qiu, Weihao, Chen, Can, Li, Linfeng, Wang, Yao, Yan, Jun, Wong, Justin, Naing, Lin, Xu, Shabei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer London 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7783503/
https://www.ncbi.nlm.nih.gov/pubmed/33424133
http://dx.doi.org/10.1007/s00521-020-05592-1
_version_ 1783632129399193600
author Li, Simin
Lin, Yulan
Zhu, Tong
Fan, Mengjie
Xu, Shicheng
Qiu, Weihao
Chen, Can
Li, Linfeng
Wang, Yao
Yan, Jun
Wong, Justin
Naing, Lin
Xu, Shabei
author_facet Li, Simin
Lin, Yulan
Zhu, Tong
Fan, Mengjie
Xu, Shicheng
Qiu, Weihao
Chen, Can
Li, Linfeng
Wang, Yao
Yan, Jun
Wong, Justin
Naing, Lin
Xu, Shabei
author_sort Li, Simin
collection PubMed
description To predict the mortality of patients with coronavirus disease 2019 (COVID-19). We collected clinical data of COVID-19 patients between January 18 and March 29 2020 in Wuhan, China . Gradient boosting decision tree (GBDT), logistic regression (LR) model, and simplified LR were built to predict the mortality of COVID-19. We also evaluated different models by computing area under curve (AUC), accuracy, positive predictive value (PPV), and negative predictive value (NPV) under fivefold cross-validation. A total of 2924 patients were included in our evaluation, with 257 (8.8%) died and 2667 (91.2%) survived during hospitalization. Upon admission, there were 21 (0.7%) mild cases, 2051 (70.1%) moderate case, 779 (26.6%) severe cases, and 73 (2.5%) critically severe cases. The GBDT model exhibited the highest fivefold AUC, which was 0.941, followed by LR (0.928) and LR-5 (0.913). The diagnostic accuracies of GBDT, LR, and LR-5 were 0.889, 0.868, and 0.887, respectively. In particular, the GBDT model demonstrated the highest sensitivity (0.899) and specificity (0.889). The NPV of all three models exceeded 97%, while their PPV values were relatively low, resulting in 0.381 for LR, 0.402 for LR-5, and 0.432 for GBDT. Regarding severe and critically severe cases, the GBDT model also performed the best with a fivefold AUC of 0.918. In the external validation test of the LR-5 model using 72 cases of COVID-19 from Brunei, leukomonocyte (%) turned to show the highest fivefold AUC (0.917), followed by urea (0.867), age (0.826), and SPO2 (0.704). The findings confirm that the mortality prediction performance of the GBDT is better than the LR models in confirmed cases of COVID-19. The performance comparison seems independent of disease severity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at(10.1007/s00521-020-05592-1)
format Online
Article
Text
id pubmed-7783503
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer London
record_format MEDLINE/PubMed
spelling pubmed-77835032021-01-05 Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method Li, Simin Lin, Yulan Zhu, Tong Fan, Mengjie Xu, Shicheng Qiu, Weihao Chen, Can Li, Linfeng Wang, Yao Yan, Jun Wong, Justin Naing, Lin Xu, Shabei Neural Comput Appl S.I. : Deep Social Computing To predict the mortality of patients with coronavirus disease 2019 (COVID-19). We collected clinical data of COVID-19 patients between January 18 and March 29 2020 in Wuhan, China . Gradient boosting decision tree (GBDT), logistic regression (LR) model, and simplified LR were built to predict the mortality of COVID-19. We also evaluated different models by computing area under curve (AUC), accuracy, positive predictive value (PPV), and negative predictive value (NPV) under fivefold cross-validation. A total of 2924 patients were included in our evaluation, with 257 (8.8%) died and 2667 (91.2%) survived during hospitalization. Upon admission, there were 21 (0.7%) mild cases, 2051 (70.1%) moderate case, 779 (26.6%) severe cases, and 73 (2.5%) critically severe cases. The GBDT model exhibited the highest fivefold AUC, which was 0.941, followed by LR (0.928) and LR-5 (0.913). The diagnostic accuracies of GBDT, LR, and LR-5 were 0.889, 0.868, and 0.887, respectively. In particular, the GBDT model demonstrated the highest sensitivity (0.899) and specificity (0.889). The NPV of all three models exceeded 97%, while their PPV values were relatively low, resulting in 0.381 for LR, 0.402 for LR-5, and 0.432 for GBDT. Regarding severe and critically severe cases, the GBDT model also performed the best with a fivefold AUC of 0.918. In the external validation test of the LR-5 model using 72 cases of COVID-19 from Brunei, leukomonocyte (%) turned to show the highest fivefold AUC (0.917), followed by urea (0.867), age (0.826), and SPO2 (0.704). The findings confirm that the mortality prediction performance of the GBDT is better than the LR models in confirmed cases of COVID-19. The performance comparison seems independent of disease severity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at(10.1007/s00521-020-05592-1) Springer London 2021-01-05 2023 /pmc/articles/PMC7783503/ /pubmed/33424133 http://dx.doi.org/10.1007/s00521-020-05592-1 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd. part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle S.I. : Deep Social Computing
Li, Simin
Lin, Yulan
Zhu, Tong
Fan, Mengjie
Xu, Shicheng
Qiu, Weihao
Chen, Can
Li, Linfeng
Wang, Yao
Yan, Jun
Wong, Justin
Naing, Lin
Xu, Shabei
Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method
title Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method
title_full Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method
title_fullStr Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method
title_full_unstemmed Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method
title_short Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method
title_sort development and external evaluation of predictions models for mortality of covid-19 patients using machine learning method
topic S.I. : Deep Social Computing
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7783503/
https://www.ncbi.nlm.nih.gov/pubmed/33424133
http://dx.doi.org/10.1007/s00521-020-05592-1
work_keys_str_mv AT lisimin developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT linyulan developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT zhutong developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT fanmengjie developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT xushicheng developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT qiuweihao developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT chencan developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT lilinfeng developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT wangyao developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT yanjun developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT wongjustin developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT nainglin developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod
AT xushabei developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod