Cargando…
Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method
To predict the mortality of patients with coronavirus disease 2019 (COVID-19). We collected clinical data of COVID-19 patients between January 18 and March 29 2020 in Wuhan, China . Gradient boosting decision tree (GBDT), logistic regression (LR) model, and simplified LR were built to predict the mo...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer London
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7783503/ https://www.ncbi.nlm.nih.gov/pubmed/33424133 http://dx.doi.org/10.1007/s00521-020-05592-1 |
_version_ | 1783632129399193600 |
---|---|
author | Li, Simin Lin, Yulan Zhu, Tong Fan, Mengjie Xu, Shicheng Qiu, Weihao Chen, Can Li, Linfeng Wang, Yao Yan, Jun Wong, Justin Naing, Lin Xu, Shabei |
author_facet | Li, Simin Lin, Yulan Zhu, Tong Fan, Mengjie Xu, Shicheng Qiu, Weihao Chen, Can Li, Linfeng Wang, Yao Yan, Jun Wong, Justin Naing, Lin Xu, Shabei |
author_sort | Li, Simin |
collection | PubMed |
description | To predict the mortality of patients with coronavirus disease 2019 (COVID-19). We collected clinical data of COVID-19 patients between January 18 and March 29 2020 in Wuhan, China . Gradient boosting decision tree (GBDT), logistic regression (LR) model, and simplified LR were built to predict the mortality of COVID-19. We also evaluated different models by computing area under curve (AUC), accuracy, positive predictive value (PPV), and negative predictive value (NPV) under fivefold cross-validation. A total of 2924 patients were included in our evaluation, with 257 (8.8%) died and 2667 (91.2%) survived during hospitalization. Upon admission, there were 21 (0.7%) mild cases, 2051 (70.1%) moderate case, 779 (26.6%) severe cases, and 73 (2.5%) critically severe cases. The GBDT model exhibited the highest fivefold AUC, which was 0.941, followed by LR (0.928) and LR-5 (0.913). The diagnostic accuracies of GBDT, LR, and LR-5 were 0.889, 0.868, and 0.887, respectively. In particular, the GBDT model demonstrated the highest sensitivity (0.899) and specificity (0.889). The NPV of all three models exceeded 97%, while their PPV values were relatively low, resulting in 0.381 for LR, 0.402 for LR-5, and 0.432 for GBDT. Regarding severe and critically severe cases, the GBDT model also performed the best with a fivefold AUC of 0.918. In the external validation test of the LR-5 model using 72 cases of COVID-19 from Brunei, leukomonocyte (%) turned to show the highest fivefold AUC (0.917), followed by urea (0.867), age (0.826), and SPO2 (0.704). The findings confirm that the mortality prediction performance of the GBDT is better than the LR models in confirmed cases of COVID-19. The performance comparison seems independent of disease severity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at(10.1007/s00521-020-05592-1) |
format | Online Article Text |
id | pubmed-7783503 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer London |
record_format | MEDLINE/PubMed |
spelling | pubmed-77835032021-01-05 Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method Li, Simin Lin, Yulan Zhu, Tong Fan, Mengjie Xu, Shicheng Qiu, Weihao Chen, Can Li, Linfeng Wang, Yao Yan, Jun Wong, Justin Naing, Lin Xu, Shabei Neural Comput Appl S.I. : Deep Social Computing To predict the mortality of patients with coronavirus disease 2019 (COVID-19). We collected clinical data of COVID-19 patients between January 18 and March 29 2020 in Wuhan, China . Gradient boosting decision tree (GBDT), logistic regression (LR) model, and simplified LR were built to predict the mortality of COVID-19. We also evaluated different models by computing area under curve (AUC), accuracy, positive predictive value (PPV), and negative predictive value (NPV) under fivefold cross-validation. A total of 2924 patients were included in our evaluation, with 257 (8.8%) died and 2667 (91.2%) survived during hospitalization. Upon admission, there were 21 (0.7%) mild cases, 2051 (70.1%) moderate case, 779 (26.6%) severe cases, and 73 (2.5%) critically severe cases. The GBDT model exhibited the highest fivefold AUC, which was 0.941, followed by LR (0.928) and LR-5 (0.913). The diagnostic accuracies of GBDT, LR, and LR-5 were 0.889, 0.868, and 0.887, respectively. In particular, the GBDT model demonstrated the highest sensitivity (0.899) and specificity (0.889). The NPV of all three models exceeded 97%, while their PPV values were relatively low, resulting in 0.381 for LR, 0.402 for LR-5, and 0.432 for GBDT. Regarding severe and critically severe cases, the GBDT model also performed the best with a fivefold AUC of 0.918. In the external validation test of the LR-5 model using 72 cases of COVID-19 from Brunei, leukomonocyte (%) turned to show the highest fivefold AUC (0.917), followed by urea (0.867), age (0.826), and SPO2 (0.704). The findings confirm that the mortality prediction performance of the GBDT is better than the LR models in confirmed cases of COVID-19. The performance comparison seems independent of disease severity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at(10.1007/s00521-020-05592-1) Springer London 2021-01-05 2023 /pmc/articles/PMC7783503/ /pubmed/33424133 http://dx.doi.org/10.1007/s00521-020-05592-1 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd. part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | S.I. : Deep Social Computing Li, Simin Lin, Yulan Zhu, Tong Fan, Mengjie Xu, Shicheng Qiu, Weihao Chen, Can Li, Linfeng Wang, Yao Yan, Jun Wong, Justin Naing, Lin Xu, Shabei Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method |
title | Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method |
title_full | Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method |
title_fullStr | Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method |
title_full_unstemmed | Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method |
title_short | Development and external evaluation of predictions models for mortality of COVID-19 patients using machine learning method |
title_sort | development and external evaluation of predictions models for mortality of covid-19 patients using machine learning method |
topic | S.I. : Deep Social Computing |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7783503/ https://www.ncbi.nlm.nih.gov/pubmed/33424133 http://dx.doi.org/10.1007/s00521-020-05592-1 |
work_keys_str_mv | AT lisimin developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT linyulan developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT zhutong developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT fanmengjie developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT xushicheng developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT qiuweihao developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT chencan developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT lilinfeng developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT wangyao developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT yanjun developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT wongjustin developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT nainglin developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod AT xushabei developmentandexternalevaluationofpredictionsmodelsformortalityofcovid19patientsusingmachinelearningmethod |