Cargando…

Prognostic Assessment of COVID-19 in the Intensive Care Unit by Machine Learning Methods: Model Development and Validation

BACKGROUND: Patients with COVID-19 in the intensive care unit (ICU) have a high mortality rate, and methods to assess patients’ prognosis early and administer precise treatment are of great significance. OBJECTIVE: The aim of this study was to use machine learning to construct a model for the analys...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Pan, Li, Yichao, Xiao, Yongjiu, Han, Bingchao, Su, Longxiang, Su, Mingliang, Li, Yansheng, Zhang, Siqi, Jiang, Dapeng, Chen, Xia, Zhou, Fuquan, Ma, Ling, Bao, Pengtao, Xie, Lixin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7661105/
https://www.ncbi.nlm.nih.gov/pubmed/33035175
http://dx.doi.org/10.2196/23128
_version_ 1783609142083059712
author Pan, Pan
Li, Yichao
Xiao, Yongjiu
Han, Bingchao
Su, Longxiang
Su, Mingliang
Li, Yansheng
Zhang, Siqi
Jiang, Dapeng
Chen, Xia
Zhou, Fuquan
Ma, Ling
Bao, Pengtao
Xie, Lixin
author_facet Pan, Pan
Li, Yichao
Xiao, Yongjiu
Han, Bingchao
Su, Longxiang
Su, Mingliang
Li, Yansheng
Zhang, Siqi
Jiang, Dapeng
Chen, Xia
Zhou, Fuquan
Ma, Ling
Bao, Pengtao
Xie, Lixin
author_sort Pan, Pan
collection PubMed
description BACKGROUND: Patients with COVID-19 in the intensive care unit (ICU) have a high mortality rate, and methods to assess patients’ prognosis early and administer precise treatment are of great significance. OBJECTIVE: The aim of this study was to use machine learning to construct a model for the analysis of risk factors and prediction of mortality among ICU patients with COVID-19. METHODS: In this study, 123 patients with COVID-19 in the ICU of Vulcan Hill Hospital were retrospectively selected from the database, and the data were randomly divided into a training data set (n=98) and test data set (n=25) with a 4:1 ratio. Significance tests, correlation analysis, and factor analysis were used to screen 100 potential risk factors individually. Conventional logistic regression methods and four machine learning algorithms were used to construct the risk prediction model for the prognosis of patients with COVID-19 in the ICU. The performance of these machine learning models was measured by the area under the receiver operating characteristic curve (AUC). Interpretation and evaluation of the risk prediction model were performed using calibration curves, SHapley Additive exPlanations (SHAP), Local Interpretable Model-Agnostic Explanations (LIME), etc, to ensure its stability and reliability. The outcome was based on the ICU deaths recorded from the database. RESULTS: Layer-by-layer screening of 100 potential risk factors finally revealed 8 important risk factors that were included in the risk prediction model: lymphocyte percentage, prothrombin time, lactate dehydrogenase, total bilirubin, eosinophil percentage, creatinine, neutrophil percentage, and albumin level. Finally, an eXtreme Gradient Boosting (XGBoost) model established with the 8 important risk factors showed the best recognition ability in the training set of 5-fold cross validation (AUC=0.86) and the verification queue (AUC=0.92). The calibration curve showed that the risk predicted by the model was in good agreement with the actual risk. In addition, using the SHAP and LIME algorithms, feature interpretation and sample prediction interpretation algorithms of the XGBoost black box model were implemented. Additionally, the model was translated into a web-based risk calculator that is freely available for public usage. CONCLUSIONS: The 8-factor XGBoost model predicts risk of death in ICU patients with COVID-19 well; it initially demonstrates stability and can be used effectively to predict COVID-19 prognosis in ICU patients.
format Online
Article
Text
id pubmed-7661105
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-76611052020-11-19 Prognostic Assessment of COVID-19 in the Intensive Care Unit by Machine Learning Methods: Model Development and Validation Pan, Pan Li, Yichao Xiao, Yongjiu Han, Bingchao Su, Longxiang Su, Mingliang Li, Yansheng Zhang, Siqi Jiang, Dapeng Chen, Xia Zhou, Fuquan Ma, Ling Bao, Pengtao Xie, Lixin J Med Internet Res Original Paper BACKGROUND: Patients with COVID-19 in the intensive care unit (ICU) have a high mortality rate, and methods to assess patients’ prognosis early and administer precise treatment are of great significance. OBJECTIVE: The aim of this study was to use machine learning to construct a model for the analysis of risk factors and prediction of mortality among ICU patients with COVID-19. METHODS: In this study, 123 patients with COVID-19 in the ICU of Vulcan Hill Hospital were retrospectively selected from the database, and the data were randomly divided into a training data set (n=98) and test data set (n=25) with a 4:1 ratio. Significance tests, correlation analysis, and factor analysis were used to screen 100 potential risk factors individually. Conventional logistic regression methods and four machine learning algorithms were used to construct the risk prediction model for the prognosis of patients with COVID-19 in the ICU. The performance of these machine learning models was measured by the area under the receiver operating characteristic curve (AUC). Interpretation and evaluation of the risk prediction model were performed using calibration curves, SHapley Additive exPlanations (SHAP), Local Interpretable Model-Agnostic Explanations (LIME), etc, to ensure its stability and reliability. The outcome was based on the ICU deaths recorded from the database. RESULTS: Layer-by-layer screening of 100 potential risk factors finally revealed 8 important risk factors that were included in the risk prediction model: lymphocyte percentage, prothrombin time, lactate dehydrogenase, total bilirubin, eosinophil percentage, creatinine, neutrophil percentage, and albumin level. Finally, an eXtreme Gradient Boosting (XGBoost) model established with the 8 important risk factors showed the best recognition ability in the training set of 5-fold cross validation (AUC=0.86) and the verification queue (AUC=0.92). The calibration curve showed that the risk predicted by the model was in good agreement with the actual risk. In addition, using the SHAP and LIME algorithms, feature interpretation and sample prediction interpretation algorithms of the XGBoost black box model were implemented. Additionally, the model was translated into a web-based risk calculator that is freely available for public usage. CONCLUSIONS: The 8-factor XGBoost model predicts risk of death in ICU patients with COVID-19 well; it initially demonstrates stability and can be used effectively to predict COVID-19 prognosis in ICU patients. JMIR Publications 2020-11-11 /pmc/articles/PMC7661105/ /pubmed/33035175 http://dx.doi.org/10.2196/23128 Text en ©Pan Pan, Yichao Li, Yongjiu Xiao, Bingchao Han, Longxiang Su, Mingliang Su, Yansheng Li, Siqi Zhang, Dapeng Jiang, Xia Chen, Fuquan Zhou, Ling Ma, Pengtao Bao, Lixin Xie. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 11.11.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Pan, Pan
Li, Yichao
Xiao, Yongjiu
Han, Bingchao
Su, Longxiang
Su, Mingliang
Li, Yansheng
Zhang, Siqi
Jiang, Dapeng
Chen, Xia
Zhou, Fuquan
Ma, Ling
Bao, Pengtao
Xie, Lixin
Prognostic Assessment of COVID-19 in the Intensive Care Unit by Machine Learning Methods: Model Development and Validation
title Prognostic Assessment of COVID-19 in the Intensive Care Unit by Machine Learning Methods: Model Development and Validation
title_full Prognostic Assessment of COVID-19 in the Intensive Care Unit by Machine Learning Methods: Model Development and Validation
title_fullStr Prognostic Assessment of COVID-19 in the Intensive Care Unit by Machine Learning Methods: Model Development and Validation
title_full_unstemmed Prognostic Assessment of COVID-19 in the Intensive Care Unit by Machine Learning Methods: Model Development and Validation
title_short Prognostic Assessment of COVID-19 in the Intensive Care Unit by Machine Learning Methods: Model Development and Validation
title_sort prognostic assessment of covid-19 in the intensive care unit by machine learning methods: model development and validation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7661105/
https://www.ncbi.nlm.nih.gov/pubmed/33035175
http://dx.doi.org/10.2196/23128
work_keys_str_mv AT panpan prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT liyichao prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT xiaoyongjiu prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT hanbingchao prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT sulongxiang prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT sumingliang prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT liyansheng prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT zhangsiqi prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT jiangdapeng prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT chenxia prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT zhoufuquan prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT maling prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT baopengtao prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation
AT xielixin prognosticassessmentofcovid19intheintensivecareunitbymachinelearningmethodsmodeldevelopmentandvalidation