Cargando…

Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost

BACKGROUND: Early prediction and classification of prognosis is essential for patients in the coronary care unit (CCU). We applied a machine learning (ML) model using the eXtreme Gradient Boosting (XGBoost) algorithm to prognosticate CCU patients and compared XGBoost with traditional classification...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xingchen, Zhu, Tianqi, Xia, Minghong, Liu, Yu, Wang, Yao, Wang, Xizhi, Zhuang, Lenan, Zhong, Danfeng, Zhu, Jun, He, Hong, Weng, Shaoxiang, Zhu, Junhui, Lai, Dongwu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9133425/
https://www.ncbi.nlm.nih.gov/pubmed/35647052
http://dx.doi.org/10.3389/fcvm.2022.764629
_version_ 1784713564449144832
author Wang, Xingchen
Zhu, Tianqi
Xia, Minghong
Liu, Yu
Wang, Yao
Wang, Xizhi
Zhuang, Lenan
Zhong, Danfeng
Zhu, Jun
He, Hong
Weng, Shaoxiang
Zhu, Junhui
Lai, Dongwu
author_facet Wang, Xingchen
Zhu, Tianqi
Xia, Minghong
Liu, Yu
Wang, Yao
Wang, Xizhi
Zhuang, Lenan
Zhong, Danfeng
Zhu, Jun
He, Hong
Weng, Shaoxiang
Zhu, Junhui
Lai, Dongwu
author_sort Wang, Xingchen
collection PubMed
description BACKGROUND: Early prediction and classification of prognosis is essential for patients in the coronary care unit (CCU). We applied a machine learning (ML) model using the eXtreme Gradient Boosting (XGBoost) algorithm to prognosticate CCU patients and compared XGBoost with traditional classification models. METHODS: CCU patients' data were extracted from the MIMIC-III v1.4 clinical database, and divided into four groups based on the time to death: <30 days, 30 days−1 year, 1–5 years, and ≥5 years. Four classification models, including XGBoost, naïve Bayes (NB), logistic regression (LR), and support vector machine (SVM) were constructed using the Python software. These four models were tested and compared for accuracy, F1 score, Matthews correlation coefficient (MCC), and area under the curve (AUC) of the receiver operating characteristic curves. Subsequently, Local Interpretable Model-Agnostic Explanations method was performed to improve XGBoost model interpretability. We also constructed sub-models of each model based on the different categories of death time and compared the differences by decision curve analysis. The optimal model was further analyzed using a clinical impact curve. At last, feature ablation curves of the XGBoost model were conducted to obtain the simplified model. RESULTS: Overall, 5360 CCU patients were included. Compared to NB, LR, and SVM, the XGBoost model showed better accuracy (0.663, 0.605, 0.632, and 0.622), micro-AUCs (0.873, 0.811, 0.841, and 0.818), and MCC (0.337, 0.317, 0.250, and 0.182). In subgroup analysis, the XGBoost model had a better predictive performance in acute myocardial infarction subgroup. The decision curve and clinical impact curve analyses verified the clinical utility of the XGBoost model for different categories of patients. Finally, we obtained a simplified model with thirty features. CONCLUSIONS: For CCU physicians, the ML technique by XGBoost is a potential predictive tool in patients with different conditions, and it may contribute to improvements in prognosis.
format Online
Article
Text
id pubmed-9133425
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-91334252022-05-27 Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost Wang, Xingchen Zhu, Tianqi Xia, Minghong Liu, Yu Wang, Yao Wang, Xizhi Zhuang, Lenan Zhong, Danfeng Zhu, Jun He, Hong Weng, Shaoxiang Zhu, Junhui Lai, Dongwu Front Cardiovasc Med Cardiovascular Medicine BACKGROUND: Early prediction and classification of prognosis is essential for patients in the coronary care unit (CCU). We applied a machine learning (ML) model using the eXtreme Gradient Boosting (XGBoost) algorithm to prognosticate CCU patients and compared XGBoost with traditional classification models. METHODS: CCU patients' data were extracted from the MIMIC-III v1.4 clinical database, and divided into four groups based on the time to death: <30 days, 30 days−1 year, 1–5 years, and ≥5 years. Four classification models, including XGBoost, naïve Bayes (NB), logistic regression (LR), and support vector machine (SVM) were constructed using the Python software. These four models were tested and compared for accuracy, F1 score, Matthews correlation coefficient (MCC), and area under the curve (AUC) of the receiver operating characteristic curves. Subsequently, Local Interpretable Model-Agnostic Explanations method was performed to improve XGBoost model interpretability. We also constructed sub-models of each model based on the different categories of death time and compared the differences by decision curve analysis. The optimal model was further analyzed using a clinical impact curve. At last, feature ablation curves of the XGBoost model were conducted to obtain the simplified model. RESULTS: Overall, 5360 CCU patients were included. Compared to NB, LR, and SVM, the XGBoost model showed better accuracy (0.663, 0.605, 0.632, and 0.622), micro-AUCs (0.873, 0.811, 0.841, and 0.818), and MCC (0.337, 0.317, 0.250, and 0.182). In subgroup analysis, the XGBoost model had a better predictive performance in acute myocardial infarction subgroup. The decision curve and clinical impact curve analyses verified the clinical utility of the XGBoost model for different categories of patients. Finally, we obtained a simplified model with thirty features. CONCLUSIONS: For CCU physicians, the ML technique by XGBoost is a potential predictive tool in patients with different conditions, and it may contribute to improvements in prognosis. Frontiers Media S.A. 2022-05-12 /pmc/articles/PMC9133425/ /pubmed/35647052 http://dx.doi.org/10.3389/fcvm.2022.764629 Text en Copyright © 2022 Wang, Zhu, Xia, Liu, Wang, Wang, Zhuang, Zhong, Zhu, He, Weng, Zhu and Lai. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cardiovascular Medicine
Wang, Xingchen
Zhu, Tianqi
Xia, Minghong
Liu, Yu
Wang, Yao
Wang, Xizhi
Zhuang, Lenan
Zhong, Danfeng
Zhu, Jun
He, Hong
Weng, Shaoxiang
Zhu, Junhui
Lai, Dongwu
Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost
title Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost
title_full Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost
title_fullStr Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost
title_full_unstemmed Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost
title_short Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost
title_sort predicting the prognosis of patients in the coronary care unit: a novel multi-category machine learning model using xgboost
topic Cardiovascular Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9133425/
https://www.ncbi.nlm.nih.gov/pubmed/35647052
http://dx.doi.org/10.3389/fcvm.2022.764629
work_keys_str_mv AT wangxingchen predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT zhutianqi predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT xiaminghong predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT liuyu predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT wangyao predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT wangxizhi predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT zhuanglenan predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT zhongdanfeng predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT zhujun predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT hehong predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT wengshaoxiang predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT zhujunhui predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost
AT laidongwu predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost