Cargando…
Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost
BACKGROUND: Early prediction and classification of prognosis is essential for patients in the coronary care unit (CCU). We applied a machine learning (ML) model using the eXtreme Gradient Boosting (XGBoost) algorithm to prognosticate CCU patients and compared XGBoost with traditional classification...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9133425/ https://www.ncbi.nlm.nih.gov/pubmed/35647052 http://dx.doi.org/10.3389/fcvm.2022.764629 |
_version_ | 1784713564449144832 |
---|---|
author | Wang, Xingchen Zhu, Tianqi Xia, Minghong Liu, Yu Wang, Yao Wang, Xizhi Zhuang, Lenan Zhong, Danfeng Zhu, Jun He, Hong Weng, Shaoxiang Zhu, Junhui Lai, Dongwu |
author_facet | Wang, Xingchen Zhu, Tianqi Xia, Minghong Liu, Yu Wang, Yao Wang, Xizhi Zhuang, Lenan Zhong, Danfeng Zhu, Jun He, Hong Weng, Shaoxiang Zhu, Junhui Lai, Dongwu |
author_sort | Wang, Xingchen |
collection | PubMed |
description | BACKGROUND: Early prediction and classification of prognosis is essential for patients in the coronary care unit (CCU). We applied a machine learning (ML) model using the eXtreme Gradient Boosting (XGBoost) algorithm to prognosticate CCU patients and compared XGBoost with traditional classification models. METHODS: CCU patients' data were extracted from the MIMIC-III v1.4 clinical database, and divided into four groups based on the time to death: <30 days, 30 days−1 year, 1–5 years, and ≥5 years. Four classification models, including XGBoost, naïve Bayes (NB), logistic regression (LR), and support vector machine (SVM) were constructed using the Python software. These four models were tested and compared for accuracy, F1 score, Matthews correlation coefficient (MCC), and area under the curve (AUC) of the receiver operating characteristic curves. Subsequently, Local Interpretable Model-Agnostic Explanations method was performed to improve XGBoost model interpretability. We also constructed sub-models of each model based on the different categories of death time and compared the differences by decision curve analysis. The optimal model was further analyzed using a clinical impact curve. At last, feature ablation curves of the XGBoost model were conducted to obtain the simplified model. RESULTS: Overall, 5360 CCU patients were included. Compared to NB, LR, and SVM, the XGBoost model showed better accuracy (0.663, 0.605, 0.632, and 0.622), micro-AUCs (0.873, 0.811, 0.841, and 0.818), and MCC (0.337, 0.317, 0.250, and 0.182). In subgroup analysis, the XGBoost model had a better predictive performance in acute myocardial infarction subgroup. The decision curve and clinical impact curve analyses verified the clinical utility of the XGBoost model for different categories of patients. Finally, we obtained a simplified model with thirty features. CONCLUSIONS: For CCU physicians, the ML technique by XGBoost is a potential predictive tool in patients with different conditions, and it may contribute to improvements in prognosis. |
format | Online Article Text |
id | pubmed-9133425 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-91334252022-05-27 Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost Wang, Xingchen Zhu, Tianqi Xia, Minghong Liu, Yu Wang, Yao Wang, Xizhi Zhuang, Lenan Zhong, Danfeng Zhu, Jun He, Hong Weng, Shaoxiang Zhu, Junhui Lai, Dongwu Front Cardiovasc Med Cardiovascular Medicine BACKGROUND: Early prediction and classification of prognosis is essential for patients in the coronary care unit (CCU). We applied a machine learning (ML) model using the eXtreme Gradient Boosting (XGBoost) algorithm to prognosticate CCU patients and compared XGBoost with traditional classification models. METHODS: CCU patients' data were extracted from the MIMIC-III v1.4 clinical database, and divided into four groups based on the time to death: <30 days, 30 days−1 year, 1–5 years, and ≥5 years. Four classification models, including XGBoost, naïve Bayes (NB), logistic regression (LR), and support vector machine (SVM) were constructed using the Python software. These four models were tested and compared for accuracy, F1 score, Matthews correlation coefficient (MCC), and area under the curve (AUC) of the receiver operating characteristic curves. Subsequently, Local Interpretable Model-Agnostic Explanations method was performed to improve XGBoost model interpretability. We also constructed sub-models of each model based on the different categories of death time and compared the differences by decision curve analysis. The optimal model was further analyzed using a clinical impact curve. At last, feature ablation curves of the XGBoost model were conducted to obtain the simplified model. RESULTS: Overall, 5360 CCU patients were included. Compared to NB, LR, and SVM, the XGBoost model showed better accuracy (0.663, 0.605, 0.632, and 0.622), micro-AUCs (0.873, 0.811, 0.841, and 0.818), and MCC (0.337, 0.317, 0.250, and 0.182). In subgroup analysis, the XGBoost model had a better predictive performance in acute myocardial infarction subgroup. The decision curve and clinical impact curve analyses verified the clinical utility of the XGBoost model for different categories of patients. Finally, we obtained a simplified model with thirty features. CONCLUSIONS: For CCU physicians, the ML technique by XGBoost is a potential predictive tool in patients with different conditions, and it may contribute to improvements in prognosis. Frontiers Media S.A. 2022-05-12 /pmc/articles/PMC9133425/ /pubmed/35647052 http://dx.doi.org/10.3389/fcvm.2022.764629 Text en Copyright © 2022 Wang, Zhu, Xia, Liu, Wang, Wang, Zhuang, Zhong, Zhu, He, Weng, Zhu and Lai. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cardiovascular Medicine Wang, Xingchen Zhu, Tianqi Xia, Minghong Liu, Yu Wang, Yao Wang, Xizhi Zhuang, Lenan Zhong, Danfeng Zhu, Jun He, Hong Weng, Shaoxiang Zhu, Junhui Lai, Dongwu Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost |
title | Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost |
title_full | Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost |
title_fullStr | Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost |
title_full_unstemmed | Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost |
title_short | Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost |
title_sort | predicting the prognosis of patients in the coronary care unit: a novel multi-category machine learning model using xgboost |
topic | Cardiovascular Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9133425/ https://www.ncbi.nlm.nih.gov/pubmed/35647052 http://dx.doi.org/10.3389/fcvm.2022.764629 |
work_keys_str_mv | AT wangxingchen predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT zhutianqi predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT xiaminghong predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT liuyu predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT wangyao predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT wangxizhi predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT zhuanglenan predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT zhongdanfeng predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT zhujun predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT hehong predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT wengshaoxiang predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT zhujunhui predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost AT laidongwu predictingtheprognosisofpatientsinthecoronarycareunitanovelmulticategorymachinelearningmodelusingxgboost |