Cargando…
A study on predicting the length of hospital stay for Chinese patients with ischemic stroke based on the XGBoost algorithm
BACKGROUND: The incidence of stroke is a challenge in China, as stroke imposes a heavy burden on families, national health services, social services, and the economy. The length of hospital stay (LOS) is an essential indicator of utilization of medical services and is usually used to assess the effi...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10031936/ https://www.ncbi.nlm.nih.gov/pubmed/36949434 http://dx.doi.org/10.1186/s12911-023-02140-4 |
_version_ | 1784910695698006016 |
---|---|
author | Chen, Rui Zhang, Shengfa Li, Jie Guo, Dongwei Zhang, Weijun Wang, Xiaoying Tian, Donghua Qu, Zhiyong Wang, Xiaohua |
author_facet | Chen, Rui Zhang, Shengfa Li, Jie Guo, Dongwei Zhang, Weijun Wang, Xiaoying Tian, Donghua Qu, Zhiyong Wang, Xiaohua |
author_sort | Chen, Rui |
collection | PubMed |
description | BACKGROUND: The incidence of stroke is a challenge in China, as stroke imposes a heavy burden on families, national health services, social services, and the economy. The length of hospital stay (LOS) is an essential indicator of utilization of medical services and is usually used to assess the efficiency of hospital management and patient quality of care. This study established a prediction model based on a machine learning algorithm to predict ischemic stroke patients’ LOS. METHODS: A total of 18,195 ischemic stroke patients’ electronic medical records and 28 attributes were extracted from electronic medical records in a large comprehensive hospital in China. The prediction of LOS was regarded as a multi classification problem, and LOS was divided into three categories: 1–7 days, 8–14 days and more than 14 days. After preprocessing the data and feature selection, the XGBoost algorithm was used to build a machine learning model. Ten fold cross-validation was used for model validation. The accuracy (ACC), recall rate (RE) and F1 measure were used to evaluate the performance of the prediction model of LOS of ischemic stroke patients. Finally, the XGBoost algorithm was used to identify and remove irrelevant features by ranking all attributes based on feature importance. RESULTS: Compared with the naive Bayesian algorithm, logistic region algorithm, decision tree classifier algorithm and ADaBoost classifier algorithm, the XGBoot algorithm has higher ACC, RE and F1 measure. The average ACC, RE and F1 measure were 0.89, 0.89 and 0.89 under the 10-fold cross-validation. According to the analysis of the importance of features, the LOS of ischemic stroke patients was affected by demographic characteristics, past medical history, admission examination features, and operation characteristics. Finally, the features in terms of hemiplegia aphasia, MRS, NIHSS, TIA, Operation or not, coma index etc. were found to be the top features in importance in predicting the LOS of ischemic stroke patients. CONCLUSIONS: The XGBoost algorithm was an appropriate machine learning method for predicting the LOS of patients with ischemic stroke. Based on the prediction model, an intelligent medical management prediction system could be developed to predict the LOS based on ischemic stroke patients’ electronic medical records. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02140-4 |
format | Online Article Text |
id | pubmed-10031936 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-100319362023-03-23 A study on predicting the length of hospital stay for Chinese patients with ischemic stroke based on the XGBoost algorithm Chen, Rui Zhang, Shengfa Li, Jie Guo, Dongwei Zhang, Weijun Wang, Xiaoying Tian, Donghua Qu, Zhiyong Wang, Xiaohua BMC Med Inform Decis Mak Research Article BACKGROUND: The incidence of stroke is a challenge in China, as stroke imposes a heavy burden on families, national health services, social services, and the economy. The length of hospital stay (LOS) is an essential indicator of utilization of medical services and is usually used to assess the efficiency of hospital management and patient quality of care. This study established a prediction model based on a machine learning algorithm to predict ischemic stroke patients’ LOS. METHODS: A total of 18,195 ischemic stroke patients’ electronic medical records and 28 attributes were extracted from electronic medical records in a large comprehensive hospital in China. The prediction of LOS was regarded as a multi classification problem, and LOS was divided into three categories: 1–7 days, 8–14 days and more than 14 days. After preprocessing the data and feature selection, the XGBoost algorithm was used to build a machine learning model. Ten fold cross-validation was used for model validation. The accuracy (ACC), recall rate (RE) and F1 measure were used to evaluate the performance of the prediction model of LOS of ischemic stroke patients. Finally, the XGBoost algorithm was used to identify and remove irrelevant features by ranking all attributes based on feature importance. RESULTS: Compared with the naive Bayesian algorithm, logistic region algorithm, decision tree classifier algorithm and ADaBoost classifier algorithm, the XGBoot algorithm has higher ACC, RE and F1 measure. The average ACC, RE and F1 measure were 0.89, 0.89 and 0.89 under the 10-fold cross-validation. According to the analysis of the importance of features, the LOS of ischemic stroke patients was affected by demographic characteristics, past medical history, admission examination features, and operation characteristics. Finally, the features in terms of hemiplegia aphasia, MRS, NIHSS, TIA, Operation or not, coma index etc. were found to be the top features in importance in predicting the LOS of ischemic stroke patients. CONCLUSIONS: The XGBoost algorithm was an appropriate machine learning method for predicting the LOS of patients with ischemic stroke. Based on the prediction model, an intelligent medical management prediction system could be developed to predict the LOS based on ischemic stroke patients’ electronic medical records. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02140-4 BioMed Central 2023-03-22 /pmc/articles/PMC10031936/ /pubmed/36949434 http://dx.doi.org/10.1186/s12911-023-02140-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Chen, Rui Zhang, Shengfa Li, Jie Guo, Dongwei Zhang, Weijun Wang, Xiaoying Tian, Donghua Qu, Zhiyong Wang, Xiaohua A study on predicting the length of hospital stay for Chinese patients with ischemic stroke based on the XGBoost algorithm |
title | A study on predicting the length of hospital stay for Chinese patients with ischemic stroke based on the XGBoost algorithm |
title_full | A study on predicting the length of hospital stay for Chinese patients with ischemic stroke based on the XGBoost algorithm |
title_fullStr | A study on predicting the length of hospital stay for Chinese patients with ischemic stroke based on the XGBoost algorithm |
title_full_unstemmed | A study on predicting the length of hospital stay for Chinese patients with ischemic stroke based on the XGBoost algorithm |
title_short | A study on predicting the length of hospital stay for Chinese patients with ischemic stroke based on the XGBoost algorithm |
title_sort | study on predicting the length of hospital stay for chinese patients with ischemic stroke based on the xgboost algorithm |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10031936/ https://www.ncbi.nlm.nih.gov/pubmed/36949434 http://dx.doi.org/10.1186/s12911-023-02140-4 |
work_keys_str_mv | AT chenrui astudyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT zhangshengfa astudyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT lijie astudyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT guodongwei astudyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT zhangweijun astudyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT wangxiaoying astudyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT tiandonghua astudyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT quzhiyong astudyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT wangxiaohua astudyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT chenrui studyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT zhangshengfa studyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT lijie studyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT guodongwei studyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT zhangweijun studyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT wangxiaoying studyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT tiandonghua studyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT quzhiyong studyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm AT wangxiaohua studyonpredictingthelengthofhospitalstayforchinesepatientswithischemicstrokebasedonthexgboostalgorithm |