Cargando…

Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation

BACKGROUND: Timely and accurate outcome prediction plays a critical role in guiding clinical decisions for hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. However, interpreting and translating the predictive models into clinical applications are as important as the predicti...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Jian, Chen, Huaqiao, Deng, Jiewen, Liu, Xiaozhu, Shu, Tingting, Yin, Chengliang, Duan, Minjie, Fu, Li, Wang, Kai, Zeng, Song
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10443100/
https://www.ncbi.nlm.nih.gov/pubmed/37614971
http://dx.doi.org/10.3389/fneur.2023.1185447
_version_ 1785093750510321664
author Huang, Jian
Chen, Huaqiao
Deng, Jiewen
Liu, Xiaozhu
Shu, Tingting
Yin, Chengliang
Duan, Minjie
Fu, Li
Wang, Kai
Zeng, Song
author_facet Huang, Jian
Chen, Huaqiao
Deng, Jiewen
Liu, Xiaozhu
Shu, Tingting
Yin, Chengliang
Duan, Minjie
Fu, Li
Wang, Kai
Zeng, Song
author_sort Huang, Jian
collection PubMed
description BACKGROUND: Timely and accurate outcome prediction plays a critical role in guiding clinical decisions for hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. However, interpreting and translating the predictive models into clinical applications are as important as the prediction itself. This study aimed to develop an interpretable machine learning (IML) model that accurately predicts 28-day all-cause mortality in hypertensive ischemic or hemorrhagic stroke patients. METHODS: A total of 4,274 hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU in the USA from multicenter cohorts were included in this study to develop and validate the IML model. Five machine learning (ML) models were developed, including artificial neural network (ANN), gradient boosting machine (GBM), eXtreme Gradient Boosting (XGBoost), logistic regression (LR), and support vector machine (SVM), to predict mortality using the MIMIC-IV and eICU-CRD database in the USA. Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm. Model performance was evaluated based on the area under the curve (AUC), accuracy, positive predictive value (PPV), and negative predictive value (NPV). The ML model with the best predictive performance was selected for interpretability analysis. Finally, the SHapley Additive exPlanations (SHAP) method was employed to evaluate the risk of all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. RESULTS: The XGBoost model demonstrated the best predictive performance, with the AUC values of 0.822, 0.739, and 0.700 in the training, test, and external cohorts, respectively. The analysis of feature importance revealed that age, ethnicity, white blood cell (WBC), hyperlipidemia, mean corpuscular volume (MCV), glucose, pulse oximeter oxygen saturation (SpO(2)), serum calcium, red blood cell distribution width (RDW), blood urea nitrogen (BUN), and bicarbonate were the 11 most important features. The SHAP plots were employed to interpret the XGBoost model. CONCLUSIONS: The XGBoost model accurately predicted 28-day all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. The SHAP method can provide explicit explanations of personalized risk prediction, which can aid physicians in understanding the model.
format Online
Article
Text
id pubmed-10443100
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104431002023-08-23 Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation Huang, Jian Chen, Huaqiao Deng, Jiewen Liu, Xiaozhu Shu, Tingting Yin, Chengliang Duan, Minjie Fu, Li Wang, Kai Zeng, Song Front Neurol Neurology BACKGROUND: Timely and accurate outcome prediction plays a critical role in guiding clinical decisions for hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. However, interpreting and translating the predictive models into clinical applications are as important as the prediction itself. This study aimed to develop an interpretable machine learning (IML) model that accurately predicts 28-day all-cause mortality in hypertensive ischemic or hemorrhagic stroke patients. METHODS: A total of 4,274 hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU in the USA from multicenter cohorts were included in this study to develop and validate the IML model. Five machine learning (ML) models were developed, including artificial neural network (ANN), gradient boosting machine (GBM), eXtreme Gradient Boosting (XGBoost), logistic regression (LR), and support vector machine (SVM), to predict mortality using the MIMIC-IV and eICU-CRD database in the USA. Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm. Model performance was evaluated based on the area under the curve (AUC), accuracy, positive predictive value (PPV), and negative predictive value (NPV). The ML model with the best predictive performance was selected for interpretability analysis. Finally, the SHapley Additive exPlanations (SHAP) method was employed to evaluate the risk of all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. RESULTS: The XGBoost model demonstrated the best predictive performance, with the AUC values of 0.822, 0.739, and 0.700 in the training, test, and external cohorts, respectively. The analysis of feature importance revealed that age, ethnicity, white blood cell (WBC), hyperlipidemia, mean corpuscular volume (MCV), glucose, pulse oximeter oxygen saturation (SpO(2)), serum calcium, red blood cell distribution width (RDW), blood urea nitrogen (BUN), and bicarbonate were the 11 most important features. The SHAP plots were employed to interpret the XGBoost model. CONCLUSIONS: The XGBoost model accurately predicted 28-day all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. The SHAP method can provide explicit explanations of personalized risk prediction, which can aid physicians in understanding the model. Frontiers Media S.A. 2023-08-08 /pmc/articles/PMC10443100/ /pubmed/37614971 http://dx.doi.org/10.3389/fneur.2023.1185447 Text en Copyright © 2023 Huang, Chen, Deng, Liu, Shu, Yin, Duan, Fu, Wang and Zeng. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neurology
Huang, Jian
Chen, Huaqiao
Deng, Jiewen
Liu, Xiaozhu
Shu, Tingting
Yin, Chengliang
Duan, Minjie
Fu, Li
Wang, Kai
Zeng, Song
Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation
title Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation
title_full Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation
title_fullStr Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation
title_full_unstemmed Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation
title_short Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation
title_sort interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the icu: a multi-center retrospective cohort study with internal and external cross-validation
topic Neurology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10443100/
https://www.ncbi.nlm.nih.gov/pubmed/37614971
http://dx.doi.org/10.3389/fneur.2023.1185447
work_keys_str_mv AT huangjian interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation
AT chenhuaqiao interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation
AT dengjiewen interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation
AT liuxiaozhu interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation
AT shutingting interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation
AT yinchengliang interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation
AT duanminjie interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation
AT fuli interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation
AT wangkai interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation
AT zengsong interpretablemachinelearningforpredicting28dayallcauseinhospitalmortalityforhypertensiveischemicorhemorrhagicstrokepatientsintheicuamulticenterretrospectivecohortstudywithinternalandexternalcrossvalidation