Cargando…

Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury

BACKGROUND: Sepsis-associated acute kidney injury (S-AKI) is a major contributor to mortality in intensive care units (ICU). Early prediction of mortality risk is crucial to enhance prognosis and optimize clinical decisions. This study aims to develop a 28-day mortality risk prediction model for S-A...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Jijun, Peng, Hongbing, Luo, Youhong, Zhu, Tao, Xie, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10232880/
https://www.ncbi.nlm.nih.gov/pubmed/37275353
http://dx.doi.org/10.3389/fmed.2023.1165129
_version_ 1785052096408584192
author Yang, Jijun
Peng, Hongbing
Luo, Youhong
Zhu, Tao
Xie, Li
author_facet Yang, Jijun
Peng, Hongbing
Luo, Youhong
Zhu, Tao
Xie, Li
author_sort Yang, Jijun
collection PubMed
description BACKGROUND: Sepsis-associated acute kidney injury (S-AKI) is a major contributor to mortality in intensive care units (ICU). Early prediction of mortality risk is crucial to enhance prognosis and optimize clinical decisions. This study aims to develop a 28-day mortality risk prediction model for S-AKI utilizing an explainable ensemble machine learning (ML) algorithm. METHODS: This study utilized data from the Medical Information Mart for Intensive Care IV (MIMIC-IV 2.0) database to gather information on patients with S-AKI. Univariate regression, correlation analysis and Boruta were combined for feature selection. To construct the four ML models, hyperparameters were tuned via random search and five-fold cross-validation. To evaluate the performance of all models, ROC, K-S, and LIFT curves were used. The discrimination of ML models and traditional scoring systems was compared using area under the receiver operating characteristic curve (AUC). Additionally, the SHapley Additive exPlanation (SHAP) was utilized to interpret the ML model and identify essential variables. To investigate the relationship between the top nine continuous variables and the risk of 28-day mortality. COX regression-restricted cubic splines were utilized while controlling for age and comorbidities. RESULTS: The study analyzed data from 9,158 patients with S-AKI, dividing them into a 28-day mortality group of 1,940 and a survival group of 7,578. The results showed that XGBoost was the best performing model of the four ML models with AUC of 0.873. All models outperformed APS-III 0.713 and SAPS-II 0.681. The K-S and LIFT curves indicated XGBoost as the most effective predictor for 28-day mortality risk. The model’s performance was evaluated using ROCpr curves, calibration curves, accuracy, precision, and F1 scores. SHAP force plots were utilized to interpret and visualize the personalized predictive power of the 28-day mortality risk model. Additionally, COX regression restricted cubic splines revealed an interesting non-linear relationship between the top nine variables and 28-day mortality. CONCLUSION: The use of ensemble ML models has shown to be more effective than the LR model and conventional scoring systems in predicting 28-day mortality risk in S-AKI patients. By visualizing the XGBoost model with the best predictive performance, clinicians are able to identify high-risk patients early on and improve prognosis.
format Online
Article
Text
id pubmed-10232880
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-102328802023-06-02 Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury Yang, Jijun Peng, Hongbing Luo, Youhong Zhu, Tao Xie, Li Front Med (Lausanne) Medicine BACKGROUND: Sepsis-associated acute kidney injury (S-AKI) is a major contributor to mortality in intensive care units (ICU). Early prediction of mortality risk is crucial to enhance prognosis and optimize clinical decisions. This study aims to develop a 28-day mortality risk prediction model for S-AKI utilizing an explainable ensemble machine learning (ML) algorithm. METHODS: This study utilized data from the Medical Information Mart for Intensive Care IV (MIMIC-IV 2.0) database to gather information on patients with S-AKI. Univariate regression, correlation analysis and Boruta were combined for feature selection. To construct the four ML models, hyperparameters were tuned via random search and five-fold cross-validation. To evaluate the performance of all models, ROC, K-S, and LIFT curves were used. The discrimination of ML models and traditional scoring systems was compared using area under the receiver operating characteristic curve (AUC). Additionally, the SHapley Additive exPlanation (SHAP) was utilized to interpret the ML model and identify essential variables. To investigate the relationship between the top nine continuous variables and the risk of 28-day mortality. COX regression-restricted cubic splines were utilized while controlling for age and comorbidities. RESULTS: The study analyzed data from 9,158 patients with S-AKI, dividing them into a 28-day mortality group of 1,940 and a survival group of 7,578. The results showed that XGBoost was the best performing model of the four ML models with AUC of 0.873. All models outperformed APS-III 0.713 and SAPS-II 0.681. The K-S and LIFT curves indicated XGBoost as the most effective predictor for 28-day mortality risk. The model’s performance was evaluated using ROCpr curves, calibration curves, accuracy, precision, and F1 scores. SHAP force plots were utilized to interpret and visualize the personalized predictive power of the 28-day mortality risk model. Additionally, COX regression restricted cubic splines revealed an interesting non-linear relationship between the top nine variables and 28-day mortality. CONCLUSION: The use of ensemble ML models has shown to be more effective than the LR model and conventional scoring systems in predicting 28-day mortality risk in S-AKI patients. By visualizing the XGBoost model with the best predictive performance, clinicians are able to identify high-risk patients early on and improve prognosis. Frontiers Media S.A. 2023-05-18 /pmc/articles/PMC10232880/ /pubmed/37275353 http://dx.doi.org/10.3389/fmed.2023.1165129 Text en Copyright © 2023 Yang, Peng, Luo, Zhu and Xie. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Yang, Jijun
Peng, Hongbing
Luo, Youhong
Zhu, Tao
Xie, Li
Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury
title Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury
title_full Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury
title_fullStr Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury
title_full_unstemmed Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury
title_short Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury
title_sort explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10232880/
https://www.ncbi.nlm.nih.gov/pubmed/37275353
http://dx.doi.org/10.3389/fmed.2023.1165129
work_keys_str_mv AT yangjijun explainableensemblemachinelearningmodelforpredictionof28daymortalityriskinpatientswithsepsisassociatedacutekidneyinjury
AT penghongbing explainableensemblemachinelearningmodelforpredictionof28daymortalityriskinpatientswithsepsisassociatedacutekidneyinjury
AT luoyouhong explainableensemblemachinelearningmodelforpredictionof28daymortalityriskinpatientswithsepsisassociatedacutekidneyinjury
AT zhutao explainableensemblemachinelearningmodelforpredictionof28daymortalityriskinpatientswithsepsisassociatedacutekidneyinjury
AT xieli explainableensemblemachinelearningmodelforpredictionof28daymortalityriskinpatientswithsepsisassociatedacutekidneyinjury