Cargando…

Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study

BACKGROUND: Timely and accurate outcome prediction plays a vital role in guiding clinical decisions on acute ischemic stroke. Early condition deterioration and severity after the acute stage are determinants for long-term outcomes. Therefore, predicting early outcomes is crucial in acute stroke mana...

Descripción completa

Detalles Bibliográficos
Autores principales:	Su, Po-Yuan, Wei, Yi-Chia, Luo, Hao, Liu, Chi-Hung, Huang, Wen-Yi, Chen, Kuan-Fu, Lin, Ching-Po, Wei, Hung-Yu, Lee, Tsong-Hai
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2022
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994144/ https://www.ncbi.nlm.nih.gov/pubmed/35072631 http://dx.doi.org/10.2196/32508

_version_	1784684049627873280
author	Su, Po-Yuan Wei, Yi-Chia Luo, Hao Liu, Chi-Hung Huang, Wen-Yi Chen, Kuan-Fu Lin, Ching-Po Wei, Hung-Yu Lee, Tsong-Hai
author_facet	Su, Po-Yuan Wei, Yi-Chia Luo, Hao Liu, Chi-Hung Huang, Wen-Yi Chen, Kuan-Fu Lin, Ching-Po Wei, Hung-Yu Lee, Tsong-Hai
author_sort	Su, Po-Yuan
collection	PubMed
description	BACKGROUND: Timely and accurate outcome prediction plays a vital role in guiding clinical decisions on acute ischemic stroke. Early condition deterioration and severity after the acute stage are determinants for long-term outcomes. Therefore, predicting early outcomes is crucial in acute stroke management. However, interpreting the predictions and transforming them into clinically explainable concepts are as important as the predictions themselves. OBJECTIVE: This work focused on machine learning model analysis in predicting the early outcomes of ischemic stroke and used model explanation skills in interpreting the results. METHODS: Acute ischemic stroke patients registered on the Stroke Registry of the Chang Gung Healthcare System (SRICHS) in 2009 were enrolled for machine learning predictions of the two primary outcomes: modified Rankin Scale (mRS) at hospital discharge and in-hospital deterioration. We compared 4 machine learning models, namely support vector machine (SVM), random forest (RF), light gradient boosting machine (LGBM), and deep neural network (DNN), with the area under the curve (AUC) of the receiver operating characteristic curve. Further, 3 resampling methods, random under sampling (RUS), random over sampling, and the synthetic minority over-sampling technique, dealt with the imbalanced data. The models were explained based on the ranking of feature importance and the SHapley Additive exPlanations (SHAP). RESULTS: RF performed well in both outcomes (discharge mRS: mean AUC 0.829, SD 0.018; in-hospital deterioration: mean AUC 0.710, SD 0.023 on original data and 0.728, SD 0.036 on resampled data with RUS for imbalanced data). In addition, DNN outperformed other models in predicting in-hospital deterioration on data without resampling (mean AUC 0.732, SD 0.064). In general, resampling contributed to the limited improvement of model performance in predicting in-hospital deterioration using imbalanced data. The features obtained from the National Institutes of Health Stroke Scale (NIHSS), white blood cell differential counts, and age were the key features for predicting discharge mRS. In contrast, the NIHSS total score, initial blood pressure, having diabetes mellitus, and features from hemograms were the most important features in predicting in-hospital deterioration. The SHAP summary described the impacts of the feature values on each outcome prediction. CONCLUSIONS: Machine learning models are feasible in predicting early stroke outcomes. An enriched feature bank could improve model performance. Initial neurological levels and age determined the activity independence at hospital discharge. In addition, physiological and laboratory surveillance aided in predicting in-hospital deterioration. The use of the SHAP explanatory method successfully transformed machine learning predictions into clinically meaningful results.
format	Online Article Text
id	pubmed-8994144
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-89941442022-04-10 Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study Su, Po-Yuan Wei, Yi-Chia Luo, Hao Liu, Chi-Hung Huang, Wen-Yi Chen, Kuan-Fu Lin, Ching-Po Wei, Hung-Yu Lee, Tsong-Hai JMIR Med Inform Original Paper BACKGROUND: Timely and accurate outcome prediction plays a vital role in guiding clinical decisions on acute ischemic stroke. Early condition deterioration and severity after the acute stage are determinants for long-term outcomes. Therefore, predicting early outcomes is crucial in acute stroke management. However, interpreting the predictions and transforming them into clinically explainable concepts are as important as the predictions themselves. OBJECTIVE: This work focused on machine learning model analysis in predicting the early outcomes of ischemic stroke and used model explanation skills in interpreting the results. METHODS: Acute ischemic stroke patients registered on the Stroke Registry of the Chang Gung Healthcare System (SRICHS) in 2009 were enrolled for machine learning predictions of the two primary outcomes: modified Rankin Scale (mRS) at hospital discharge and in-hospital deterioration. We compared 4 machine learning models, namely support vector machine (SVM), random forest (RF), light gradient boosting machine (LGBM), and deep neural network (DNN), with the area under the curve (AUC) of the receiver operating characteristic curve. Further, 3 resampling methods, random under sampling (RUS), random over sampling, and the synthetic minority over-sampling technique, dealt with the imbalanced data. The models were explained based on the ranking of feature importance and the SHapley Additive exPlanations (SHAP). RESULTS: RF performed well in both outcomes (discharge mRS: mean AUC 0.829, SD 0.018; in-hospital deterioration: mean AUC 0.710, SD 0.023 on original data and 0.728, SD 0.036 on resampled data with RUS for imbalanced data). In addition, DNN outperformed other models in predicting in-hospital deterioration on data without resampling (mean AUC 0.732, SD 0.064). In general, resampling contributed to the limited improvement of model performance in predicting in-hospital deterioration using imbalanced data. The features obtained from the National Institutes of Health Stroke Scale (NIHSS), white blood cell differential counts, and age were the key features for predicting discharge mRS. In contrast, the NIHSS total score, initial blood pressure, having diabetes mellitus, and features from hemograms were the most important features in predicting in-hospital deterioration. The SHAP summary described the impacts of the feature values on each outcome prediction. CONCLUSIONS: Machine learning models are feasible in predicting early stroke outcomes. An enriched feature bank could improve model performance. Initial neurological levels and age determined the activity independence at hospital discharge. In addition, physiological and laboratory surveillance aided in predicting in-hospital deterioration. The use of the SHAP explanatory method successfully transformed machine learning predictions into clinically meaningful results. JMIR Publications 2022-03-25 /pmc/articles/PMC8994144/ /pubmed/35072631 http://dx.doi.org/10.2196/32508 Text en ©Po-Yuan Su, Yi-Chia Wei, Hao Luo, Chi-Hung Liu, Wen-Yi Huang, Kuan-Fu Chen, Ching-Po Lin, Hung-Yu Wei, Tsong-Hai Lee. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 25.03.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Su, Po-Yuan Wei, Yi-Chia Luo, Hao Liu, Chi-Hung Huang, Wen-Yi Chen, Kuan-Fu Lin, Ching-Po Wei, Hung-Yu Lee, Tsong-Hai Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study
title	Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study
title_full	Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study
title_fullStr	Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study
title_full_unstemmed	Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study
title_short	Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study
title_sort	machine learning models for predicting influential factors of early outcomes in acute ischemic stroke: registry-based study
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994144/ https://www.ncbi.nlm.nih.gov/pubmed/35072631 http://dx.doi.org/10.2196/32508
work_keys_str_mv	AT supoyuan machinelearningmodelsforpredictinginfluentialfactorsofearlyoutcomesinacuteischemicstrokeregistrybasedstudy AT weiyichia machinelearningmodelsforpredictinginfluentialfactorsofearlyoutcomesinacuteischemicstrokeregistrybasedstudy AT luohao machinelearningmodelsforpredictinginfluentialfactorsofearlyoutcomesinacuteischemicstrokeregistrybasedstudy AT liuchihung machinelearningmodelsforpredictinginfluentialfactorsofearlyoutcomesinacuteischemicstrokeregistrybasedstudy AT huangwenyi machinelearningmodelsforpredictinginfluentialfactorsofearlyoutcomesinacuteischemicstrokeregistrybasedstudy AT chenkuanfu machinelearningmodelsforpredictinginfluentialfactorsofearlyoutcomesinacuteischemicstrokeregistrybasedstudy AT linchingpo machinelearningmodelsforpredictinginfluentialfactorsofearlyoutcomesinacuteischemicstrokeregistrybasedstudy AT weihungyu machinelearningmodelsforpredictinginfluentialfactorsofearlyoutcomesinacuteischemicstrokeregistrybasedstudy AT leetsonghai machinelearningmodelsforpredictinginfluentialfactorsofearlyoutcomesinacuteischemicstrokeregistrybasedstudy

Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study

Ejemplares similares