Cargando…

Machine Learning-Enabled 30-Day Readmission Model for Stroke Patients

Background and Purpose: Hospital readmissions impose a substantial burden on the healthcare system. Reducing readmissions after stroke could lead to improved quality of care especially since stroke is associated with a high rate of readmission. The goal of this study is to enhance our understanding...

Descripción completa

Detalles Bibliográficos
Autores principales: Darabi, Negar, Hosseinichimeh, Niyousha, Noto, Anthony, Zand, Ramin, Abedi, Vida
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8044392/
https://www.ncbi.nlm.nih.gov/pubmed/33868147
http://dx.doi.org/10.3389/fneur.2021.638267
_version_ 1783678474258481152
author Darabi, Negar
Hosseinichimeh, Niyousha
Noto, Anthony
Zand, Ramin
Abedi, Vida
author_facet Darabi, Negar
Hosseinichimeh, Niyousha
Noto, Anthony
Zand, Ramin
Abedi, Vida
author_sort Darabi, Negar
collection PubMed
description Background and Purpose: Hospital readmissions impose a substantial burden on the healthcare system. Reducing readmissions after stroke could lead to improved quality of care especially since stroke is associated with a high rate of readmission. The goal of this study is to enhance our understanding of the predictors of 30-day readmission after ischemic stroke and develop models to identify high-risk individuals for targeted interventions. Methods: We used patient-level data from electronic health records (EHR), five machine learning algorithms (random forest, gradient boosting machine, extreme gradient boosting–XGBoost, support vector machine, and logistic regression-LR), data-driven feature selection strategy, and adaptive sampling to develop 15 models of 30-day readmission after ischemic stroke. We further identified important clinical variables. Results: We included 3,184 patients with ischemic stroke (mean age: 71 ± 13.90 years, men: 51.06%). Among the 61 clinical variables included in the model, the National Institutes of Health Stroke Scale score above 24, insert indwelling urinary catheter, hypercoagulable state, and percutaneous gastrostomy had the highest importance score. The Model's AUC (area under the curve) for predicting 30-day readmission was 0.74 (95%CI: 0.64–0.78) with PPV of 0.43 when the XGBoost algorithm was used with ROSE-sampling. The balance between specificity and sensitivity improved through the sampling strategy. The best sensitivity was achieved with LR when optimized with feature selection and ROSE-sampling (AUC: 0.64, sensitivity: 0.53, specificity: 0.69). Conclusions: Machine learning-based models can be designed to predict 30-day readmission after stroke using structured data from EHR. Among the algorithms analyzed, XGBoost with ROSE-sampling had the best performance in terms of AUC while LR with ROSE-sampling and feature selection had the best sensitivity. Clinical variables highly associated with 30-day readmission could be targeted for personalized interventions. Depending on healthcare systems' resources and criteria, models with optimized performance metrics can be implemented to improve outcomes.
format Online
Article
Text
id pubmed-8044392
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-80443922021-04-15 Machine Learning-Enabled 30-Day Readmission Model for Stroke Patients Darabi, Negar Hosseinichimeh, Niyousha Noto, Anthony Zand, Ramin Abedi, Vida Front Neurol Neurology Background and Purpose: Hospital readmissions impose a substantial burden on the healthcare system. Reducing readmissions after stroke could lead to improved quality of care especially since stroke is associated with a high rate of readmission. The goal of this study is to enhance our understanding of the predictors of 30-day readmission after ischemic stroke and develop models to identify high-risk individuals for targeted interventions. Methods: We used patient-level data from electronic health records (EHR), five machine learning algorithms (random forest, gradient boosting machine, extreme gradient boosting–XGBoost, support vector machine, and logistic regression-LR), data-driven feature selection strategy, and adaptive sampling to develop 15 models of 30-day readmission after ischemic stroke. We further identified important clinical variables. Results: We included 3,184 patients with ischemic stroke (mean age: 71 ± 13.90 years, men: 51.06%). Among the 61 clinical variables included in the model, the National Institutes of Health Stroke Scale score above 24, insert indwelling urinary catheter, hypercoagulable state, and percutaneous gastrostomy had the highest importance score. The Model's AUC (area under the curve) for predicting 30-day readmission was 0.74 (95%CI: 0.64–0.78) with PPV of 0.43 when the XGBoost algorithm was used with ROSE-sampling. The balance between specificity and sensitivity improved through the sampling strategy. The best sensitivity was achieved with LR when optimized with feature selection and ROSE-sampling (AUC: 0.64, sensitivity: 0.53, specificity: 0.69). Conclusions: Machine learning-based models can be designed to predict 30-day readmission after stroke using structured data from EHR. Among the algorithms analyzed, XGBoost with ROSE-sampling had the best performance in terms of AUC while LR with ROSE-sampling and feature selection had the best sensitivity. Clinical variables highly associated with 30-day readmission could be targeted for personalized interventions. Depending on healthcare systems' resources and criteria, models with optimized performance metrics can be implemented to improve outcomes. Frontiers Media S.A. 2021-03-31 /pmc/articles/PMC8044392/ /pubmed/33868147 http://dx.doi.org/10.3389/fneur.2021.638267 Text en Copyright © 2021 Darabi, Hosseinichimeh, Noto, Zand and Abedi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neurology
Darabi, Negar
Hosseinichimeh, Niyousha
Noto, Anthony
Zand, Ramin
Abedi, Vida
Machine Learning-Enabled 30-Day Readmission Model for Stroke Patients
title Machine Learning-Enabled 30-Day Readmission Model for Stroke Patients
title_full Machine Learning-Enabled 30-Day Readmission Model for Stroke Patients
title_fullStr Machine Learning-Enabled 30-Day Readmission Model for Stroke Patients
title_full_unstemmed Machine Learning-Enabled 30-Day Readmission Model for Stroke Patients
title_short Machine Learning-Enabled 30-Day Readmission Model for Stroke Patients
title_sort machine learning-enabled 30-day readmission model for stroke patients
topic Neurology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8044392/
https://www.ncbi.nlm.nih.gov/pubmed/33868147
http://dx.doi.org/10.3389/fneur.2021.638267
work_keys_str_mv AT darabinegar machinelearningenabled30dayreadmissionmodelforstrokepatients
AT hosseinichimehniyousha machinelearningenabled30dayreadmissionmodelforstrokepatients
AT notoanthony machinelearningenabled30dayreadmissionmodelforstrokepatients
AT zandramin machinelearningenabled30dayreadmissionmodelforstrokepatients
AT abedivida machinelearningenabled30dayreadmissionmodelforstrokepatients