Cargando…

Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions

Introduction Emergency readmissions have been a long-time, multifaceted, unsolved problem. Developing a predictive model calibrated with hospital-specific Electronic Health Record (EHR) data could give higher prediction accuracy and insights into high-risk patients for readmission. Thus, we need to...

Descripción completa

Detalles Bibliográficos
Autores principales: Aldhoayan, Mohammed D, Khayat, Afnan M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cureus 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9481186/
https://www.ncbi.nlm.nih.gov/pubmed/36127978
http://dx.doi.org/10.7759/cureus.27630
_version_ 1784791207724974080
author Aldhoayan, Mohammed D
Khayat, Afnan M
author_facet Aldhoayan, Mohammed D
Khayat, Afnan M
author_sort Aldhoayan, Mohammed D
collection PubMed
description Introduction Emergency readmissions have been a long-time, multifaceted, unsolved problem. Developing a predictive model calibrated with hospital-specific Electronic Health Record (EHR) data could give higher prediction accuracy and insights into high-risk patients for readmission. Thus, we need to proactively introduce the necessary interventions. This study aims to investigate the relationship between features that consider significant predictors of at-risk patients for seven-day readmission through logistic regression in addition to developing several machine learning models to test the predictability of those attributes using EHR data in a Saudi Arabia-specific ED context. Methods Univariate and multivariate logistic regression has been used to identify the most statistically significant features that contributed to classifying readmitted and not readmitted patients. Seven different machine learning models were trained and tested, and a comparison between the best-performing model was conducted in terms of five performance metrics. To construct the prediction model and internally validate it, the processed dataset was split into two sets: 70% for the training set and 30% for the test set or validation set. Results XGBoost achieved the highest accuracy (64%) in predicting early seven-day readmissions. Catboost was the second-best predictive model at 61%. XGBoost achieved the highest specificity at 70%, and all the models had a sensitivity of 57% except for XGBoost and Catboost at 32% and 38%, respectively. All predictive attributes, patient age, length of stay (LOS) in minutes, visit time (AM), marital status (married), number of medications, and number of abnormal lab results were significant predictors of early seven-day readmissions while marital status and number of vital-sign instabilities at discharge were not statistically significant predictors of seven-day readmission. Conclusion Although XGBoost and Catboost showed good accuracy, none of the models achieved good discriminative ability in terms of sensitivity and specificity. Thus, none can be clinically used for predicting early seven-day readmission. More predictive variables need to be fed into the model, specifically predictors approximate to the day of discharge, in order to optimize the model’s performance.
format Online
Article
Text
id pubmed-9481186
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cureus
record_format MEDLINE/PubMed
spelling pubmed-94811862022-09-19 Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions Aldhoayan, Mohammed D Khayat, Afnan M Cureus Emergency Medicine Introduction Emergency readmissions have been a long-time, multifaceted, unsolved problem. Developing a predictive model calibrated with hospital-specific Electronic Health Record (EHR) data could give higher prediction accuracy and insights into high-risk patients for readmission. Thus, we need to proactively introduce the necessary interventions. This study aims to investigate the relationship between features that consider significant predictors of at-risk patients for seven-day readmission through logistic regression in addition to developing several machine learning models to test the predictability of those attributes using EHR data in a Saudi Arabia-specific ED context. Methods Univariate and multivariate logistic regression has been used to identify the most statistically significant features that contributed to classifying readmitted and not readmitted patients. Seven different machine learning models were trained and tested, and a comparison between the best-performing model was conducted in terms of five performance metrics. To construct the prediction model and internally validate it, the processed dataset was split into two sets: 70% for the training set and 30% for the test set or validation set. Results XGBoost achieved the highest accuracy (64%) in predicting early seven-day readmissions. Catboost was the second-best predictive model at 61%. XGBoost achieved the highest specificity at 70%, and all the models had a sensitivity of 57% except for XGBoost and Catboost at 32% and 38%, respectively. All predictive attributes, patient age, length of stay (LOS) in minutes, visit time (AM), marital status (married), number of medications, and number of abnormal lab results were significant predictors of early seven-day readmissions while marital status and number of vital-sign instabilities at discharge were not statistically significant predictors of seven-day readmission. Conclusion Although XGBoost and Catboost showed good accuracy, none of the models achieved good discriminative ability in terms of sensitivity and specificity. Thus, none can be clinically used for predicting early seven-day readmission. More predictive variables need to be fed into the model, specifically predictors approximate to the day of discharge, in order to optimize the model’s performance. Cureus 2022-08-03 /pmc/articles/PMC9481186/ /pubmed/36127978 http://dx.doi.org/10.7759/cureus.27630 Text en Copyright © 2022, Aldhoayan et al. https://creativecommons.org/licenses/by/3.0/This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Emergency Medicine
Aldhoayan, Mohammed D
Khayat, Afnan M
Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions
title Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions
title_full Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions
title_fullStr Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions
title_full_unstemmed Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions
title_short Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions
title_sort leveraging advanced data analytics to predict the risk of all-cause seven-day emergency readmissions
topic Emergency Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9481186/
https://www.ncbi.nlm.nih.gov/pubmed/36127978
http://dx.doi.org/10.7759/cureus.27630
work_keys_str_mv AT aldhoayanmohammedd leveragingadvanceddataanalyticstopredicttheriskofallcausesevendayemergencyreadmissions
AT khayatafnanm leveragingadvanceddataanalyticstopredicttheriskofallcausesevendayemergencyreadmissions