Cargando…
Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions
Introduction Emergency readmissions have been a long-time, multifaceted, unsolved problem. Developing a predictive model calibrated with hospital-specific Electronic Health Record (EHR) data could give higher prediction accuracy and insights into high-risk patients for readmission. Thus, we need to...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cureus
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9481186/ https://www.ncbi.nlm.nih.gov/pubmed/36127978 http://dx.doi.org/10.7759/cureus.27630 |
_version_ | 1784791207724974080 |
---|---|
author | Aldhoayan, Mohammed D Khayat, Afnan M |
author_facet | Aldhoayan, Mohammed D Khayat, Afnan M |
author_sort | Aldhoayan, Mohammed D |
collection | PubMed |
description | Introduction Emergency readmissions have been a long-time, multifaceted, unsolved problem. Developing a predictive model calibrated with hospital-specific Electronic Health Record (EHR) data could give higher prediction accuracy and insights into high-risk patients for readmission. Thus, we need to proactively introduce the necessary interventions. This study aims to investigate the relationship between features that consider significant predictors of at-risk patients for seven-day readmission through logistic regression in addition to developing several machine learning models to test the predictability of those attributes using EHR data in a Saudi Arabia-specific ED context. Methods Univariate and multivariate logistic regression has been used to identify the most statistically significant features that contributed to classifying readmitted and not readmitted patients. Seven different machine learning models were trained and tested, and a comparison between the best-performing model was conducted in terms of five performance metrics. To construct the prediction model and internally validate it, the processed dataset was split into two sets: 70% for the training set and 30% for the test set or validation set. Results XGBoost achieved the highest accuracy (64%) in predicting early seven-day readmissions. Catboost was the second-best predictive model at 61%. XGBoost achieved the highest specificity at 70%, and all the models had a sensitivity of 57% except for XGBoost and Catboost at 32% and 38%, respectively. All predictive attributes, patient age, length of stay (LOS) in minutes, visit time (AM), marital status (married), number of medications, and number of abnormal lab results were significant predictors of early seven-day readmissions while marital status and number of vital-sign instabilities at discharge were not statistically significant predictors of seven-day readmission. Conclusion Although XGBoost and Catboost showed good accuracy, none of the models achieved good discriminative ability in terms of sensitivity and specificity. Thus, none can be clinically used for predicting early seven-day readmission. More predictive variables need to be fed into the model, specifically predictors approximate to the day of discharge, in order to optimize the model’s performance. |
format | Online Article Text |
id | pubmed-9481186 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Cureus |
record_format | MEDLINE/PubMed |
spelling | pubmed-94811862022-09-19 Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions Aldhoayan, Mohammed D Khayat, Afnan M Cureus Emergency Medicine Introduction Emergency readmissions have been a long-time, multifaceted, unsolved problem. Developing a predictive model calibrated with hospital-specific Electronic Health Record (EHR) data could give higher prediction accuracy and insights into high-risk patients for readmission. Thus, we need to proactively introduce the necessary interventions. This study aims to investigate the relationship between features that consider significant predictors of at-risk patients for seven-day readmission through logistic regression in addition to developing several machine learning models to test the predictability of those attributes using EHR data in a Saudi Arabia-specific ED context. Methods Univariate and multivariate logistic regression has been used to identify the most statistically significant features that contributed to classifying readmitted and not readmitted patients. Seven different machine learning models were trained and tested, and a comparison between the best-performing model was conducted in terms of five performance metrics. To construct the prediction model and internally validate it, the processed dataset was split into two sets: 70% for the training set and 30% for the test set or validation set. Results XGBoost achieved the highest accuracy (64%) in predicting early seven-day readmissions. Catboost was the second-best predictive model at 61%. XGBoost achieved the highest specificity at 70%, and all the models had a sensitivity of 57% except for XGBoost and Catboost at 32% and 38%, respectively. All predictive attributes, patient age, length of stay (LOS) in minutes, visit time (AM), marital status (married), number of medications, and number of abnormal lab results were significant predictors of early seven-day readmissions while marital status and number of vital-sign instabilities at discharge were not statistically significant predictors of seven-day readmission. Conclusion Although XGBoost and Catboost showed good accuracy, none of the models achieved good discriminative ability in terms of sensitivity and specificity. Thus, none can be clinically used for predicting early seven-day readmission. More predictive variables need to be fed into the model, specifically predictors approximate to the day of discharge, in order to optimize the model’s performance. Cureus 2022-08-03 /pmc/articles/PMC9481186/ /pubmed/36127978 http://dx.doi.org/10.7759/cureus.27630 Text en Copyright © 2022, Aldhoayan et al. https://creativecommons.org/licenses/by/3.0/This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Emergency Medicine Aldhoayan, Mohammed D Khayat, Afnan M Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions |
title | Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions |
title_full | Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions |
title_fullStr | Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions |
title_full_unstemmed | Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions |
title_short | Leveraging Advanced Data Analytics to Predict the Risk of All-Cause Seven-Day Emergency Readmissions |
title_sort | leveraging advanced data analytics to predict the risk of all-cause seven-day emergency readmissions |
topic | Emergency Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9481186/ https://www.ncbi.nlm.nih.gov/pubmed/36127978 http://dx.doi.org/10.7759/cureus.27630 |
work_keys_str_mv | AT aldhoayanmohammedd leveragingadvanceddataanalyticstopredicttheriskofallcausesevendayemergencyreadmissions AT khayatafnanm leveragingadvanceddataanalyticstopredicttheriskofallcausesevendayemergencyreadmissions |