Cargando…
Predicting 72-hour and 9-day return to the emergency department using machine learning
OBJECTIVES: To predict 72-h and 9-day emergency department (ED) return by using gradient boosting on an expansive set of clinical variables from the electronic health record. METHODS: This retrospective study included all adult discharges from a level 1 trauma center ED and a community hospital ED c...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6951979/ https://www.ncbi.nlm.nih.gov/pubmed/31984367 http://dx.doi.org/10.1093/jamiaopen/ooz019 |
_version_ | 1783486369031520256 |
---|---|
author | Hong, Woo Suk Haimovich, Adrian Daniel Taylor, Richard Andrew |
author_facet | Hong, Woo Suk Haimovich, Adrian Daniel Taylor, Richard Andrew |
author_sort | Hong, Woo Suk |
collection | PubMed |
description | OBJECTIVES: To predict 72-h and 9-day emergency department (ED) return by using gradient boosting on an expansive set of clinical variables from the electronic health record. METHODS: This retrospective study included all adult discharges from a level 1 trauma center ED and a community hospital ED covering the period of March 2013 to July 2017. A total of 1500 variables were extracted for each visit, and samples split randomly into training, validation, and test sets (80%, 10%, and 10%). Gradient boosting models were fit on 3 selections of the data: administrative data (demographics, prior hospital usage, and comorbidity categories), data available at triage, and the full set of data available at discharge. A logistic regression (LR) model built on administrative data was used for baseline comparison. Finally, the top 20 most informative variables identified from the full gradient boosting models were used to build a reduced model for each outcome. RESULTS: A total of 330 631 discharges were available for analysis, with 29 058 discharges (8.8%) resulting in 72-h return and 52 748 discharges (16.0%) resulting in 9-day return to either ED. LR models using administrative data yielded test AUCs of 0.69 (95% confidence interval [CI] 0.68–0.70) and 0.71(95% CI 0.70–0.72), while gradient boosting models using administrative data yielded test AUCs of 0.73 (95% CI 0.72–0.74) and 0.74 (95% CI 0.73–0.74) for 72-h and 9-day return, respectively. Gradient boosting models using variables available at triage yielded test AUCs of 0.75 (95% CI 0.74–0.76) and 0.75 (95% CI 0.74–0.75), while those using the full set of variables yielded test AUCs of 0.76 (95% CI 0.75–0.77) and 0.75 (95% CI 0.75–0.76). Reduced models using the top 20 variables yielded test AUCs of 0.73 (95% CI 0.71–0.74) and 0.73 (95% CI 0.72–0.74). DISCUSSION AND CONCLUSION: Gradient boosting models leveraging clinical data are superior to LR models built on administrative data at predicting 72-h and 9-day returns. |
format | Online Article Text |
id | pubmed-6951979 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-69519792020-01-24 Predicting 72-hour and 9-day return to the emergency department using machine learning Hong, Woo Suk Haimovich, Adrian Daniel Taylor, Richard Andrew JAMIA Open Research and Applications OBJECTIVES: To predict 72-h and 9-day emergency department (ED) return by using gradient boosting on an expansive set of clinical variables from the electronic health record. METHODS: This retrospective study included all adult discharges from a level 1 trauma center ED and a community hospital ED covering the period of March 2013 to July 2017. A total of 1500 variables were extracted for each visit, and samples split randomly into training, validation, and test sets (80%, 10%, and 10%). Gradient boosting models were fit on 3 selections of the data: administrative data (demographics, prior hospital usage, and comorbidity categories), data available at triage, and the full set of data available at discharge. A logistic regression (LR) model built on administrative data was used for baseline comparison. Finally, the top 20 most informative variables identified from the full gradient boosting models were used to build a reduced model for each outcome. RESULTS: A total of 330 631 discharges were available for analysis, with 29 058 discharges (8.8%) resulting in 72-h return and 52 748 discharges (16.0%) resulting in 9-day return to either ED. LR models using administrative data yielded test AUCs of 0.69 (95% confidence interval [CI] 0.68–0.70) and 0.71(95% CI 0.70–0.72), while gradient boosting models using administrative data yielded test AUCs of 0.73 (95% CI 0.72–0.74) and 0.74 (95% CI 0.73–0.74) for 72-h and 9-day return, respectively. Gradient boosting models using variables available at triage yielded test AUCs of 0.75 (95% CI 0.74–0.76) and 0.75 (95% CI 0.74–0.75), while those using the full set of variables yielded test AUCs of 0.76 (95% CI 0.75–0.77) and 0.75 (95% CI 0.75–0.76). Reduced models using the top 20 variables yielded test AUCs of 0.73 (95% CI 0.71–0.74) and 0.73 (95% CI 0.72–0.74). DISCUSSION AND CONCLUSION: Gradient boosting models leveraging clinical data are superior to LR models built on administrative data at predicting 72-h and 9-day returns. Oxford University Press 2019-07-01 /pmc/articles/PMC6951979/ /pubmed/31984367 http://dx.doi.org/10.1093/jamiaopen/ooz019 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research and Applications Hong, Woo Suk Haimovich, Adrian Daniel Taylor, Richard Andrew Predicting 72-hour and 9-day return to the emergency department using machine learning |
title | Predicting 72-hour and 9-day return to the emergency department using machine learning |
title_full | Predicting 72-hour and 9-day return to the emergency department using machine learning |
title_fullStr | Predicting 72-hour and 9-day return to the emergency department using machine learning |
title_full_unstemmed | Predicting 72-hour and 9-day return to the emergency department using machine learning |
title_short | Predicting 72-hour and 9-day return to the emergency department using machine learning |
title_sort | predicting 72-hour and 9-day return to the emergency department using machine learning |
topic | Research and Applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6951979/ https://www.ncbi.nlm.nih.gov/pubmed/31984367 http://dx.doi.org/10.1093/jamiaopen/ooz019 |
work_keys_str_mv | AT hongwoosuk predicting72hourand9dayreturntotheemergencydepartmentusingmachinelearning AT haimovichadriandaniel predicting72hourand9dayreturntotheemergencydepartmentusingmachinelearning AT taylorrichardandrew predicting72hourand9dayreturntotheemergencydepartmentusingmachinelearning |