Cargando…

Neural networks versus Logistic regression for 30 days all-cause readmission prediction

Heart failure (HF) is one of the leading causes of hospital admissions in the US. Readmission within 30 days after a HF hospitalization is both a recognized indicator for disease progression and a source of considerable financial burden to the healthcare system. Consequently, the identification of p...

Descripción completa

Detalles Bibliográficos
Autores principales: Allam, Ahmed, Nagy, Mate, Thoma, George, Krauthammer, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6595068/
https://www.ncbi.nlm.nih.gov/pubmed/31243311
http://dx.doi.org/10.1038/s41598-019-45685-z
_version_ 1783430337745911808
author Allam, Ahmed
Nagy, Mate
Thoma, George
Krauthammer, Michael
author_facet Allam, Ahmed
Nagy, Mate
Thoma, George
Krauthammer, Michael
author_sort Allam, Ahmed
collection PubMed
description Heart failure (HF) is one of the leading causes of hospital admissions in the US. Readmission within 30 days after a HF hospitalization is both a recognized indicator for disease progression and a source of considerable financial burden to the healthcare system. Consequently, the identification of patients at risk for readmission is a key step in improving disease management and patient outcome. In this work, we used a large administrative claims dataset to (1) explore the systematic application of neural network-based models versus logistic regression for predicting 30 days all-cause readmission after discharge from a HF admission, and (2) to examine the additive value of patients’ hospitalization timelines on prediction performance. Based on data from 272,778 (49% female) patients with a mean (SD) age of 73 years (14) and 343,328 HF admissions (67% of total admissions), we trained and tested our predictive readmission models following a stratified 5-fold cross-validation scheme. Among the deep learning approaches, a recurrent neural network (RNN) combined with conditional random fields (CRF) model (RNNCRF) achieved the best performance in readmission prediction with 0.642 AUC (95% CI, 0.640–0.645). Other models, such as those based on RNN, convolutional neural networks and CRF alone had lower performance, with a non-timeline based model (MLP) performing worst. A competitive model based on logistic regression with LASSO achieved a performance of 0.643 AUC (95% CI, 0.640–0.646). We conclude that data from patient timelines improve 30 day readmission prediction, that a logistic regression with LASSO has equal performance to the best neural network model and that the use of administrative data result in competitive performance compared to published approaches based on richer clinical datasets.
format Online
Article
Text
id pubmed-6595068
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-65950682019-07-03 Neural networks versus Logistic regression for 30 days all-cause readmission prediction Allam, Ahmed Nagy, Mate Thoma, George Krauthammer, Michael Sci Rep Article Heart failure (HF) is one of the leading causes of hospital admissions in the US. Readmission within 30 days after a HF hospitalization is both a recognized indicator for disease progression and a source of considerable financial burden to the healthcare system. Consequently, the identification of patients at risk for readmission is a key step in improving disease management and patient outcome. In this work, we used a large administrative claims dataset to (1) explore the systematic application of neural network-based models versus logistic regression for predicting 30 days all-cause readmission after discharge from a HF admission, and (2) to examine the additive value of patients’ hospitalization timelines on prediction performance. Based on data from 272,778 (49% female) patients with a mean (SD) age of 73 years (14) and 343,328 HF admissions (67% of total admissions), we trained and tested our predictive readmission models following a stratified 5-fold cross-validation scheme. Among the deep learning approaches, a recurrent neural network (RNN) combined with conditional random fields (CRF) model (RNNCRF) achieved the best performance in readmission prediction with 0.642 AUC (95% CI, 0.640–0.645). Other models, such as those based on RNN, convolutional neural networks and CRF alone had lower performance, with a non-timeline based model (MLP) performing worst. A competitive model based on logistic regression with LASSO achieved a performance of 0.643 AUC (95% CI, 0.640–0.646). We conclude that data from patient timelines improve 30 day readmission prediction, that a logistic regression with LASSO has equal performance to the best neural network model and that the use of administrative data result in competitive performance compared to published approaches based on richer clinical datasets. Nature Publishing Group UK 2019-06-26 /pmc/articles/PMC6595068/ /pubmed/31243311 http://dx.doi.org/10.1038/s41598-019-45685-z Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Allam, Ahmed
Nagy, Mate
Thoma, George
Krauthammer, Michael
Neural networks versus Logistic regression for 30 days all-cause readmission prediction
title Neural networks versus Logistic regression for 30 days all-cause readmission prediction
title_full Neural networks versus Logistic regression for 30 days all-cause readmission prediction
title_fullStr Neural networks versus Logistic regression for 30 days all-cause readmission prediction
title_full_unstemmed Neural networks versus Logistic regression for 30 days all-cause readmission prediction
title_short Neural networks versus Logistic regression for 30 days all-cause readmission prediction
title_sort neural networks versus logistic regression for 30 days all-cause readmission prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6595068/
https://www.ncbi.nlm.nih.gov/pubmed/31243311
http://dx.doi.org/10.1038/s41598-019-45685-z
work_keys_str_mv AT allamahmed neuralnetworksversuslogisticregressionfor30daysallcausereadmissionprediction
AT nagymate neuralnetworksversuslogisticregressionfor30daysallcausereadmissionprediction
AT thomageorge neuralnetworksversuslogisticregressionfor30daysallcausereadmissionprediction
AT krauthammermichael neuralnetworksversuslogisticregressionfor30daysallcausereadmissionprediction