Cargando…

Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database

BACKGROUND: Hospital readmissions for pneumonia are a growing concern in the US, with significant consequences for costs and quality of care. This study developed the rule-based model and other machine learning (ML) models to predict 30-day readmission risk in patients with pneumonia and compared mo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Huang, Yinan, Talwar, Ashna, Lin, Ying, Aparasu, Rajender R.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9643900/ https://www.ncbi.nlm.nih.gov/pubmed/36352392 http://dx.doi.org/10.1186/s12911-022-01995-3

_version_	1784826622755471360
author	Huang, Yinan Talwar, Ashna Lin, Ying Aparasu, Rajender R.
author_facet	Huang, Yinan Talwar, Ashna Lin, Ying Aparasu, Rajender R.
author_sort	Huang, Yinan
collection	PubMed
description	BACKGROUND: Hospital readmissions for pneumonia are a growing concern in the US, with significant consequences for costs and quality of care. This study developed the rule-based model and other machine learning (ML) models to predict 30-day readmission risk in patients with pneumonia and compared model performance. METHODS: This population-based study involved patients aged ≥ 18 years hospitalized with pneumonia from January 1, 2016, through November 30, 2016, using the Healthcare Cost and Utilization Project-National Readmission Database (HCUP-NRD). Rule-based algorithms and other ML algorithms, specifically decision trees, random forest, extreme gradient descent boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator (LASSO), were used to model all-cause readmissions 30 days post-discharge from index pneumonia hospitalization. A total of 61 clinically relevant variables were included for ML model development. Models were trained on randomly partitioned 50% of the data and evaluated using the remaining dataset. Model hyperparameters were tuned using the ten-fold cross-validation on the resampled training dataset. The area under the receiver operating curves (AUROC) and area under precision-recall curves (AUPRC) were calculated for the testing set to evaluate the model performance. RESULTS: Of the 372,293 patients with an index hospital hospitalization for pneumonia, 48,280 (12.97%) were readmitted within 30 days. Judged by AUROC in the testing data, rule-based model (0.6591) significantly outperformed decision tree (0.5783, p value < 0.001), random forest (0.6509, p value < 0.01) and LASSO (0.6087, p value < 0.001), but was less superior than XGBoost (0.6606, p value = 0.015). The AUPRC of the rule-based model in the testing data (0.2146) was higher than the decision tree (0.1560), random forest (0.2052), and LASSO (0.2042), but was similar to XGBoost (0.2147). The top risk-predictive rules captured by the rule-based algorithm were comorbidities, illness severity, disposition locations, payer type, age, and length of stay. These predictive risk factors were also identified by other ML models with high variable importance. CONCLUSION: The performance of machine learning models for predicting readmission in pneumonia patients varied. The XGboost was better than the rule-based model based on the AUROC. However, important risk factors for predicting readmission remained consistent across ML models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01995-3.
format	Online Article Text
id	pubmed-9643900
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-96439002022-11-14 Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database Huang, Yinan Talwar, Ashna Lin, Ying Aparasu, Rajender R. BMC Med Inform Decis Mak Research BACKGROUND: Hospital readmissions for pneumonia are a growing concern in the US, with significant consequences for costs and quality of care. This study developed the rule-based model and other machine learning (ML) models to predict 30-day readmission risk in patients with pneumonia and compared model performance. METHODS: This population-based study involved patients aged ≥ 18 years hospitalized with pneumonia from January 1, 2016, through November 30, 2016, using the Healthcare Cost and Utilization Project-National Readmission Database (HCUP-NRD). Rule-based algorithms and other ML algorithms, specifically decision trees, random forest, extreme gradient descent boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator (LASSO), were used to model all-cause readmissions 30 days post-discharge from index pneumonia hospitalization. A total of 61 clinically relevant variables were included for ML model development. Models were trained on randomly partitioned 50% of the data and evaluated using the remaining dataset. Model hyperparameters were tuned using the ten-fold cross-validation on the resampled training dataset. The area under the receiver operating curves (AUROC) and area under precision-recall curves (AUPRC) were calculated for the testing set to evaluate the model performance. RESULTS: Of the 372,293 patients with an index hospital hospitalization for pneumonia, 48,280 (12.97%) were readmitted within 30 days. Judged by AUROC in the testing data, rule-based model (0.6591) significantly outperformed decision tree (0.5783, p value < 0.001), random forest (0.6509, p value < 0.01) and LASSO (0.6087, p value < 0.001), but was less superior than XGBoost (0.6606, p value = 0.015). The AUPRC of the rule-based model in the testing data (0.2146) was higher than the decision tree (0.1560), random forest (0.2052), and LASSO (0.2042), but was similar to XGBoost (0.2147). The top risk-predictive rules captured by the rule-based algorithm were comorbidities, illness severity, disposition locations, payer type, age, and length of stay. These predictive risk factors were also identified by other ML models with high variable importance. CONCLUSION: The performance of machine learning models for predicting readmission in pneumonia patients varied. The XGboost was better than the rule-based model based on the AUROC. However, important risk factors for predicting readmission remained consistent across ML models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01995-3. BioMed Central 2022-11-09 /pmc/articles/PMC9643900/ /pubmed/36352392 http://dx.doi.org/10.1186/s12911-022-01995-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Huang, Yinan Talwar, Ashna Lin, Ying Aparasu, Rajender R. Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database
title	Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database
title_full	Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database
title_fullStr	Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database
title_full_unstemmed	Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database
title_short	Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database
title_sort	machine learning methods to predict 30-day hospital readmission outcome among us adults with pneumonia: analysis of the national readmission database
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9643900/ https://www.ncbi.nlm.nih.gov/pubmed/36352392 http://dx.doi.org/10.1186/s12911-022-01995-3
work_keys_str_mv	AT huangyinan machinelearningmethodstopredict30dayhospitalreadmissionoutcomeamongusadultswithpneumoniaanalysisofthenationalreadmissiondatabase AT talwarashna machinelearningmethodstopredict30dayhospitalreadmissionoutcomeamongusadultswithpneumoniaanalysisofthenationalreadmissiondatabase AT linying machinelearningmethodstopredict30dayhospitalreadmissionoutcomeamongusadultswithpneumoniaanalysisofthenationalreadmissiondatabase AT aparasurajenderr machinelearningmethodstopredict30dayhospitalreadmissionoutcomeamongusadultswithpneumoniaanalysisofthenationalreadmissiondatabase

Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database

Ejemplares similares