Cargando…
Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database
BACKGROUND: Hospital readmissions for pneumonia are a growing concern in the US, with significant consequences for costs and quality of care. This study developed the rule-based model and other machine learning (ML) models to predict 30-day readmission risk in patients with pneumonia and compared mo...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9643900/ https://www.ncbi.nlm.nih.gov/pubmed/36352392 http://dx.doi.org/10.1186/s12911-022-01995-3 |
_version_ | 1784826622755471360 |
---|---|
author | Huang, Yinan Talwar, Ashna Lin, Ying Aparasu, Rajender R. |
author_facet | Huang, Yinan Talwar, Ashna Lin, Ying Aparasu, Rajender R. |
author_sort | Huang, Yinan |
collection | PubMed |
description | BACKGROUND: Hospital readmissions for pneumonia are a growing concern in the US, with significant consequences for costs and quality of care. This study developed the rule-based model and other machine learning (ML) models to predict 30-day readmission risk in patients with pneumonia and compared model performance. METHODS: This population-based study involved patients aged ≥ 18 years hospitalized with pneumonia from January 1, 2016, through November 30, 2016, using the Healthcare Cost and Utilization Project-National Readmission Database (HCUP-NRD). Rule-based algorithms and other ML algorithms, specifically decision trees, random forest, extreme gradient descent boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator (LASSO), were used to model all-cause readmissions 30 days post-discharge from index pneumonia hospitalization. A total of 61 clinically relevant variables were included for ML model development. Models were trained on randomly partitioned 50% of the data and evaluated using the remaining dataset. Model hyperparameters were tuned using the ten-fold cross-validation on the resampled training dataset. The area under the receiver operating curves (AUROC) and area under precision-recall curves (AUPRC) were calculated for the testing set to evaluate the model performance. RESULTS: Of the 372,293 patients with an index hospital hospitalization for pneumonia, 48,280 (12.97%) were readmitted within 30 days. Judged by AUROC in the testing data, rule-based model (0.6591) significantly outperformed decision tree (0.5783, p value < 0.001), random forest (0.6509, p value < 0.01) and LASSO (0.6087, p value < 0.001), but was less superior than XGBoost (0.6606, p value = 0.015). The AUPRC of the rule-based model in the testing data (0.2146) was higher than the decision tree (0.1560), random forest (0.2052), and LASSO (0.2042), but was similar to XGBoost (0.2147). The top risk-predictive rules captured by the rule-based algorithm were comorbidities, illness severity, disposition locations, payer type, age, and length of stay. These predictive risk factors were also identified by other ML models with high variable importance. CONCLUSION: The performance of machine learning models for predicting readmission in pneumonia patients varied. The XGboost was better than the rule-based model based on the AUROC. However, important risk factors for predicting readmission remained consistent across ML models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01995-3. |
format | Online Article Text |
id | pubmed-9643900 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-96439002022-11-14 Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database Huang, Yinan Talwar, Ashna Lin, Ying Aparasu, Rajender R. BMC Med Inform Decis Mak Research BACKGROUND: Hospital readmissions for pneumonia are a growing concern in the US, with significant consequences for costs and quality of care. This study developed the rule-based model and other machine learning (ML) models to predict 30-day readmission risk in patients with pneumonia and compared model performance. METHODS: This population-based study involved patients aged ≥ 18 years hospitalized with pneumonia from January 1, 2016, through November 30, 2016, using the Healthcare Cost and Utilization Project-National Readmission Database (HCUP-NRD). Rule-based algorithms and other ML algorithms, specifically decision trees, random forest, extreme gradient descent boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator (LASSO), were used to model all-cause readmissions 30 days post-discharge from index pneumonia hospitalization. A total of 61 clinically relevant variables were included for ML model development. Models were trained on randomly partitioned 50% of the data and evaluated using the remaining dataset. Model hyperparameters were tuned using the ten-fold cross-validation on the resampled training dataset. The area under the receiver operating curves (AUROC) and area under precision-recall curves (AUPRC) were calculated for the testing set to evaluate the model performance. RESULTS: Of the 372,293 patients with an index hospital hospitalization for pneumonia, 48,280 (12.97%) were readmitted within 30 days. Judged by AUROC in the testing data, rule-based model (0.6591) significantly outperformed decision tree (0.5783, p value < 0.001), random forest (0.6509, p value < 0.01) and LASSO (0.6087, p value < 0.001), but was less superior than XGBoost (0.6606, p value = 0.015). The AUPRC of the rule-based model in the testing data (0.2146) was higher than the decision tree (0.1560), random forest (0.2052), and LASSO (0.2042), but was similar to XGBoost (0.2147). The top risk-predictive rules captured by the rule-based algorithm were comorbidities, illness severity, disposition locations, payer type, age, and length of stay. These predictive risk factors were also identified by other ML models with high variable importance. CONCLUSION: The performance of machine learning models for predicting readmission in pneumonia patients varied. The XGboost was better than the rule-based model based on the AUROC. However, important risk factors for predicting readmission remained consistent across ML models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01995-3. BioMed Central 2022-11-09 /pmc/articles/PMC9643900/ /pubmed/36352392 http://dx.doi.org/10.1186/s12911-022-01995-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Huang, Yinan Talwar, Ashna Lin, Ying Aparasu, Rajender R. Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database |
title | Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database |
title_full | Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database |
title_fullStr | Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database |
title_full_unstemmed | Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database |
title_short | Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database |
title_sort | machine learning methods to predict 30-day hospital readmission outcome among us adults with pneumonia: analysis of the national readmission database |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9643900/ https://www.ncbi.nlm.nih.gov/pubmed/36352392 http://dx.doi.org/10.1186/s12911-022-01995-3 |
work_keys_str_mv | AT huangyinan machinelearningmethodstopredict30dayhospitalreadmissionoutcomeamongusadultswithpneumoniaanalysisofthenationalreadmissiondatabase AT talwarashna machinelearningmethodstopredict30dayhospitalreadmissionoutcomeamongusadultswithpneumoniaanalysisofthenationalreadmissiondatabase AT linying machinelearningmethodstopredict30dayhospitalreadmissionoutcomeamongusadultswithpneumoniaanalysisofthenationalreadmissiondatabase AT aparasurajenderr machinelearningmethodstopredict30dayhospitalreadmissionoutcomeamongusadultswithpneumoniaanalysisofthenationalreadmissiondatabase |