Cargando…
Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining
BACKGROUND: The hospital admission rate is high in patients treated with peritoneal dialysis (PD), and the length of stay (LOS) in the hospital is a key indicator of medical resource allocation. This study aimed to develop a scoring tool for predicting prolonged LOS (pLOS) in PD patients by combinin...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7723539/ https://www.ncbi.nlm.nih.gov/pubmed/33313182 http://dx.doi.org/10.21037/atm-20-1006 |
_version_ | 1783620359918977024 |
---|---|
author | Wu, Jingyi Kong, Guilan Lin, Yu Chu, Hong Yang, Chao Shi, Ying Wang, Haibo Zhang, Luxia |
author_facet | Wu, Jingyi Kong, Guilan Lin, Yu Chu, Hong Yang, Chao Shi, Ying Wang, Haibo Zhang, Luxia |
author_sort | Wu, Jingyi |
collection | PubMed |
description | BACKGROUND: The hospital admission rate is high in patients treated with peritoneal dialysis (PD), and the length of stay (LOS) in the hospital is a key indicator of medical resource allocation. This study aimed to develop a scoring tool for predicting prolonged LOS (pLOS) in PD patients by combining machine learning and traditional logistic regression (LR). METHODS: This study was based on patient data collected using the Hospital Quality Monitoring System (HQMS) in China. Three machine learning methods, classification and regression tree (CART), random forest (RF), and gradient boosting decision tree (GBDT), were used to develop models to predict pLOS, which is longer than the average LOS, in PD patients. The model with the best prediction performance was used to identify predictive factors contributing to the outcome. A multivariate LR model based on the identified predictors was then built to derive the score assigned to each predictor. Finally, a scoring tool was developed, and it was tested by stratifying PD patients into different pLOS risk groups. RESULTS: A total of 22,859 PD patients were included in our study, with 25.2% having pLOS. Among the three machine learning models, the RF model achieved the best prediction performance and thus was used to identify the 10 most predictive variables for building the scoring system. The multivariate LR model based on the identified predictors showed good discrimination power with an AUROC of 0.721 in the test dataset, and its coefficients were used as a basis for scoring tool development. On the basis of the developed scoring tool, PD patients were divided into three groups: low risk (≤5), median risk [5–10], and high risk (>10). The observed pLOS proportions in the low-risk, median-risk, and high-risk groups in the test dataset were 11.4%, 29.5%, and 54.7%, respectively. CONCLUSIONS: This study developed a scoring tool to predict pLOS in PD patients. The scoring tool can effectively discriminate patients with different pLOS risks and be easily implemented in clinical practice. The pLOS scoring tool has a great potential to help physicians allocate medical resources optimally and achieve improved clinical outcomes. |
format | Online Article Text |
id | pubmed-7723539 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-77235392020-12-10 Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining Wu, Jingyi Kong, Guilan Lin, Yu Chu, Hong Yang, Chao Shi, Ying Wang, Haibo Zhang, Luxia Ann Transl Med Original Article BACKGROUND: The hospital admission rate is high in patients treated with peritoneal dialysis (PD), and the length of stay (LOS) in the hospital is a key indicator of medical resource allocation. This study aimed to develop a scoring tool for predicting prolonged LOS (pLOS) in PD patients by combining machine learning and traditional logistic regression (LR). METHODS: This study was based on patient data collected using the Hospital Quality Monitoring System (HQMS) in China. Three machine learning methods, classification and regression tree (CART), random forest (RF), and gradient boosting decision tree (GBDT), were used to develop models to predict pLOS, which is longer than the average LOS, in PD patients. The model with the best prediction performance was used to identify predictive factors contributing to the outcome. A multivariate LR model based on the identified predictors was then built to derive the score assigned to each predictor. Finally, a scoring tool was developed, and it was tested by stratifying PD patients into different pLOS risk groups. RESULTS: A total of 22,859 PD patients were included in our study, with 25.2% having pLOS. Among the three machine learning models, the RF model achieved the best prediction performance and thus was used to identify the 10 most predictive variables for building the scoring system. The multivariate LR model based on the identified predictors showed good discrimination power with an AUROC of 0.721 in the test dataset, and its coefficients were used as a basis for scoring tool development. On the basis of the developed scoring tool, PD patients were divided into three groups: low risk (≤5), median risk [5–10], and high risk (>10). The observed pLOS proportions in the low-risk, median-risk, and high-risk groups in the test dataset were 11.4%, 29.5%, and 54.7%, respectively. CONCLUSIONS: This study developed a scoring tool to predict pLOS in PD patients. The scoring tool can effectively discriminate patients with different pLOS risks and be easily implemented in clinical practice. The pLOS scoring tool has a great potential to help physicians allocate medical resources optimally and achieve improved clinical outcomes. AME Publishing Company 2020-11 /pmc/articles/PMC7723539/ /pubmed/33313182 http://dx.doi.org/10.21037/atm-20-1006 Text en 2020 Annals of Translational Medicine. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Wu, Jingyi Kong, Guilan Lin, Yu Chu, Hong Yang, Chao Shi, Ying Wang, Haibo Zhang, Luxia Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining |
title | Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining |
title_full | Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining |
title_fullStr | Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining |
title_full_unstemmed | Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining |
title_short | Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining |
title_sort | development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7723539/ https://www.ncbi.nlm.nih.gov/pubmed/33313182 http://dx.doi.org/10.21037/atm-20-1006 |
work_keys_str_mv | AT wujingyi developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining AT kongguilan developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining AT linyu developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining AT chuhong developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining AT yangchao developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining AT shiying developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining AT wanghaibo developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining AT zhangluxia developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining |