Cargando…

Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining

BACKGROUND: The hospital admission rate is high in patients treated with peritoneal dialysis (PD), and the length of stay (LOS) in the hospital is a key indicator of medical resource allocation. This study aimed to develop a scoring tool for predicting prolonged LOS (pLOS) in PD patients by combinin...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Jingyi, Kong, Guilan, Lin, Yu, Chu, Hong, Yang, Chao, Shi, Ying, Wang, Haibo, Zhang, Luxia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7723539/
https://www.ncbi.nlm.nih.gov/pubmed/33313182
http://dx.doi.org/10.21037/atm-20-1006
_version_ 1783620359918977024
author Wu, Jingyi
Kong, Guilan
Lin, Yu
Chu, Hong
Yang, Chao
Shi, Ying
Wang, Haibo
Zhang, Luxia
author_facet Wu, Jingyi
Kong, Guilan
Lin, Yu
Chu, Hong
Yang, Chao
Shi, Ying
Wang, Haibo
Zhang, Luxia
author_sort Wu, Jingyi
collection PubMed
description BACKGROUND: The hospital admission rate is high in patients treated with peritoneal dialysis (PD), and the length of stay (LOS) in the hospital is a key indicator of medical resource allocation. This study aimed to develop a scoring tool for predicting prolonged LOS (pLOS) in PD patients by combining machine learning and traditional logistic regression (LR). METHODS: This study was based on patient data collected using the Hospital Quality Monitoring System (HQMS) in China. Three machine learning methods, classification and regression tree (CART), random forest (RF), and gradient boosting decision tree (GBDT), were used to develop models to predict pLOS, which is longer than the average LOS, in PD patients. The model with the best prediction performance was used to identify predictive factors contributing to the outcome. A multivariate LR model based on the identified predictors was then built to derive the score assigned to each predictor. Finally, a scoring tool was developed, and it was tested by stratifying PD patients into different pLOS risk groups. RESULTS: A total of 22,859 PD patients were included in our study, with 25.2% having pLOS. Among the three machine learning models, the RF model achieved the best prediction performance and thus was used to identify the 10 most predictive variables for building the scoring system. The multivariate LR model based on the identified predictors showed good discrimination power with an AUROC of 0.721 in the test dataset, and its coefficients were used as a basis for scoring tool development. On the basis of the developed scoring tool, PD patients were divided into three groups: low risk (≤5), median risk [5–10], and high risk (>10). The observed pLOS proportions in the low-risk, median-risk, and high-risk groups in the test dataset were 11.4%, 29.5%, and 54.7%, respectively. CONCLUSIONS: This study developed a scoring tool to predict pLOS in PD patients. The scoring tool can effectively discriminate patients with different pLOS risks and be easily implemented in clinical practice. The pLOS scoring tool has a great potential to help physicians allocate medical resources optimally and achieve improved clinical outcomes.
format Online
Article
Text
id pubmed-7723539
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-77235392020-12-10 Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining Wu, Jingyi Kong, Guilan Lin, Yu Chu, Hong Yang, Chao Shi, Ying Wang, Haibo Zhang, Luxia Ann Transl Med Original Article BACKGROUND: The hospital admission rate is high in patients treated with peritoneal dialysis (PD), and the length of stay (LOS) in the hospital is a key indicator of medical resource allocation. This study aimed to develop a scoring tool for predicting prolonged LOS (pLOS) in PD patients by combining machine learning and traditional logistic regression (LR). METHODS: This study was based on patient data collected using the Hospital Quality Monitoring System (HQMS) in China. Three machine learning methods, classification and regression tree (CART), random forest (RF), and gradient boosting decision tree (GBDT), were used to develop models to predict pLOS, which is longer than the average LOS, in PD patients. The model with the best prediction performance was used to identify predictive factors contributing to the outcome. A multivariate LR model based on the identified predictors was then built to derive the score assigned to each predictor. Finally, a scoring tool was developed, and it was tested by stratifying PD patients into different pLOS risk groups. RESULTS: A total of 22,859 PD patients were included in our study, with 25.2% having pLOS. Among the three machine learning models, the RF model achieved the best prediction performance and thus was used to identify the 10 most predictive variables for building the scoring system. The multivariate LR model based on the identified predictors showed good discrimination power with an AUROC of 0.721 in the test dataset, and its coefficients were used as a basis for scoring tool development. On the basis of the developed scoring tool, PD patients were divided into three groups: low risk (≤5), median risk [5–10], and high risk (>10). The observed pLOS proportions in the low-risk, median-risk, and high-risk groups in the test dataset were 11.4%, 29.5%, and 54.7%, respectively. CONCLUSIONS: This study developed a scoring tool to predict pLOS in PD patients. The scoring tool can effectively discriminate patients with different pLOS risks and be easily implemented in clinical practice. The pLOS scoring tool has a great potential to help physicians allocate medical resources optimally and achieve improved clinical outcomes. AME Publishing Company 2020-11 /pmc/articles/PMC7723539/ /pubmed/33313182 http://dx.doi.org/10.21037/atm-20-1006 Text en 2020 Annals of Translational Medicine. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Original Article
Wu, Jingyi
Kong, Guilan
Lin, Yu
Chu, Hong
Yang, Chao
Shi, Ying
Wang, Haibo
Zhang, Luxia
Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining
title Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining
title_full Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining
title_fullStr Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining
title_full_unstemmed Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining
title_short Development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining
title_sort development of a scoring tool for predicting prolonged length of hospital stay in peritoneal dialysis patients through data mining
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7723539/
https://www.ncbi.nlm.nih.gov/pubmed/33313182
http://dx.doi.org/10.21037/atm-20-1006
work_keys_str_mv AT wujingyi developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining
AT kongguilan developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining
AT linyu developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining
AT chuhong developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining
AT yangchao developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining
AT shiying developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining
AT wanghaibo developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining
AT zhangluxia developmentofascoringtoolforpredictingprolongedlengthofhospitalstayinperitonealdialysispatientsthroughdatamining