Cargando…
A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong
BACKGROUND: Accurately estimating elderly patients’ rehospitalisation risk benefits clinical decisions and service planning. However, research in rehospitalisation and repeated hospitalisation yielded only models with modest performance, and the model performance deteriorates rapidly as the predicti...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837949/ https://www.ncbi.nlm.nih.gov/pubmed/36639745 http://dx.doi.org/10.1186/s12874-022-01824-1 |
_version_ | 1784869180666806272 |
---|---|
author | Guan, Jingjing Leung, Eman Kwok, Kin-on Chen, Frank Youhua |
author_facet | Guan, Jingjing Leung, Eman Kwok, Kin-on Chen, Frank Youhua |
author_sort | Guan, Jingjing |
collection | PubMed |
description | BACKGROUND: Accurately estimating elderly patients’ rehospitalisation risk benefits clinical decisions and service planning. However, research in rehospitalisation and repeated hospitalisation yielded only models with modest performance, and the model performance deteriorates rapidly as the prediction timeframe expands beyond 28 days and for older participants. METHODS: A temporal zero-inflated Poisson (tZIP) regression model was developed and validated retrospectively and prospectively. The data of the electronic health records (EHRs) contain cohorts (aged 60+) in a major public hospital in Hong Kong. Two temporal offset functions accounted for the associations between exposure time and parameters corresponding to the zero-inflated logistic component and the Poisson distribution’s expected count. tZIP was externally validated with a retrospective cohort’s rehospitalisation events up to 12 months after the discharge date. Subsequently, tZIP was validated prospectively after piloting its implementation at the study hospital. Patients discharged within the pilot period were tagged, and the proposed model’s prediction of their rehospitalisation was verified monthly. Using a hybrid machine learning (ML) approach, the tZIP-based risk estimator’s marginal effect on 28-day rehospitalisation was further validated, competing with other factors representing different post-acute and clinical statuses. RESULTS: The tZIP prediction of rehospitalisation from 28 days to 365 days was achieved at above 80% discrimination accuracy retrospectively and prospectively in two out-of-sample cohorts. With a large margin, it outperformed the Cox proportional and linear models built with the same predictors. The hybrid ML revealed that the risk estimator’s contribution to 28-day rehospitalisation outweighed other features relevant to service utilisation and clinical status. CONCLUSIONS: A novel rehospitalisation risk model was introduced, and its risk estimators, whose importance outweighed all other factors of diverse post-acute care and clinical conditions, were derived. The proposed approach relies on four easily accessible variables easily extracted from EHR. Thus, clinicians could visualise patients’ rehospitalisation risk from 28 days to 365 days after discharge and screen high-risk older patients for follow-up care at the proper time. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01824-1. |
format | Online Article Text |
id | pubmed-9837949 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-98379492023-01-14 A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong Guan, Jingjing Leung, Eman Kwok, Kin-on Chen, Frank Youhua BMC Med Res Methodol Research BACKGROUND: Accurately estimating elderly patients’ rehospitalisation risk benefits clinical decisions and service planning. However, research in rehospitalisation and repeated hospitalisation yielded only models with modest performance, and the model performance deteriorates rapidly as the prediction timeframe expands beyond 28 days and for older participants. METHODS: A temporal zero-inflated Poisson (tZIP) regression model was developed and validated retrospectively and prospectively. The data of the electronic health records (EHRs) contain cohorts (aged 60+) in a major public hospital in Hong Kong. Two temporal offset functions accounted for the associations between exposure time and parameters corresponding to the zero-inflated logistic component and the Poisson distribution’s expected count. tZIP was externally validated with a retrospective cohort’s rehospitalisation events up to 12 months after the discharge date. Subsequently, tZIP was validated prospectively after piloting its implementation at the study hospital. Patients discharged within the pilot period were tagged, and the proposed model’s prediction of their rehospitalisation was verified monthly. Using a hybrid machine learning (ML) approach, the tZIP-based risk estimator’s marginal effect on 28-day rehospitalisation was further validated, competing with other factors representing different post-acute and clinical statuses. RESULTS: The tZIP prediction of rehospitalisation from 28 days to 365 days was achieved at above 80% discrimination accuracy retrospectively and prospectively in two out-of-sample cohorts. With a large margin, it outperformed the Cox proportional and linear models built with the same predictors. The hybrid ML revealed that the risk estimator’s contribution to 28-day rehospitalisation outweighed other features relevant to service utilisation and clinical status. CONCLUSIONS: A novel rehospitalisation risk model was introduced, and its risk estimators, whose importance outweighed all other factors of diverse post-acute care and clinical conditions, were derived. The proposed approach relies on four easily accessible variables easily extracted from EHR. Thus, clinicians could visualise patients’ rehospitalisation risk from 28 days to 365 days after discharge and screen high-risk older patients for follow-up care at the proper time. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01824-1. BioMed Central 2023-01-13 /pmc/articles/PMC9837949/ /pubmed/36639745 http://dx.doi.org/10.1186/s12874-022-01824-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Guan, Jingjing Leung, Eman Kwok, Kin-on Chen, Frank Youhua A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong |
title | A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong |
title_full | A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong |
title_fullStr | A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong |
title_full_unstemmed | A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong |
title_short | A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong |
title_sort | hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in hong kong |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837949/ https://www.ncbi.nlm.nih.gov/pubmed/36639745 http://dx.doi.org/10.1186/s12874-022-01824-1 |
work_keys_str_mv | AT guanjingjing ahybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong AT leungeman ahybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong AT kwokkinon ahybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong AT chenfrankyouhua ahybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong AT guanjingjing hybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong AT leungeman hybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong AT kwokkinon hybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong AT chenfrankyouhua hybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong |