Cargando…

A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong

BACKGROUND: Accurately estimating elderly patients’ rehospitalisation risk benefits clinical decisions and service planning. However, research in rehospitalisation and repeated hospitalisation yielded only models with modest performance, and the model performance deteriorates rapidly as the predicti...

Descripción completa

Detalles Bibliográficos
Autores principales: Guan, Jingjing, Leung, Eman, Kwok, Kin-on, Chen, Frank Youhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837949/
https://www.ncbi.nlm.nih.gov/pubmed/36639745
http://dx.doi.org/10.1186/s12874-022-01824-1
_version_ 1784869180666806272
author Guan, Jingjing
Leung, Eman
Kwok, Kin-on
Chen, Frank Youhua
author_facet Guan, Jingjing
Leung, Eman
Kwok, Kin-on
Chen, Frank Youhua
author_sort Guan, Jingjing
collection PubMed
description BACKGROUND: Accurately estimating elderly patients’ rehospitalisation risk benefits clinical decisions and service planning. However, research in rehospitalisation and repeated hospitalisation yielded only models with modest performance, and the model performance deteriorates rapidly as the prediction timeframe expands beyond 28 days and for older participants. METHODS: A temporal zero-inflated Poisson (tZIP) regression model was developed and validated retrospectively and prospectively. The data of the electronic health records (EHRs) contain cohorts (aged 60+) in a major public hospital in Hong Kong. Two temporal offset functions accounted for the associations between exposure time and parameters corresponding to the zero-inflated logistic component and the Poisson distribution’s expected count. tZIP was externally validated with a retrospective cohort’s rehospitalisation events up to 12 months after the discharge date. Subsequently, tZIP was validated prospectively after piloting its implementation at the study hospital. Patients discharged within the pilot period were tagged, and the proposed model’s prediction of their rehospitalisation was verified monthly. Using a hybrid machine learning (ML) approach, the tZIP-based risk estimator’s marginal effect on 28-day rehospitalisation was further validated, competing with other factors representing different post-acute and clinical statuses. RESULTS: The tZIP prediction of rehospitalisation from 28 days to 365 days was achieved at above 80% discrimination accuracy retrospectively and prospectively in two out-of-sample cohorts. With a large margin, it outperformed the Cox proportional and linear models built with the same predictors. The hybrid ML revealed that the risk estimator’s contribution to 28-day rehospitalisation outweighed other features relevant to service utilisation and clinical status. CONCLUSIONS: A novel rehospitalisation risk model was introduced, and its risk estimators, whose importance outweighed all other factors of diverse post-acute care and clinical conditions, were derived. The proposed approach relies on four easily accessible variables easily extracted from EHR. Thus, clinicians could visualise patients’ rehospitalisation risk from 28 days to 365 days after discharge and screen high-risk older patients for follow-up care at the proper time. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01824-1.
format Online
Article
Text
id pubmed-9837949
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-98379492023-01-14 A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong Guan, Jingjing Leung, Eman Kwok, Kin-on Chen, Frank Youhua BMC Med Res Methodol Research BACKGROUND: Accurately estimating elderly patients’ rehospitalisation risk benefits clinical decisions and service planning. However, research in rehospitalisation and repeated hospitalisation yielded only models with modest performance, and the model performance deteriorates rapidly as the prediction timeframe expands beyond 28 days and for older participants. METHODS: A temporal zero-inflated Poisson (tZIP) regression model was developed and validated retrospectively and prospectively. The data of the electronic health records (EHRs) contain cohorts (aged 60+) in a major public hospital in Hong Kong. Two temporal offset functions accounted for the associations between exposure time and parameters corresponding to the zero-inflated logistic component and the Poisson distribution’s expected count. tZIP was externally validated with a retrospective cohort’s rehospitalisation events up to 12 months after the discharge date. Subsequently, tZIP was validated prospectively after piloting its implementation at the study hospital. Patients discharged within the pilot period were tagged, and the proposed model’s prediction of their rehospitalisation was verified monthly. Using a hybrid machine learning (ML) approach, the tZIP-based risk estimator’s marginal effect on 28-day rehospitalisation was further validated, competing with other factors representing different post-acute and clinical statuses. RESULTS: The tZIP prediction of rehospitalisation from 28 days to 365 days was achieved at above 80% discrimination accuracy retrospectively and prospectively in two out-of-sample cohorts. With a large margin, it outperformed the Cox proportional and linear models built with the same predictors. The hybrid ML revealed that the risk estimator’s contribution to 28-day rehospitalisation outweighed other features relevant to service utilisation and clinical status. CONCLUSIONS: A novel rehospitalisation risk model was introduced, and its risk estimators, whose importance outweighed all other factors of diverse post-acute care and clinical conditions, were derived. The proposed approach relies on four easily accessible variables easily extracted from EHR. Thus, clinicians could visualise patients’ rehospitalisation risk from 28 days to 365 days after discharge and screen high-risk older patients for follow-up care at the proper time. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01824-1. BioMed Central 2023-01-13 /pmc/articles/PMC9837949/ /pubmed/36639745 http://dx.doi.org/10.1186/s12874-022-01824-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Guan, Jingjing
Leung, Eman
Kwok, Kin-on
Chen, Frank Youhua
A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong
title A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong
title_full A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong
title_fullStr A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong
title_full_unstemmed A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong
title_short A hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in Hong Kong
title_sort hybrid machine learning framework to improve prediction of all-cause rehospitalization among elderly patients in hong kong
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837949/
https://www.ncbi.nlm.nih.gov/pubmed/36639745
http://dx.doi.org/10.1186/s12874-022-01824-1
work_keys_str_mv AT guanjingjing ahybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong
AT leungeman ahybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong
AT kwokkinon ahybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong
AT chenfrankyouhua ahybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong
AT guanjingjing hybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong
AT leungeman hybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong
AT kwokkinon hybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong
AT chenfrankyouhua hybridmachinelearningframeworktoimprovepredictionofallcauserehospitalizationamongelderlypatientsinhongkong