Cargando…

Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning

BACKGROUND: Heart failure (HF) is a major complication following ischemic heart disease (IHD) and it adversely affects the outcome. Early prediction of HF risk in patients with IHD is beneficial for timely intervention and for reducing disease burden. METHODS: Two cohorts, cases for patients first d...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Dejia, Qiu, Hang, Wang, Liya, Shen, Minghui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10207812/
https://www.ncbi.nlm.nih.gov/pubmed/37221512
http://dx.doi.org/10.1186/s12911-023-02196-2
_version_ 1785046536417181696
author Zhou, Dejia
Qiu, Hang
Wang, Liya
Shen, Minghui
author_facet Zhou, Dejia
Qiu, Hang
Wang, Liya
Shen, Minghui
author_sort Zhou, Dejia
collection PubMed
description BACKGROUND: Heart failure (HF) is a major complication following ischemic heart disease (IHD) and it adversely affects the outcome. Early prediction of HF risk in patients with IHD is beneficial for timely intervention and for reducing disease burden. METHODS: Two cohorts, cases for patients first diagnosed with IHD and then with HF (N = 11,862) and control IHD patients without HF (N = 25,652), were established from the hospital discharge records in Sichuan, China during 2015-2019. Directed personal disease network (PDN) was constructed for each patient, and then these PDNs were merged to generate the baseline disease network (BDN) for the two cohorts, respectively, which identifies the health trajectories of patients and the complex progression patterns. The differences between the BDNs of the two cohort was represented as disease-specific network (DSN). Three novel network features were exacted from PDN and DSN to represent the similarity of disease patterns and specificity trends from IHD to HF. A stacking-based ensemble model DXLR was proposed to predict HF risk in IHD patients using the novel network features and basic demographic features (i.e., age and sex). The Shapley Addictive exPlanations method was applied to analyze the feature importance of the DXLR model. RESULTS: Compared with the six traditional machine learning models, our DXLR model exhibited the highest AUC (0.934 ± 0.004), accuracy (0.857 ± 0.007), precision (0.723 ± 0.014), recall (0.892 ± 0.012) and F(1) score (0.798 ± 0.010). The feature importance showed that the novel network features ranked as the top three features, playing a notable role in predicting HF risk of IHD patient. The feature comparison experiment also indicated that our novel network features were superior to those proposed by the state-of-the-art study in improving the performance of the prediction model, with an increase in AUC by 19.9%, in accuracy by 18.7%, in precision by 30.7%, in recall by 37.4%, and in F(1) score by 33.7%. CONCLUSIONS: Our proposed approach that combines network analytics and ensemble learning effectively predicts HF risk in patients with IHD. This highlights the potential value of network-based machine learning in disease risk prediction field using administrative data.
format Online
Article
Text
id pubmed-10207812
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102078122023-05-25 Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning Zhou, Dejia Qiu, Hang Wang, Liya Shen, Minghui BMC Med Inform Decis Mak Research BACKGROUND: Heart failure (HF) is a major complication following ischemic heart disease (IHD) and it adversely affects the outcome. Early prediction of HF risk in patients with IHD is beneficial for timely intervention and for reducing disease burden. METHODS: Two cohorts, cases for patients first diagnosed with IHD and then with HF (N = 11,862) and control IHD patients without HF (N = 25,652), were established from the hospital discharge records in Sichuan, China during 2015-2019. Directed personal disease network (PDN) was constructed for each patient, and then these PDNs were merged to generate the baseline disease network (BDN) for the two cohorts, respectively, which identifies the health trajectories of patients and the complex progression patterns. The differences between the BDNs of the two cohort was represented as disease-specific network (DSN). Three novel network features were exacted from PDN and DSN to represent the similarity of disease patterns and specificity trends from IHD to HF. A stacking-based ensemble model DXLR was proposed to predict HF risk in IHD patients using the novel network features and basic demographic features (i.e., age and sex). The Shapley Addictive exPlanations method was applied to analyze the feature importance of the DXLR model. RESULTS: Compared with the six traditional machine learning models, our DXLR model exhibited the highest AUC (0.934 ± 0.004), accuracy (0.857 ± 0.007), precision (0.723 ± 0.014), recall (0.892 ± 0.012) and F(1) score (0.798 ± 0.010). The feature importance showed that the novel network features ranked as the top three features, playing a notable role in predicting HF risk of IHD patient. The feature comparison experiment also indicated that our novel network features were superior to those proposed by the state-of-the-art study in improving the performance of the prediction model, with an increase in AUC by 19.9%, in accuracy by 18.7%, in precision by 30.7%, in recall by 37.4%, and in F(1) score by 33.7%. CONCLUSIONS: Our proposed approach that combines network analytics and ensemble learning effectively predicts HF risk in patients with IHD. This highlights the potential value of network-based machine learning in disease risk prediction field using administrative data. BioMed Central 2023-05-23 /pmc/articles/PMC10207812/ /pubmed/37221512 http://dx.doi.org/10.1186/s12911-023-02196-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhou, Dejia
Qiu, Hang
Wang, Liya
Shen, Minghui
Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning
title Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning
title_full Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning
title_fullStr Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning
title_full_unstemmed Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning
title_short Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning
title_sort risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10207812/
https://www.ncbi.nlm.nih.gov/pubmed/37221512
http://dx.doi.org/10.1186/s12911-023-02196-2
work_keys_str_mv AT zhoudejia riskpredictionofheartfailureinpatientswithischemicheartdiseaseusingnetworkanalyticsandstackingensemblelearning
AT qiuhang riskpredictionofheartfailureinpatientswithischemicheartdiseaseusingnetworkanalyticsandstackingensemblelearning
AT wangliya riskpredictionofheartfailureinpatientswithischemicheartdiseaseusingnetworkanalyticsandstackingensemblelearning
AT shenminghui riskpredictionofheartfailureinpatientswithischemicheartdiseaseusingnetworkanalyticsandstackingensemblelearning