Cargando…
Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning
BACKGROUND: Heart failure (HF) is a major complication following ischemic heart disease (IHD) and it adversely affects the outcome. Early prediction of HF risk in patients with IHD is beneficial for timely intervention and for reducing disease burden. METHODS: Two cohorts, cases for patients first d...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10207812/ https://www.ncbi.nlm.nih.gov/pubmed/37221512 http://dx.doi.org/10.1186/s12911-023-02196-2 |
_version_ | 1785046536417181696 |
---|---|
author | Zhou, Dejia Qiu, Hang Wang, Liya Shen, Minghui |
author_facet | Zhou, Dejia Qiu, Hang Wang, Liya Shen, Minghui |
author_sort | Zhou, Dejia |
collection | PubMed |
description | BACKGROUND: Heart failure (HF) is a major complication following ischemic heart disease (IHD) and it adversely affects the outcome. Early prediction of HF risk in patients with IHD is beneficial for timely intervention and for reducing disease burden. METHODS: Two cohorts, cases for patients first diagnosed with IHD and then with HF (N = 11,862) and control IHD patients without HF (N = 25,652), were established from the hospital discharge records in Sichuan, China during 2015-2019. Directed personal disease network (PDN) was constructed for each patient, and then these PDNs were merged to generate the baseline disease network (BDN) for the two cohorts, respectively, which identifies the health trajectories of patients and the complex progression patterns. The differences between the BDNs of the two cohort was represented as disease-specific network (DSN). Three novel network features were exacted from PDN and DSN to represent the similarity of disease patterns and specificity trends from IHD to HF. A stacking-based ensemble model DXLR was proposed to predict HF risk in IHD patients using the novel network features and basic demographic features (i.e., age and sex). The Shapley Addictive exPlanations method was applied to analyze the feature importance of the DXLR model. RESULTS: Compared with the six traditional machine learning models, our DXLR model exhibited the highest AUC (0.934 ± 0.004), accuracy (0.857 ± 0.007), precision (0.723 ± 0.014), recall (0.892 ± 0.012) and F(1) score (0.798 ± 0.010). The feature importance showed that the novel network features ranked as the top three features, playing a notable role in predicting HF risk of IHD patient. The feature comparison experiment also indicated that our novel network features were superior to those proposed by the state-of-the-art study in improving the performance of the prediction model, with an increase in AUC by 19.9%, in accuracy by 18.7%, in precision by 30.7%, in recall by 37.4%, and in F(1) score by 33.7%. CONCLUSIONS: Our proposed approach that combines network analytics and ensemble learning effectively predicts HF risk in patients with IHD. This highlights the potential value of network-based machine learning in disease risk prediction field using administrative data. |
format | Online Article Text |
id | pubmed-10207812 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-102078122023-05-25 Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning Zhou, Dejia Qiu, Hang Wang, Liya Shen, Minghui BMC Med Inform Decis Mak Research BACKGROUND: Heart failure (HF) is a major complication following ischemic heart disease (IHD) and it adversely affects the outcome. Early prediction of HF risk in patients with IHD is beneficial for timely intervention and for reducing disease burden. METHODS: Two cohorts, cases for patients first diagnosed with IHD and then with HF (N = 11,862) and control IHD patients without HF (N = 25,652), were established from the hospital discharge records in Sichuan, China during 2015-2019. Directed personal disease network (PDN) was constructed for each patient, and then these PDNs were merged to generate the baseline disease network (BDN) for the two cohorts, respectively, which identifies the health trajectories of patients and the complex progression patterns. The differences between the BDNs of the two cohort was represented as disease-specific network (DSN). Three novel network features were exacted from PDN and DSN to represent the similarity of disease patterns and specificity trends from IHD to HF. A stacking-based ensemble model DXLR was proposed to predict HF risk in IHD patients using the novel network features and basic demographic features (i.e., age and sex). The Shapley Addictive exPlanations method was applied to analyze the feature importance of the DXLR model. RESULTS: Compared with the six traditional machine learning models, our DXLR model exhibited the highest AUC (0.934 ± 0.004), accuracy (0.857 ± 0.007), precision (0.723 ± 0.014), recall (0.892 ± 0.012) and F(1) score (0.798 ± 0.010). The feature importance showed that the novel network features ranked as the top three features, playing a notable role in predicting HF risk of IHD patient. The feature comparison experiment also indicated that our novel network features were superior to those proposed by the state-of-the-art study in improving the performance of the prediction model, with an increase in AUC by 19.9%, in accuracy by 18.7%, in precision by 30.7%, in recall by 37.4%, and in F(1) score by 33.7%. CONCLUSIONS: Our proposed approach that combines network analytics and ensemble learning effectively predicts HF risk in patients with IHD. This highlights the potential value of network-based machine learning in disease risk prediction field using administrative data. BioMed Central 2023-05-23 /pmc/articles/PMC10207812/ /pubmed/37221512 http://dx.doi.org/10.1186/s12911-023-02196-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zhou, Dejia Qiu, Hang Wang, Liya Shen, Minghui Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning |
title | Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning |
title_full | Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning |
title_fullStr | Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning |
title_full_unstemmed | Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning |
title_short | Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning |
title_sort | risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10207812/ https://www.ncbi.nlm.nih.gov/pubmed/37221512 http://dx.doi.org/10.1186/s12911-023-02196-2 |
work_keys_str_mv | AT zhoudejia riskpredictionofheartfailureinpatientswithischemicheartdiseaseusingnetworkanalyticsandstackingensemblelearning AT qiuhang riskpredictionofheartfailureinpatientswithischemicheartdiseaseusingnetworkanalyticsandstackingensemblelearning AT wangliya riskpredictionofheartfailureinpatientswithischemicheartdiseaseusingnetworkanalyticsandstackingensemblelearning AT shenminghui riskpredictionofheartfailureinpatientswithischemicheartdiseaseusingnetworkanalyticsandstackingensemblelearning |