Cargando…
Improving predictive performance in incident heart failure using machine learning and multi-center data
OBJECTIVE: To mitigate the burden associated with heart failure (HF), primary prevention is of the utmost importance. To improve early risk stratification, advanced computational methods such as machine learning (ML) capturing complex individual patterns in large data might be necessary. Therefore,...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623026/ https://www.ncbi.nlm.nih.gov/pubmed/36330000 http://dx.doi.org/10.3389/fcvm.2022.1011071 |
_version_ | 1784821901154058240 |
---|---|
author | Sabovčik, František Ntalianis, Evangelos Cauwenberghs, Nicholas Kuznetsova, Tatiana |
author_facet | Sabovčik, František Ntalianis, Evangelos Cauwenberghs, Nicholas Kuznetsova, Tatiana |
author_sort | Sabovčik, František |
collection | PubMed |
description | OBJECTIVE: To mitigate the burden associated with heart failure (HF), primary prevention is of the utmost importance. To improve early risk stratification, advanced computational methods such as machine learning (ML) capturing complex individual patterns in large data might be necessary. Therefore, we compared the predictive performance of incident HF risk models in terms of (a) flexible ML models and linear models and (b) models trained on a single cohort (single-center) and on multiple heterogeneous cohorts (multi-center). DESIGN AND METHODS: In our analysis, we used the meta-data consisting of 30,354 individuals from 6 cohorts. During a median follow-up of 5.40 years, 1,068 individuals experienced a non-fatal HF event. We evaluated the predictive performance of survival gradient boosting (SGB), CoxNet, the PCP-HF risk score, and a stacking method. Predictions were obtained iteratively, in each iteration one cohort serving as an external test set and either one or all remaining cohorts as a training set (single- or multi-center, respectively). RESULTS: Overall, multi-center models systematically outperformed single-center models. Further, c-index in the pooled population was higher in SGB (0.735) than in CoxNet (0.694). In the precision-recall (PR) analysis for predicting 10-year HF risk, the stacking method, combining the SGB, CoxNet, Gaussian mixture and PCP-HF models, outperformed other models with PR/AUC 0.804, while PCP-HF achieved only 0.551. CONCLUSION: With a greater number and variety of training cohorts, the model learns a wider range of specific individual health characteristics. Flexible ML algorithms can be used to capture these diverse distributions and produce more precise prediction models. |
format | Online Article Text |
id | pubmed-9623026 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-96230262022-11-02 Improving predictive performance in incident heart failure using machine learning and multi-center data Sabovčik, František Ntalianis, Evangelos Cauwenberghs, Nicholas Kuznetsova, Tatiana Front Cardiovasc Med Cardiovascular Medicine OBJECTIVE: To mitigate the burden associated with heart failure (HF), primary prevention is of the utmost importance. To improve early risk stratification, advanced computational methods such as machine learning (ML) capturing complex individual patterns in large data might be necessary. Therefore, we compared the predictive performance of incident HF risk models in terms of (a) flexible ML models and linear models and (b) models trained on a single cohort (single-center) and on multiple heterogeneous cohorts (multi-center). DESIGN AND METHODS: In our analysis, we used the meta-data consisting of 30,354 individuals from 6 cohorts. During a median follow-up of 5.40 years, 1,068 individuals experienced a non-fatal HF event. We evaluated the predictive performance of survival gradient boosting (SGB), CoxNet, the PCP-HF risk score, and a stacking method. Predictions were obtained iteratively, in each iteration one cohort serving as an external test set and either one or all remaining cohorts as a training set (single- or multi-center, respectively). RESULTS: Overall, multi-center models systematically outperformed single-center models. Further, c-index in the pooled population was higher in SGB (0.735) than in CoxNet (0.694). In the precision-recall (PR) analysis for predicting 10-year HF risk, the stacking method, combining the SGB, CoxNet, Gaussian mixture and PCP-HF models, outperformed other models with PR/AUC 0.804, while PCP-HF achieved only 0.551. CONCLUSION: With a greater number and variety of training cohorts, the model learns a wider range of specific individual health characteristics. Flexible ML algorithms can be used to capture these diverse distributions and produce more precise prediction models. Frontiers Media S.A. 2022-10-18 /pmc/articles/PMC9623026/ /pubmed/36330000 http://dx.doi.org/10.3389/fcvm.2022.1011071 Text en Copyright © 2022 Sabovčik, Ntalianis, Cauwenberghs and Kuznetsova. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cardiovascular Medicine Sabovčik, František Ntalianis, Evangelos Cauwenberghs, Nicholas Kuznetsova, Tatiana Improving predictive performance in incident heart failure using machine learning and multi-center data |
title | Improving predictive performance in incident heart failure using machine learning and multi-center data |
title_full | Improving predictive performance in incident heart failure using machine learning and multi-center data |
title_fullStr | Improving predictive performance in incident heart failure using machine learning and multi-center data |
title_full_unstemmed | Improving predictive performance in incident heart failure using machine learning and multi-center data |
title_short | Improving predictive performance in incident heart failure using machine learning and multi-center data |
title_sort | improving predictive performance in incident heart failure using machine learning and multi-center data |
topic | Cardiovascular Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623026/ https://www.ncbi.nlm.nih.gov/pubmed/36330000 http://dx.doi.org/10.3389/fcvm.2022.1011071 |
work_keys_str_mv | AT sabovcikfrantisek improvingpredictiveperformanceinincidentheartfailureusingmachinelearningandmulticenterdata AT ntalianisevangelos improvingpredictiveperformanceinincidentheartfailureusingmachinelearningandmulticenterdata AT cauwenberghsnicholas improvingpredictiveperformanceinincidentheartfailureusingmachinelearningandmulticenterdata AT kuznetsovatatiana improvingpredictiveperformanceinincidentheartfailureusingmachinelearningandmulticenterdata |