Cargando…

Improving predictive performance in incident heart failure using machine learning and multi-center data

OBJECTIVE: To mitigate the burden associated with heart failure (HF), primary prevention is of the utmost importance. To improve early risk stratification, advanced computational methods such as machine learning (ML) capturing complex individual patterns in large data might be necessary. Therefore,...

Descripción completa

Detalles Bibliográficos
Autores principales: Sabovčik, František, Ntalianis, Evangelos, Cauwenberghs, Nicholas, Kuznetsova, Tatiana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623026/
https://www.ncbi.nlm.nih.gov/pubmed/36330000
http://dx.doi.org/10.3389/fcvm.2022.1011071
_version_ 1784821901154058240
author Sabovčik, František
Ntalianis, Evangelos
Cauwenberghs, Nicholas
Kuznetsova, Tatiana
author_facet Sabovčik, František
Ntalianis, Evangelos
Cauwenberghs, Nicholas
Kuznetsova, Tatiana
author_sort Sabovčik, František
collection PubMed
description OBJECTIVE: To mitigate the burden associated with heart failure (HF), primary prevention is of the utmost importance. To improve early risk stratification, advanced computational methods such as machine learning (ML) capturing complex individual patterns in large data might be necessary. Therefore, we compared the predictive performance of incident HF risk models in terms of (a) flexible ML models and linear models and (b) models trained on a single cohort (single-center) and on multiple heterogeneous cohorts (multi-center). DESIGN AND METHODS: In our analysis, we used the meta-data consisting of 30,354 individuals from 6 cohorts. During a median follow-up of 5.40 years, 1,068 individuals experienced a non-fatal HF event. We evaluated the predictive performance of survival gradient boosting (SGB), CoxNet, the PCP-HF risk score, and a stacking method. Predictions were obtained iteratively, in each iteration one cohort serving as an external test set and either one or all remaining cohorts as a training set (single- or multi-center, respectively). RESULTS: Overall, multi-center models systematically outperformed single-center models. Further, c-index in the pooled population was higher in SGB (0.735) than in CoxNet (0.694). In the precision-recall (PR) analysis for predicting 10-year HF risk, the stacking method, combining the SGB, CoxNet, Gaussian mixture and PCP-HF models, outperformed other models with PR/AUC 0.804, while PCP-HF achieved only 0.551. CONCLUSION: With a greater number and variety of training cohorts, the model learns a wider range of specific individual health characteristics. Flexible ML algorithms can be used to capture these diverse distributions and produce more precise prediction models.
format Online
Article
Text
id pubmed-9623026
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96230262022-11-02 Improving predictive performance in incident heart failure using machine learning and multi-center data Sabovčik, František Ntalianis, Evangelos Cauwenberghs, Nicholas Kuznetsova, Tatiana Front Cardiovasc Med Cardiovascular Medicine OBJECTIVE: To mitigate the burden associated with heart failure (HF), primary prevention is of the utmost importance. To improve early risk stratification, advanced computational methods such as machine learning (ML) capturing complex individual patterns in large data might be necessary. Therefore, we compared the predictive performance of incident HF risk models in terms of (a) flexible ML models and linear models and (b) models trained on a single cohort (single-center) and on multiple heterogeneous cohorts (multi-center). DESIGN AND METHODS: In our analysis, we used the meta-data consisting of 30,354 individuals from 6 cohorts. During a median follow-up of 5.40 years, 1,068 individuals experienced a non-fatal HF event. We evaluated the predictive performance of survival gradient boosting (SGB), CoxNet, the PCP-HF risk score, and a stacking method. Predictions were obtained iteratively, in each iteration one cohort serving as an external test set and either one or all remaining cohorts as a training set (single- or multi-center, respectively). RESULTS: Overall, multi-center models systematically outperformed single-center models. Further, c-index in the pooled population was higher in SGB (0.735) than in CoxNet (0.694). In the precision-recall (PR) analysis for predicting 10-year HF risk, the stacking method, combining the SGB, CoxNet, Gaussian mixture and PCP-HF models, outperformed other models with PR/AUC 0.804, while PCP-HF achieved only 0.551. CONCLUSION: With a greater number and variety of training cohorts, the model learns a wider range of specific individual health characteristics. Flexible ML algorithms can be used to capture these diverse distributions and produce more precise prediction models. Frontiers Media S.A. 2022-10-18 /pmc/articles/PMC9623026/ /pubmed/36330000 http://dx.doi.org/10.3389/fcvm.2022.1011071 Text en Copyright © 2022 Sabovčik, Ntalianis, Cauwenberghs and Kuznetsova. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cardiovascular Medicine
Sabovčik, František
Ntalianis, Evangelos
Cauwenberghs, Nicholas
Kuznetsova, Tatiana
Improving predictive performance in incident heart failure using machine learning and multi-center data
title Improving predictive performance in incident heart failure using machine learning and multi-center data
title_full Improving predictive performance in incident heart failure using machine learning and multi-center data
title_fullStr Improving predictive performance in incident heart failure using machine learning and multi-center data
title_full_unstemmed Improving predictive performance in incident heart failure using machine learning and multi-center data
title_short Improving predictive performance in incident heart failure using machine learning and multi-center data
title_sort improving predictive performance in incident heart failure using machine learning and multi-center data
topic Cardiovascular Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623026/
https://www.ncbi.nlm.nih.gov/pubmed/36330000
http://dx.doi.org/10.3389/fcvm.2022.1011071
work_keys_str_mv AT sabovcikfrantisek improvingpredictiveperformanceinincidentheartfailureusingmachinelearningandmulticenterdata
AT ntalianisevangelos improvingpredictiveperformanceinincidentheartfailureusingmachinelearningandmulticenterdata
AT cauwenberghsnicholas improvingpredictiveperformanceinincidentheartfailureusingmachinelearningandmulticenterdata
AT kuznetsovatatiana improvingpredictiveperformanceinincidentheartfailureusingmachinelearningandmulticenterdata