Cargando…

What drives performance in machine learning models for predicting heart failure outcome?

AIMS: The development of acute heart failure (AHF) is a critical decision point in the natural history of the disease and carries a dismal prognosis. The lack of appropriate risk-stratification tools at hospital discharge of AHF patients significantly limits clinical ability to precisely tailor pati...

Descripción completa

Detalles Bibliográficos
Autores principales: Gutman, Rom, Aronson, Doron, Caspi, Oren, Shalit, Uri
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10232285/
https://www.ncbi.nlm.nih.gov/pubmed/37265860
http://dx.doi.org/10.1093/ehjdh/ztac054
_version_ 1785051940079534080
author Gutman, Rom
Aronson, Doron
Caspi, Oren
Shalit, Uri
author_facet Gutman, Rom
Aronson, Doron
Caspi, Oren
Shalit, Uri
author_sort Gutman, Rom
collection PubMed
description AIMS: The development of acute heart failure (AHF) is a critical decision point in the natural history of the disease and carries a dismal prognosis. The lack of appropriate risk-stratification tools at hospital discharge of AHF patients significantly limits clinical ability to precisely tailor patient-specific therapeutic regimen at this pivotal juncture. Machine learning-based strategies may improve risk stratification by incorporating analysis of high-dimensional patient data with multiple covariates and novel prediction methodologies. In the current study, we aimed at evaluating the drivers for success in prediction models and establishing an institute-tailored artificial Intelligence-based prediction model for real-time decision support. METHODS AND RESULTS: We used a cohort of all 10 868 patients AHF patients admitted to a tertiary hospital during a 12 years period. A total of 372 covariates were collected from admission to the end of the hospitalization. We assessed model performance across two axes: (i) type of prediction method and (ii) type and number of covariates. The primary outcome was 1-year survival from hospital discharge. For the model-type axis, we experimented with seven different methods: logistic regression (LR) with either L(1) or L(2) regularization, random forest (RF), Cox proportional hazards model (Cox), extreme gradient boosting (XGBoost), a deep neural-net (NeuralNet) and an ensemble classifier of all the above methods. We were able to achieve an area under receiver operator curve (AUROC) prediction accuracy of more than 80% with most prediction models including L1/L2-LR (80.4%/80.3%), Cox (80.2%), XGBoost (80.5%), NeuralNet (80.4%). RF was inferior to other methods (78.8%), and the ensemble model was slightly superior (81.2%). The number of covariates was a significant modifier (P < 0.001) of prediction success, the use of multiplex-covariates preformed significantly better (AUROC 80.4% for L1-LR) compared with a set of known clinical covariates (AUROC 77.8%). Demographics followed by lab-tests and administrative data resulted in the largest gain in model performance. CONCLUSIONS: The choice of the predictive modelling method is secondary to the multiplicity and type of covariates for predicting AHF prognosis. The application of a structured data pre-processing combined with the use of multiple-covariates results in an accurate, institute-tailored risk prediction in AHF
format Online
Article
Text
id pubmed-10232285
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102322852023-06-01 What drives performance in machine learning models for predicting heart failure outcome? Gutman, Rom Aronson, Doron Caspi, Oren Shalit, Uri Eur Heart J Digit Health Original Article AIMS: The development of acute heart failure (AHF) is a critical decision point in the natural history of the disease and carries a dismal prognosis. The lack of appropriate risk-stratification tools at hospital discharge of AHF patients significantly limits clinical ability to precisely tailor patient-specific therapeutic regimen at this pivotal juncture. Machine learning-based strategies may improve risk stratification by incorporating analysis of high-dimensional patient data with multiple covariates and novel prediction methodologies. In the current study, we aimed at evaluating the drivers for success in prediction models and establishing an institute-tailored artificial Intelligence-based prediction model for real-time decision support. METHODS AND RESULTS: We used a cohort of all 10 868 patients AHF patients admitted to a tertiary hospital during a 12 years period. A total of 372 covariates were collected from admission to the end of the hospitalization. We assessed model performance across two axes: (i) type of prediction method and (ii) type and number of covariates. The primary outcome was 1-year survival from hospital discharge. For the model-type axis, we experimented with seven different methods: logistic regression (LR) with either L(1) or L(2) regularization, random forest (RF), Cox proportional hazards model (Cox), extreme gradient boosting (XGBoost), a deep neural-net (NeuralNet) and an ensemble classifier of all the above methods. We were able to achieve an area under receiver operator curve (AUROC) prediction accuracy of more than 80% with most prediction models including L1/L2-LR (80.4%/80.3%), Cox (80.2%), XGBoost (80.5%), NeuralNet (80.4%). RF was inferior to other methods (78.8%), and the ensemble model was slightly superior (81.2%). The number of covariates was a significant modifier (P < 0.001) of prediction success, the use of multiplex-covariates preformed significantly better (AUROC 80.4% for L1-LR) compared with a set of known clinical covariates (AUROC 77.8%). Demographics followed by lab-tests and administrative data resulted in the largest gain in model performance. CONCLUSIONS: The choice of the predictive modelling method is secondary to the multiplicity and type of covariates for predicting AHF prognosis. The application of a structured data pre-processing combined with the use of multiple-covariates results in an accurate, institute-tailored risk prediction in AHF Oxford University Press 2022-09-30 /pmc/articles/PMC10232285/ /pubmed/37265860 http://dx.doi.org/10.1093/ehjdh/ztac054 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the European Society of Cardiology. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Article
Gutman, Rom
Aronson, Doron
Caspi, Oren
Shalit, Uri
What drives performance in machine learning models for predicting heart failure outcome?
title What drives performance in machine learning models for predicting heart failure outcome?
title_full What drives performance in machine learning models for predicting heart failure outcome?
title_fullStr What drives performance in machine learning models for predicting heart failure outcome?
title_full_unstemmed What drives performance in machine learning models for predicting heart failure outcome?
title_short What drives performance in machine learning models for predicting heart failure outcome?
title_sort what drives performance in machine learning models for predicting heart failure outcome?
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10232285/
https://www.ncbi.nlm.nih.gov/pubmed/37265860
http://dx.doi.org/10.1093/ehjdh/ztac054
work_keys_str_mv AT gutmanrom whatdrivesperformanceinmachinelearningmodelsforpredictingheartfailureoutcome
AT aronsondoron whatdrivesperformanceinmachinelearningmodelsforpredictingheartfailureoutcome
AT caspioren whatdrivesperformanceinmachinelearningmodelsforpredictingheartfailureoutcome
AT shalituri whatdrivesperformanceinmachinelearningmodelsforpredictingheartfailureoutcome