Cargando…

Tree-based survival analysis improves mortality prediction in cardiac surgery

OBJECTIVES: Machine learning (ML) classification tools are known to accurately predict many cardiac surgical outcomes. A novel approach, ML-based survival analysis, remains unstudied for predicting mortality after cardiac surgery. We aimed to benchmark performance, as measured by the concordance ind...

Descripción completa

Detalles Bibliográficos
Autores principales: Penny-Dimri, Jahan C., Bergmeir, Christoph, Reid, Christopher M., Williams-Spence, Jenni, Perry, Luke A., Smith, Julian A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10365268/
https://www.ncbi.nlm.nih.gov/pubmed/37492161
http://dx.doi.org/10.3389/fcvm.2023.1211600
_version_ 1785077005188857856
author Penny-Dimri, Jahan C.
Bergmeir, Christoph
Reid, Christopher M.
Williams-Spence, Jenni
Perry, Luke A.
Smith, Julian A.
author_facet Penny-Dimri, Jahan C.
Bergmeir, Christoph
Reid, Christopher M.
Williams-Spence, Jenni
Perry, Luke A.
Smith, Julian A.
author_sort Penny-Dimri, Jahan C.
collection PubMed
description OBJECTIVES: Machine learning (ML) classification tools are known to accurately predict many cardiac surgical outcomes. A novel approach, ML-based survival analysis, remains unstudied for predicting mortality after cardiac surgery. We aimed to benchmark performance, as measured by the concordance index (C-index), of tree-based survival models against Cox proportional hazards (CPH) modeling and explore risk factors using the best-performing model. METHODS: 144,536 patients with 147,301 surgery events from the Australian and New Zealand Society of Cardiac and Thoracic Surgeons (ANZSCTS) national database were used to train and validate models. Univariate analysis was performed using Student's T-test for continuous variables, Chi-squared test for categorical variables, and stratified Kaplan-Meier estimation of the survival function. Three ML models were tested, a decision tree (DT), random forest (RF), and gradient boosting machine (GBM). Hyperparameter tuning was performed using a Bayesian search strategy. Performance was assessed using 2-fold cross-validation repeated 5 times. RESULTS: The highest performing model was the GBM with a C-index of 0.803 (0.002), followed by RF with 0.791 (0.003), DT with 0.729 (0.014), and finally CPH with 0.596 (0.042). The 5 most predictive features were age, type of procedure, length of hospital stay, drain output in the first 4 h (ml), and inotrope use greater than 4 h postoperatively. CONCLUSION: Tree-based learning for survival analysis is a non-parametric and performant alternative to CPH modeling. GBMs offer interpretable modeling of non-linear relationships, promising to expose the most relevant risk factors and uncover new questions to guide future research.
format Online
Article
Text
id pubmed-10365268
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103652682023-07-25 Tree-based survival analysis improves mortality prediction in cardiac surgery Penny-Dimri, Jahan C. Bergmeir, Christoph Reid, Christopher M. Williams-Spence, Jenni Perry, Luke A. Smith, Julian A. Front Cardiovasc Med Cardiovascular Medicine OBJECTIVES: Machine learning (ML) classification tools are known to accurately predict many cardiac surgical outcomes. A novel approach, ML-based survival analysis, remains unstudied for predicting mortality after cardiac surgery. We aimed to benchmark performance, as measured by the concordance index (C-index), of tree-based survival models against Cox proportional hazards (CPH) modeling and explore risk factors using the best-performing model. METHODS: 144,536 patients with 147,301 surgery events from the Australian and New Zealand Society of Cardiac and Thoracic Surgeons (ANZSCTS) national database were used to train and validate models. Univariate analysis was performed using Student's T-test for continuous variables, Chi-squared test for categorical variables, and stratified Kaplan-Meier estimation of the survival function. Three ML models were tested, a decision tree (DT), random forest (RF), and gradient boosting machine (GBM). Hyperparameter tuning was performed using a Bayesian search strategy. Performance was assessed using 2-fold cross-validation repeated 5 times. RESULTS: The highest performing model was the GBM with a C-index of 0.803 (0.002), followed by RF with 0.791 (0.003), DT with 0.729 (0.014), and finally CPH with 0.596 (0.042). The 5 most predictive features were age, type of procedure, length of hospital stay, drain output in the first 4 h (ml), and inotrope use greater than 4 h postoperatively. CONCLUSION: Tree-based learning for survival analysis is a non-parametric and performant alternative to CPH modeling. GBMs offer interpretable modeling of non-linear relationships, promising to expose the most relevant risk factors and uncover new questions to guide future research. Frontiers Media S.A. 2023-07-10 /pmc/articles/PMC10365268/ /pubmed/37492161 http://dx.doi.org/10.3389/fcvm.2023.1211600 Text en © 2023 Penny-Dimri, Bergmeir, Reid, Williams-Spence, Perry and Smith. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (https://creativecommons.org/licenses/by/4.0/) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cardiovascular Medicine
Penny-Dimri, Jahan C.
Bergmeir, Christoph
Reid, Christopher M.
Williams-Spence, Jenni
Perry, Luke A.
Smith, Julian A.
Tree-based survival analysis improves mortality prediction in cardiac surgery
title Tree-based survival analysis improves mortality prediction in cardiac surgery
title_full Tree-based survival analysis improves mortality prediction in cardiac surgery
title_fullStr Tree-based survival analysis improves mortality prediction in cardiac surgery
title_full_unstemmed Tree-based survival analysis improves mortality prediction in cardiac surgery
title_short Tree-based survival analysis improves mortality prediction in cardiac surgery
title_sort tree-based survival analysis improves mortality prediction in cardiac surgery
topic Cardiovascular Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10365268/
https://www.ncbi.nlm.nih.gov/pubmed/37492161
http://dx.doi.org/10.3389/fcvm.2023.1211600
work_keys_str_mv AT pennydimrijahanc treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery
AT bergmeirchristoph treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery
AT reidchristopherm treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery
AT williamsspencejenni treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery
AT perrylukea treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery
AT smithjuliana treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery