Cargando…
Tree-based survival analysis improves mortality prediction in cardiac surgery
OBJECTIVES: Machine learning (ML) classification tools are known to accurately predict many cardiac surgical outcomes. A novel approach, ML-based survival analysis, remains unstudied for predicting mortality after cardiac surgery. We aimed to benchmark performance, as measured by the concordance ind...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10365268/ https://www.ncbi.nlm.nih.gov/pubmed/37492161 http://dx.doi.org/10.3389/fcvm.2023.1211600 |
_version_ | 1785077005188857856 |
---|---|
author | Penny-Dimri, Jahan C. Bergmeir, Christoph Reid, Christopher M. Williams-Spence, Jenni Perry, Luke A. Smith, Julian A. |
author_facet | Penny-Dimri, Jahan C. Bergmeir, Christoph Reid, Christopher M. Williams-Spence, Jenni Perry, Luke A. Smith, Julian A. |
author_sort | Penny-Dimri, Jahan C. |
collection | PubMed |
description | OBJECTIVES: Machine learning (ML) classification tools are known to accurately predict many cardiac surgical outcomes. A novel approach, ML-based survival analysis, remains unstudied for predicting mortality after cardiac surgery. We aimed to benchmark performance, as measured by the concordance index (C-index), of tree-based survival models against Cox proportional hazards (CPH) modeling and explore risk factors using the best-performing model. METHODS: 144,536 patients with 147,301 surgery events from the Australian and New Zealand Society of Cardiac and Thoracic Surgeons (ANZSCTS) national database were used to train and validate models. Univariate analysis was performed using Student's T-test for continuous variables, Chi-squared test for categorical variables, and stratified Kaplan-Meier estimation of the survival function. Three ML models were tested, a decision tree (DT), random forest (RF), and gradient boosting machine (GBM). Hyperparameter tuning was performed using a Bayesian search strategy. Performance was assessed using 2-fold cross-validation repeated 5 times. RESULTS: The highest performing model was the GBM with a C-index of 0.803 (0.002), followed by RF with 0.791 (0.003), DT with 0.729 (0.014), and finally CPH with 0.596 (0.042). The 5 most predictive features were age, type of procedure, length of hospital stay, drain output in the first 4 h (ml), and inotrope use greater than 4 h postoperatively. CONCLUSION: Tree-based learning for survival analysis is a non-parametric and performant alternative to CPH modeling. GBMs offer interpretable modeling of non-linear relationships, promising to expose the most relevant risk factors and uncover new questions to guide future research. |
format | Online Article Text |
id | pubmed-10365268 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-103652682023-07-25 Tree-based survival analysis improves mortality prediction in cardiac surgery Penny-Dimri, Jahan C. Bergmeir, Christoph Reid, Christopher M. Williams-Spence, Jenni Perry, Luke A. Smith, Julian A. Front Cardiovasc Med Cardiovascular Medicine OBJECTIVES: Machine learning (ML) classification tools are known to accurately predict many cardiac surgical outcomes. A novel approach, ML-based survival analysis, remains unstudied for predicting mortality after cardiac surgery. We aimed to benchmark performance, as measured by the concordance index (C-index), of tree-based survival models against Cox proportional hazards (CPH) modeling and explore risk factors using the best-performing model. METHODS: 144,536 patients with 147,301 surgery events from the Australian and New Zealand Society of Cardiac and Thoracic Surgeons (ANZSCTS) national database were used to train and validate models. Univariate analysis was performed using Student's T-test for continuous variables, Chi-squared test for categorical variables, and stratified Kaplan-Meier estimation of the survival function. Three ML models were tested, a decision tree (DT), random forest (RF), and gradient boosting machine (GBM). Hyperparameter tuning was performed using a Bayesian search strategy. Performance was assessed using 2-fold cross-validation repeated 5 times. RESULTS: The highest performing model was the GBM with a C-index of 0.803 (0.002), followed by RF with 0.791 (0.003), DT with 0.729 (0.014), and finally CPH with 0.596 (0.042). The 5 most predictive features were age, type of procedure, length of hospital stay, drain output in the first 4 h (ml), and inotrope use greater than 4 h postoperatively. CONCLUSION: Tree-based learning for survival analysis is a non-parametric and performant alternative to CPH modeling. GBMs offer interpretable modeling of non-linear relationships, promising to expose the most relevant risk factors and uncover new questions to guide future research. Frontiers Media S.A. 2023-07-10 /pmc/articles/PMC10365268/ /pubmed/37492161 http://dx.doi.org/10.3389/fcvm.2023.1211600 Text en © 2023 Penny-Dimri, Bergmeir, Reid, Williams-Spence, Perry and Smith. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (https://creativecommons.org/licenses/by/4.0/) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cardiovascular Medicine Penny-Dimri, Jahan C. Bergmeir, Christoph Reid, Christopher M. Williams-Spence, Jenni Perry, Luke A. Smith, Julian A. Tree-based survival analysis improves mortality prediction in cardiac surgery |
title | Tree-based survival analysis improves mortality prediction in cardiac surgery |
title_full | Tree-based survival analysis improves mortality prediction in cardiac surgery |
title_fullStr | Tree-based survival analysis improves mortality prediction in cardiac surgery |
title_full_unstemmed | Tree-based survival analysis improves mortality prediction in cardiac surgery |
title_short | Tree-based survival analysis improves mortality prediction in cardiac surgery |
title_sort | tree-based survival analysis improves mortality prediction in cardiac surgery |
topic | Cardiovascular Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10365268/ https://www.ncbi.nlm.nih.gov/pubmed/37492161 http://dx.doi.org/10.3389/fcvm.2023.1211600 |
work_keys_str_mv | AT pennydimrijahanc treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery AT bergmeirchristoph treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery AT reidchristopherm treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery AT williamsspencejenni treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery AT perrylukea treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery AT smithjuliana treebasedsurvivalanalysisimprovesmortalitypredictionincardiacsurgery |