Cargando…
Comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database
OBJECTIVES: To perform a systematic comparison of in-hospital mortality risk prediction post-cardiac surgery, between the predominant scoring system—European System for Cardiac Operative Risk Evaluation (EuroSCORE) II, logistic regression (LR) retrained on the same variables and alternative machine...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10275911/ https://www.ncbi.nlm.nih.gov/pubmed/37154705 http://dx.doi.org/10.1093/ejcts/ezad183 |
_version_ | 1785059964777136128 |
---|---|
author | Sinha, Shubhra Dong, Tim Dimagli, Arnaldo Vohra, Hunaid A Holmes, Chris Benedetto, Umberto Angelini, Gianni D |
author_facet | Sinha, Shubhra Dong, Tim Dimagli, Arnaldo Vohra, Hunaid A Holmes, Chris Benedetto, Umberto Angelini, Gianni D |
author_sort | Sinha, Shubhra |
collection | PubMed |
description | OBJECTIVES: To perform a systematic comparison of in-hospital mortality risk prediction post-cardiac surgery, between the predominant scoring system—European System for Cardiac Operative Risk Evaluation (EuroSCORE) II, logistic regression (LR) retrained on the same variables and alternative machine learning techniques (ML)—random forest (RF), neural networks (NN), XGBoost and weighted support vector machine. METHODS: Retrospective analyses of prospectively routinely collected data on adult patients undergoing cardiac surgery in the UK from January 2012 to March 2019. Data were temporally split 70:30 into training and validation subsets. Mortality prediction models were created using the 18 variables of EuroSCORE II. Comparisons of discrimination, calibration and clinical utility were then conducted. Changes in model performance, variable-importance over time and hospital/operation-based model performance were also reviewed. RESULTS: Of the 227 087 adults who underwent cardiac surgery during the study period, there were 6258 deaths (2.76%). In the testing cohort, there was an improvement in discrimination [XGBoost (95% confidence interval (CI) area under the receiver operator curve (AUC), 0.834–0.834, F1 score, 0.276–0.280) and RF (95% CI AUC, 0.833–0.834, F1, 0.277–0.281)] compared with EuroSCORE II (95% CI AUC, 0.817–0.818, F1, 0.243–0.245). There was no significant improvement in calibration with ML and retrained-LR compared to EuroSCORE II. However, EuroSCORE II overestimated risk across all deciles of risk and over time. The calibration drift was lowest in NN, XGBoost and RF compared with EuroSCORE II. Decision curve analysis showed XGBoost and RF to have greater net benefit than EuroSCORE II. CONCLUSIONS: ML techniques showed some statistical improvements over retrained-LR and EuroSCORE II. The clinical impact of this improvement is modest at present. However the incorporation of additional risk factors in future studies may improve upon these findings and warrants further study. |
format | Online Article Text |
id | pubmed-10275911 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-102759112023-06-18 Comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database Sinha, Shubhra Dong, Tim Dimagli, Arnaldo Vohra, Hunaid A Holmes, Chris Benedetto, Umberto Angelini, Gianni D Eur J Cardiothorac Surg General Adult Cardiac OBJECTIVES: To perform a systematic comparison of in-hospital mortality risk prediction post-cardiac surgery, between the predominant scoring system—European System for Cardiac Operative Risk Evaluation (EuroSCORE) II, logistic regression (LR) retrained on the same variables and alternative machine learning techniques (ML)—random forest (RF), neural networks (NN), XGBoost and weighted support vector machine. METHODS: Retrospective analyses of prospectively routinely collected data on adult patients undergoing cardiac surgery in the UK from January 2012 to March 2019. Data were temporally split 70:30 into training and validation subsets. Mortality prediction models were created using the 18 variables of EuroSCORE II. Comparisons of discrimination, calibration and clinical utility were then conducted. Changes in model performance, variable-importance over time and hospital/operation-based model performance were also reviewed. RESULTS: Of the 227 087 adults who underwent cardiac surgery during the study period, there were 6258 deaths (2.76%). In the testing cohort, there was an improvement in discrimination [XGBoost (95% confidence interval (CI) area under the receiver operator curve (AUC), 0.834–0.834, F1 score, 0.276–0.280) and RF (95% CI AUC, 0.833–0.834, F1, 0.277–0.281)] compared with EuroSCORE II (95% CI AUC, 0.817–0.818, F1, 0.243–0.245). There was no significant improvement in calibration with ML and retrained-LR compared to EuroSCORE II. However, EuroSCORE II overestimated risk across all deciles of risk and over time. The calibration drift was lowest in NN, XGBoost and RF compared with EuroSCORE II. Decision curve analysis showed XGBoost and RF to have greater net benefit than EuroSCORE II. CONCLUSIONS: ML techniques showed some statistical improvements over retrained-LR and EuroSCORE II. The clinical impact of this improvement is modest at present. However the incorporation of additional risk factors in future studies may improve upon these findings and warrants further study. Oxford University Press 2023-05-08 /pmc/articles/PMC10275911/ /pubmed/37154705 http://dx.doi.org/10.1093/ejcts/ezad183 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the European Association for Cardio-Thoracic Surgery. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | General Adult Cardiac Sinha, Shubhra Dong, Tim Dimagli, Arnaldo Vohra, Hunaid A Holmes, Chris Benedetto, Umberto Angelini, Gianni D Comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database |
title | Comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database |
title_full | Comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database |
title_fullStr | Comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database |
title_full_unstemmed | Comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database |
title_short | Comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database |
title_sort | comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database |
topic | General Adult Cardiac |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10275911/ https://www.ncbi.nlm.nih.gov/pubmed/37154705 http://dx.doi.org/10.1093/ejcts/ezad183 |
work_keys_str_mv | AT sinhashubhra comparisonofmachinelearningtechniquesinpredictionofmortalityfollowingcardiacsurgeryanalysisofover220000patientsfromalargenationaldatabase AT dongtim comparisonofmachinelearningtechniquesinpredictionofmortalityfollowingcardiacsurgeryanalysisofover220000patientsfromalargenationaldatabase AT dimagliarnaldo comparisonofmachinelearningtechniquesinpredictionofmortalityfollowingcardiacsurgeryanalysisofover220000patientsfromalargenationaldatabase AT vohrahunaida comparisonofmachinelearningtechniquesinpredictionofmortalityfollowingcardiacsurgeryanalysisofover220000patientsfromalargenationaldatabase AT holmeschris comparisonofmachinelearningtechniquesinpredictionofmortalityfollowingcardiacsurgeryanalysisofover220000patientsfromalargenationaldatabase AT benedettoumberto comparisonofmachinelearningtechniquesinpredictionofmortalityfollowingcardiacsurgeryanalysisofover220000patientsfromalargenationaldatabase AT angelinigiannid comparisonofmachinelearningtechniquesinpredictionofmortalityfollowingcardiacsurgeryanalysisofover220000patientsfromalargenationaldatabase |