Cargando…

Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study

OBJECTIVE: The introduction of new clinical risk scores (e.g. European System for Cardiac Operative Risk Evaluation (EuroSCORE) II) superseding original scores (e.g. EuroSCORE I) with different variable sets typically result in disparate datasets due to high levels of missingness for new score varia...

Descripción completa

Detalles Bibliográficos
Autores principales:	Dong, Tim, Sinha, Shubhra, Zhai, Ben, Fudulu, Daniel P, Chan, Jeremy, Narayan, Pradeep, Judge, Andy, Caputo, Massimo, Dimagli, Arnaldo, Benedetto, Umberto, Angelini, Gianni D
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	SAGE Publications 2023
Materias:	Original Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10363892/ https://www.ncbi.nlm.nih.gov/pubmed/37492033 http://dx.doi.org/10.1177/20552076231187605

_version_	1785076736128450560
author	Dong, Tim Sinha, Shubhra Zhai, Ben Fudulu, Daniel P Chan, Jeremy Narayan, Pradeep Judge, Andy Caputo, Massimo Dimagli, Arnaldo Benedetto, Umberto Angelini, Gianni D
author_facet	Dong, Tim Sinha, Shubhra Zhai, Ben Fudulu, Daniel P Chan, Jeremy Narayan, Pradeep Judge, Andy Caputo, Massimo Dimagli, Arnaldo Benedetto, Umberto Angelini, Gianni D
author_sort	Dong, Tim
collection	PubMed
description	OBJECTIVE: The introduction of new clinical risk scores (e.g. European System for Cardiac Operative Risk Evaluation (EuroSCORE) II) superseding original scores (e.g. EuroSCORE I) with different variable sets typically result in disparate datasets due to high levels of missingness for new score variables prior to time of adoption. Little is known about the use of ensemble learning to incorporate disparate data from legacy scores. We tested the hypothesised that Homogenenous and Heterogeneous Machine Learning (ML) ensembles will have better performance than ensembles of Dynamic Model Averaging (DMA) for combining knowledge from EuroSCORE I legacy data with EuroSCORE II data to predict cardiac surgery risk. METHODS: Using the National Adult Cardiac Surgery Audit dataset, we trained 12 different base learner models, based on two different variable sets from either EuroSCORE I (LogES) or EuroScore II (ES II), partitioned by the time of score adoption (1996–2016 or 2012–2016) and evaluated on holdout set (2017–2019). These base learner models were ensembled using nine different combinations of six ML algorithms to produce homogeneous or heterogeneous ensembles. Performance was assessed using a consensus metric. RESULTS: Xgboost homogenous ensemble (HE) was the highest performing model (clinical effectiveness metric (CEM) 0.725) with area under the curve (AUC) (0.8327; 95% confidence interval (CI) 0.8323–0.8329) followed by Random Forest HE (CEM 0.723; AUC 0.8325; 95%CI 0.8320–0.8326). Across different heterogenous ensembles, significantly better performance was obtained by combining siloed datasets across time (CEM 0.720) than building ensembles of either 1996–2011 (t-test adjusted, p = 1.67×10(−6)) or 2012–2019 (t-test adjusted, p = 1.35×10(−193)) datasets alone. CONCLUSIONS: Both homogenous and heterogenous ML ensembles performed significantly better than DMA ensemble of Bayesian Update models. Time-dependent ensemble combination of variables, having differing qualities according to time of score adoption, enabled previously siloed data to be combined, leading to increased power, clinical interpretability of variables and usage of data.
format	Online Article Text
id	pubmed-10363892
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	SAGE Publications
record_format	MEDLINE/PubMed
spelling	pubmed-103638922023-07-25 Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study Dong, Tim Sinha, Shubhra Zhai, Ben Fudulu, Daniel P Chan, Jeremy Narayan, Pradeep Judge, Andy Caputo, Massimo Dimagli, Arnaldo Benedetto, Umberto Angelini, Gianni D Digit Health Original Research OBJECTIVE: The introduction of new clinical risk scores (e.g. European System for Cardiac Operative Risk Evaluation (EuroSCORE) II) superseding original scores (e.g. EuroSCORE I) with different variable sets typically result in disparate datasets due to high levels of missingness for new score variables prior to time of adoption. Little is known about the use of ensemble learning to incorporate disparate data from legacy scores. We tested the hypothesised that Homogenenous and Heterogeneous Machine Learning (ML) ensembles will have better performance than ensembles of Dynamic Model Averaging (DMA) for combining knowledge from EuroSCORE I legacy data with EuroSCORE II data to predict cardiac surgery risk. METHODS: Using the National Adult Cardiac Surgery Audit dataset, we trained 12 different base learner models, based on two different variable sets from either EuroSCORE I (LogES) or EuroScore II (ES II), partitioned by the time of score adoption (1996–2016 or 2012–2016) and evaluated on holdout set (2017–2019). These base learner models were ensembled using nine different combinations of six ML algorithms to produce homogeneous or heterogeneous ensembles. Performance was assessed using a consensus metric. RESULTS: Xgboost homogenous ensemble (HE) was the highest performing model (clinical effectiveness metric (CEM) 0.725) with area under the curve (AUC) (0.8327; 95% confidence interval (CI) 0.8323–0.8329) followed by Random Forest HE (CEM 0.723; AUC 0.8325; 95%CI 0.8320–0.8326). Across different heterogenous ensembles, significantly better performance was obtained by combining siloed datasets across time (CEM 0.720) than building ensembles of either 1996–2011 (t-test adjusted, p = 1.67×10(−6)) or 2012–2019 (t-test adjusted, p = 1.35×10(−193)) datasets alone. CONCLUSIONS: Both homogenous and heterogenous ML ensembles performed significantly better than DMA ensemble of Bayesian Update models. Time-dependent ensemble combination of variables, having differing qualities according to time of score adoption, enabled previously siloed data to be combined, leading to increased power, clinical interpretability of variables and usage of data. SAGE Publications 2023-07-20 /pmc/articles/PMC10363892/ /pubmed/37492033 http://dx.doi.org/10.1177/20552076231187605 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle	Original Research Dong, Tim Sinha, Shubhra Zhai, Ben Fudulu, Daniel P Chan, Jeremy Narayan, Pradeep Judge, Andy Caputo, Massimo Dimagli, Arnaldo Benedetto, Umberto Angelini, Gianni D Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study
title	Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study
title_full	Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study
title_fullStr	Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study
title_full_unstemmed	Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study
title_short	Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study
title_sort	cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: a benchmarking study
topic	Original Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10363892/ https://www.ncbi.nlm.nih.gov/pubmed/37492033 http://dx.doi.org/10.1177/20552076231187605
work_keys_str_mv	AT dongtim cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT sinhashubhra cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT zhaiben cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT fuduludanielp cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT chanjeremy cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT narayanpradeep cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT judgeandy cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT caputomassimo cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT dimagliarnaldo cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT benedettoumberto cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy AT angelinigiannid cardiacsurgeryriskpredictionusingensemblemachinelearningtoincorporatelegacyriskscoresabenchmarkingstudy

Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study

Ejemplares similares