Cargando…

Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study

BACKGROUND: The application of machine learning (ML) to analyze clinical data with the goal to predict patient outcomes has garnered increasing attention. Ensemble learning has been used in conjunction with ML to improve predictive performance. Although stacked generalization (stacking), a type of h...

Descripción completa

Detalles Bibliográficos
Autores principales: Kablan, Rianne, Miller, Hunter A., Suliman, Sally, Frieboes, Hermann B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10165871/
https://www.ncbi.nlm.nih.gov/pubmed/37172507
http://dx.doi.org/10.1016/j.ijmedinf.2023.105090
_version_ 1785038332132065280
author Kablan, Rianne
Miller, Hunter A.
Suliman, Sally
Frieboes, Hermann B.
author_facet Kablan, Rianne
Miller, Hunter A.
Suliman, Sally
Frieboes, Hermann B.
author_sort Kablan, Rianne
collection PubMed
description BACKGROUND: The application of machine learning (ML) to analyze clinical data with the goal to predict patient outcomes has garnered increasing attention. Ensemble learning has been used in conjunction with ML to improve predictive performance. Although stacked generalization (stacking), a type of heterogeneous ensemble of ML models, has emerged in clinical data analysis, it remains unclear how to define the best model combinations for strong predictive performance. This study develops a methodology to evaluate the performance of “base” learner models and their optimized combination using “meta” learner models in stacked ensembles to accurately assess performance in the context of clinical outcomes. METHODS: De-identified COVID-19 data was obtained from the University of Louisville Hospital, where a retrospective chart review was performed from March 2020 to November 2021. Three differently-sized subsets using features from the overall dataset were chosen to train and evaluate ensemble classification performance. The number of base learners chosen from several algorithm families coupled with a complementary meta learner was varied from a minimum of 2 to a maximum of 8. Predictive performance of these combinations was evaluated in terms of mortality and severe cardiac event outcomes using area-under-the-receiver-operating-characteristic (AUROC), F1, balanced accuracy, and kappa. RESULTS: The results highlight the potential to accurately predict clinical outcomes, such as severe cardiac events with COVID-19, from routinely acquired in-hospital patient data. Meta learners Generalized Linear Model (GLM), Multi-Layer Perceptron (MLP), and Partial Least Squares (PLS) had the highest AUROC for both outcomes, while K-Nearest Neighbors (KNN) had the lowest. Performance trended lower in the training set as the number of features increased, and exhibited less variance in both training and validation across all feature subsets as the number of base learners increased. CONCLUSION: This study offers a methodology to robustly evaluate ensemble ML performance when analyzing clinical data.
format Online
Article
Text
id pubmed-10165871
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-101658712023-05-09 Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study Kablan, Rianne Miller, Hunter A. Suliman, Sally Frieboes, Hermann B. Int J Med Inform Article BACKGROUND: The application of machine learning (ML) to analyze clinical data with the goal to predict patient outcomes has garnered increasing attention. Ensemble learning has been used in conjunction with ML to improve predictive performance. Although stacked generalization (stacking), a type of heterogeneous ensemble of ML models, has emerged in clinical data analysis, it remains unclear how to define the best model combinations for strong predictive performance. This study develops a methodology to evaluate the performance of “base” learner models and their optimized combination using “meta” learner models in stacked ensembles to accurately assess performance in the context of clinical outcomes. METHODS: De-identified COVID-19 data was obtained from the University of Louisville Hospital, where a retrospective chart review was performed from March 2020 to November 2021. Three differently-sized subsets using features from the overall dataset were chosen to train and evaluate ensemble classification performance. The number of base learners chosen from several algorithm families coupled with a complementary meta learner was varied from a minimum of 2 to a maximum of 8. Predictive performance of these combinations was evaluated in terms of mortality and severe cardiac event outcomes using area-under-the-receiver-operating-characteristic (AUROC), F1, balanced accuracy, and kappa. RESULTS: The results highlight the potential to accurately predict clinical outcomes, such as severe cardiac events with COVID-19, from routinely acquired in-hospital patient data. Meta learners Generalized Linear Model (GLM), Multi-Layer Perceptron (MLP), and Partial Least Squares (PLS) had the highest AUROC for both outcomes, while K-Nearest Neighbors (KNN) had the lowest. Performance trended lower in the training set as the number of features increased, and exhibited less variance in both training and validation across all feature subsets as the number of base learners increased. CONCLUSION: This study offers a methodology to robustly evaluate ensemble ML performance when analyzing clinical data. Elsevier B.V. 2023-07 2023-05-08 /pmc/articles/PMC10165871/ /pubmed/37172507 http://dx.doi.org/10.1016/j.ijmedinf.2023.105090 Text en © 2023 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Kablan, Rianne
Miller, Hunter A.
Suliman, Sally
Frieboes, Hermann B.
Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study
title Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study
title_full Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study
title_fullStr Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study
title_full_unstemmed Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study
title_short Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study
title_sort evaluation of stacked ensemble model performance to predict clinical outcomes: a covid-19 study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10165871/
https://www.ncbi.nlm.nih.gov/pubmed/37172507
http://dx.doi.org/10.1016/j.ijmedinf.2023.105090
work_keys_str_mv AT kablanrianne evaluationofstackedensemblemodelperformancetopredictclinicaloutcomesacovid19study
AT millerhuntera evaluationofstackedensemblemodelperformancetopredictclinicaloutcomesacovid19study
AT sulimansally evaluationofstackedensemblemodelperformancetopredictclinicaloutcomesacovid19study
AT frieboeshermannb evaluationofstackedensemblemodelperformancetopredictclinicaloutcomesacovid19study