Cargando…
Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study
BACKGROUND: The application of machine learning (ML) to analyze clinical data with the goal to predict patient outcomes has garnered increasing attention. Ensemble learning has been used in conjunction with ML to improve predictive performance. Although stacked generalization (stacking), a type of h...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier B.V.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10165871/ https://www.ncbi.nlm.nih.gov/pubmed/37172507 http://dx.doi.org/10.1016/j.ijmedinf.2023.105090 |
_version_ | 1785038332132065280 |
---|---|
author | Kablan, Rianne Miller, Hunter A. Suliman, Sally Frieboes, Hermann B. |
author_facet | Kablan, Rianne Miller, Hunter A. Suliman, Sally Frieboes, Hermann B. |
author_sort | Kablan, Rianne |
collection | PubMed |
description | BACKGROUND: The application of machine learning (ML) to analyze clinical data with the goal to predict patient outcomes has garnered increasing attention. Ensemble learning has been used in conjunction with ML to improve predictive performance. Although stacked generalization (stacking), a type of heterogeneous ensemble of ML models, has emerged in clinical data analysis, it remains unclear how to define the best model combinations for strong predictive performance. This study develops a methodology to evaluate the performance of “base” learner models and their optimized combination using “meta” learner models in stacked ensembles to accurately assess performance in the context of clinical outcomes. METHODS: De-identified COVID-19 data was obtained from the University of Louisville Hospital, where a retrospective chart review was performed from March 2020 to November 2021. Three differently-sized subsets using features from the overall dataset were chosen to train and evaluate ensemble classification performance. The number of base learners chosen from several algorithm families coupled with a complementary meta learner was varied from a minimum of 2 to a maximum of 8. Predictive performance of these combinations was evaluated in terms of mortality and severe cardiac event outcomes using area-under-the-receiver-operating-characteristic (AUROC), F1, balanced accuracy, and kappa. RESULTS: The results highlight the potential to accurately predict clinical outcomes, such as severe cardiac events with COVID-19, from routinely acquired in-hospital patient data. Meta learners Generalized Linear Model (GLM), Multi-Layer Perceptron (MLP), and Partial Least Squares (PLS) had the highest AUROC for both outcomes, while K-Nearest Neighbors (KNN) had the lowest. Performance trended lower in the training set as the number of features increased, and exhibited less variance in both training and validation across all feature subsets as the number of base learners increased. CONCLUSION: This study offers a methodology to robustly evaluate ensemble ML performance when analyzing clinical data. |
format | Online Article Text |
id | pubmed-10165871 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-101658712023-05-09 Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study Kablan, Rianne Miller, Hunter A. Suliman, Sally Frieboes, Hermann B. Int J Med Inform Article BACKGROUND: The application of machine learning (ML) to analyze clinical data with the goal to predict patient outcomes has garnered increasing attention. Ensemble learning has been used in conjunction with ML to improve predictive performance. Although stacked generalization (stacking), a type of heterogeneous ensemble of ML models, has emerged in clinical data analysis, it remains unclear how to define the best model combinations for strong predictive performance. This study develops a methodology to evaluate the performance of “base” learner models and their optimized combination using “meta” learner models in stacked ensembles to accurately assess performance in the context of clinical outcomes. METHODS: De-identified COVID-19 data was obtained from the University of Louisville Hospital, where a retrospective chart review was performed from March 2020 to November 2021. Three differently-sized subsets using features from the overall dataset were chosen to train and evaluate ensemble classification performance. The number of base learners chosen from several algorithm families coupled with a complementary meta learner was varied from a minimum of 2 to a maximum of 8. Predictive performance of these combinations was evaluated in terms of mortality and severe cardiac event outcomes using area-under-the-receiver-operating-characteristic (AUROC), F1, balanced accuracy, and kappa. RESULTS: The results highlight the potential to accurately predict clinical outcomes, such as severe cardiac events with COVID-19, from routinely acquired in-hospital patient data. Meta learners Generalized Linear Model (GLM), Multi-Layer Perceptron (MLP), and Partial Least Squares (PLS) had the highest AUROC for both outcomes, while K-Nearest Neighbors (KNN) had the lowest. Performance trended lower in the training set as the number of features increased, and exhibited less variance in both training and validation across all feature subsets as the number of base learners increased. CONCLUSION: This study offers a methodology to robustly evaluate ensemble ML performance when analyzing clinical data. Elsevier B.V. 2023-07 2023-05-08 /pmc/articles/PMC10165871/ /pubmed/37172507 http://dx.doi.org/10.1016/j.ijmedinf.2023.105090 Text en © 2023 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Kablan, Rianne Miller, Hunter A. Suliman, Sally Frieboes, Hermann B. Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study |
title | Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study |
title_full | Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study |
title_fullStr | Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study |
title_full_unstemmed | Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study |
title_short | Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study |
title_sort | evaluation of stacked ensemble model performance to predict clinical outcomes: a covid-19 study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10165871/ https://www.ncbi.nlm.nih.gov/pubmed/37172507 http://dx.doi.org/10.1016/j.ijmedinf.2023.105090 |
work_keys_str_mv | AT kablanrianne evaluationofstackedensemblemodelperformancetopredictclinicaloutcomesacovid19study AT millerhuntera evaluationofstackedensemblemodelperformancetopredictclinicaloutcomesacovid19study AT sulimansally evaluationofstackedensemblemodelperformancetopredictclinicaloutcomesacovid19study AT frieboeshermannb evaluationofstackedensemblemodelperformancetopredictclinicaloutcomesacovid19study |