Cargando…

Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission

BACKGROUND: Machine learning (ML) algorithms have been trained to early predict critical in-hospital events from COVID-19 using patient data at admission, but little is known on how their performance compares with each other and/or with statistical logistic regression (LR). This prospective multicen...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sievering, Aaron W., Wohlmuth, Peter, Geßler, Nele, Gunawardene, Melanie A., Herrlinger, Klaus, Bein, Berthold, Arnold, Dirk, Bergmann, Martin, Nowak, Lorenz, Gloeckner, Christian, Koch, Ina, Bachmann, Martin, Herborn, Christoph U., Stang, Axel
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9702742/ https://www.ncbi.nlm.nih.gov/pubmed/36437469 http://dx.doi.org/10.1186/s12911-022-02057-4

_version_	1784839718662307840
author	Sievering, Aaron W. Wohlmuth, Peter Geßler, Nele Gunawardene, Melanie A. Herrlinger, Klaus Bein, Berthold Arnold, Dirk Bergmann, Martin Nowak, Lorenz Gloeckner, Christian Koch, Ina Bachmann, Martin Herborn, Christoph U. Stang, Axel
author_facet	Sievering, Aaron W. Wohlmuth, Peter Geßler, Nele Gunawardene, Melanie A. Herrlinger, Klaus Bein, Berthold Arnold, Dirk Bergmann, Martin Nowak, Lorenz Gloeckner, Christian Koch, Ina Bachmann, Martin Herborn, Christoph U. Stang, Axel
author_sort	Sievering, Aaron W.
collection	PubMed
description	BACKGROUND: Machine learning (ML) algorithms have been trained to early predict critical in-hospital events from COVID-19 using patient data at admission, but little is known on how their performance compares with each other and/or with statistical logistic regression (LR). This prospective multicentre cohort study compares the performance of a LR and five ML models on the contribution of influencing predictors and predictor-to-event relationships on prediction model´s performance. METHODS: We used 25 baseline variables of 490 COVID-19 patients admitted to 8 hospitals in Germany (March–November 2020) to develop and validate (75/25 random-split) 3 linear (L1 and L2 penalty, elastic net [EN]) and 2 non-linear (support vector machine [SVM] with radial kernel, random forest [RF]) ML approaches for predicting critical events defined by intensive care unit transfer, invasive ventilation and/or death (composite end-point: 181 patients). Models were compared for performance (area-under-the-receiver-operating characteristic-curve [AUC], Brier score) and predictor importance (performance-loss metrics, partial-dependence profiles). RESULTS: Models performed close with a small benefit for LR (utilizing restricted cubic splines for non-linearity) and RF (AUC means: 0.763–0.731 [RF–L1]); Brier scores: 0.184–0.197 [LR–L1]). Top ranked predictor variables (consistently highest importance: C-reactive protein) were largely identical across models, except creatinine, which exhibited marginal (L1, L2, EN, SVM) or high/non-linear effects (LR, RF) on events. CONCLUSIONS: Although the LR and ML models analysed showed no strong differences in performance and the most influencing predictors for COVID-19-related event prediction, our results indicate a predictive benefit from taking account for non-linear predictor-to-event relationships and effects. Future efforts should focus on leveraging data-driven ML technologies from static towards dynamic modelling solutions that continuously learn and adapt to changes in data environments during the evolving pandemic. Trial registration number: NCT04659187. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-02057-4.
format	Online Article Text
id	pubmed-9702742
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-97027422022-11-28 Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission Sievering, Aaron W. Wohlmuth, Peter Geßler, Nele Gunawardene, Melanie A. Herrlinger, Klaus Bein, Berthold Arnold, Dirk Bergmann, Martin Nowak, Lorenz Gloeckner, Christian Koch, Ina Bachmann, Martin Herborn, Christoph U. Stang, Axel BMC Med Inform Decis Mak Research BACKGROUND: Machine learning (ML) algorithms have been trained to early predict critical in-hospital events from COVID-19 using patient data at admission, but little is known on how their performance compares with each other and/or with statistical logistic regression (LR). This prospective multicentre cohort study compares the performance of a LR and five ML models on the contribution of influencing predictors and predictor-to-event relationships on prediction model´s performance. METHODS: We used 25 baseline variables of 490 COVID-19 patients admitted to 8 hospitals in Germany (March–November 2020) to develop and validate (75/25 random-split) 3 linear (L1 and L2 penalty, elastic net [EN]) and 2 non-linear (support vector machine [SVM] with radial kernel, random forest [RF]) ML approaches for predicting critical events defined by intensive care unit transfer, invasive ventilation and/or death (composite end-point: 181 patients). Models were compared for performance (area-under-the-receiver-operating characteristic-curve [AUC], Brier score) and predictor importance (performance-loss metrics, partial-dependence profiles). RESULTS: Models performed close with a small benefit for LR (utilizing restricted cubic splines for non-linearity) and RF (AUC means: 0.763–0.731 [RF–L1]); Brier scores: 0.184–0.197 [LR–L1]). Top ranked predictor variables (consistently highest importance: C-reactive protein) were largely identical across models, except creatinine, which exhibited marginal (L1, L2, EN, SVM) or high/non-linear effects (LR, RF) on events. CONCLUSIONS: Although the LR and ML models analysed showed no strong differences in performance and the most influencing predictors for COVID-19-related event prediction, our results indicate a predictive benefit from taking account for non-linear predictor-to-event relationships and effects. Future efforts should focus on leveraging data-driven ML technologies from static towards dynamic modelling solutions that continuously learn and adapt to changes in data environments during the evolving pandemic. Trial registration number: NCT04659187. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-02057-4. BioMed Central 2022-11-28 /pmc/articles/PMC9702742/ /pubmed/36437469 http://dx.doi.org/10.1186/s12911-022-02057-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Sievering, Aaron W. Wohlmuth, Peter Geßler, Nele Gunawardene, Melanie A. Herrlinger, Klaus Bein, Berthold Arnold, Dirk Bergmann, Martin Nowak, Lorenz Gloeckner, Christian Koch, Ina Bachmann, Martin Herborn, Christoph U. Stang, Axel Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission
title	Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission
title_full	Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission
title_fullStr	Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission
title_full_unstemmed	Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission
title_short	Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission
title_sort	comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in covid-19 patients on hospital admission
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9702742/ https://www.ncbi.nlm.nih.gov/pubmed/36437469 http://dx.doi.org/10.1186/s12911-022-02057-4
work_keys_str_mv	AT sieveringaaronw comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT wohlmuthpeter comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT geßlernele comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT gunawardenemelaniea comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT herrlingerklaus comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT beinberthold comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT arnolddirk comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT bergmannmartin comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT nowaklorenz comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT gloecknerchristian comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT kochina comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT bachmannmartin comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT herbornchristophu comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission AT stangaxel comparisonofmachinelearningmethodswithlogisticregressionanalysisincreatingpredictivemodelsforriskofcriticalinhospitaleventsincovid19patientsonhospitaladmission

Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission

Ejemplares similares