Cargando…

The advanced machine learner XGBoost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study

BACKGROUND: Accurate prehospital trauma triage is crucial for identifying critically injured patients and determining the level of care. In the prehospital setting, time and data are often scarce, limiting the complexity of triage models. The aim of this study was to assess whether, compared with lo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Larsson, Anna, Berg, Johanna, Gellerfors, Mikael, Gerdin Wärnberg, Martin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2021
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8215793/ https://www.ncbi.nlm.nih.gov/pubmed/34148560 http://dx.doi.org/10.1186/s12911-021-01558-y

_version_	1783710307715121152
author	Larsson, Anna Berg, Johanna Gellerfors, Mikael Gerdin Wärnberg, Martin
author_facet	Larsson, Anna Berg, Johanna Gellerfors, Mikael Gerdin Wärnberg, Martin
author_sort	Larsson, Anna
collection	PubMed
description	BACKGROUND: Accurate prehospital trauma triage is crucial for identifying critically injured patients and determining the level of care. In the prehospital setting, time and data are often scarce, limiting the complexity of triage models. The aim of this study was to assess whether, compared with logistic regression, the advanced machine learner XGBoost (eXtreme Gradient Boosting) is associated with reduced prehospital trauma mistriage. METHODS: We conducted a simulation study based on data from the US National Trauma Data Bank (NTDB) and the Swedish Trauma Registry (SweTrau). We used categorized systolic blood pressure, respiratory rate, Glasgow Coma Scale and age as our predictors. The outcome was the difference in under- and overtriage rates between the models for different training dataset sizes. RESULTS: We used data from 813,567 patients in the NTDB and 30,577 patients in SweTrau. In SweTrau, the smallest training set of 10 events per free parameter was sufficient for model development. XGBoost achieved undertriage rates in the range of 0.314–0.324 with corresponding overtriage rates of 0.319–0.322. Logistic regression achieved undertriage rates ranging from 0.312 to 0.321 with associated overtriage rates ranging from 0.321 to 0.323. In NTDB, XGBoost required the largest training set size of 1000 events per free parameter to achieve robust results, whereas logistic regression achieved stable performance from a training set size of 25 events per free parameter. For the training set size of 1000 events per free parameter, XGBoost obtained an undertriage rate of 0.406 with an overtriage of 0.463. For logistic regression, the corresponding undertriage was 0.395 with an overtriage of 0.468. CONCLUSION: The under- and overtriage rates associated with the advanced machine learner XGBoost were similar to the rates associated with logistic regression regardless of sample size, but XGBoost required larger training sets to obtain robust results. We do not recommend using XGBoost over logistic regression in this context when predictors are few and categorical. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-021-01558-y.
format	Online Article Text
id	pubmed-8215793
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-82157932021-06-23 The advanced machine learner XGBoost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study Larsson, Anna Berg, Johanna Gellerfors, Mikael Gerdin Wärnberg, Martin BMC Med Inform Decis Mak Research BACKGROUND: Accurate prehospital trauma triage is crucial for identifying critically injured patients and determining the level of care. In the prehospital setting, time and data are often scarce, limiting the complexity of triage models. The aim of this study was to assess whether, compared with logistic regression, the advanced machine learner XGBoost (eXtreme Gradient Boosting) is associated with reduced prehospital trauma mistriage. METHODS: We conducted a simulation study based on data from the US National Trauma Data Bank (NTDB) and the Swedish Trauma Registry (SweTrau). We used categorized systolic blood pressure, respiratory rate, Glasgow Coma Scale and age as our predictors. The outcome was the difference in under- and overtriage rates between the models for different training dataset sizes. RESULTS: We used data from 813,567 patients in the NTDB and 30,577 patients in SweTrau. In SweTrau, the smallest training set of 10 events per free parameter was sufficient for model development. XGBoost achieved undertriage rates in the range of 0.314–0.324 with corresponding overtriage rates of 0.319–0.322. Logistic regression achieved undertriage rates ranging from 0.312 to 0.321 with associated overtriage rates ranging from 0.321 to 0.323. In NTDB, XGBoost required the largest training set size of 1000 events per free parameter to achieve robust results, whereas logistic regression achieved stable performance from a training set size of 25 events per free parameter. For the training set size of 1000 events per free parameter, XGBoost obtained an undertriage rate of 0.406 with an overtriage of 0.463. For logistic regression, the corresponding undertriage was 0.395 with an overtriage of 0.468. CONCLUSION: The under- and overtriage rates associated with the advanced machine learner XGBoost were similar to the rates associated with logistic regression regardless of sample size, but XGBoost required larger training sets to obtain robust results. We do not recommend using XGBoost over logistic regression in this context when predictors are few and categorical. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-021-01558-y. BioMed Central 2021-06-21 /pmc/articles/PMC8215793/ /pubmed/34148560 http://dx.doi.org/10.1186/s12911-021-01558-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Larsson, Anna Berg, Johanna Gellerfors, Mikael Gerdin Wärnberg, Martin The advanced machine learner XGBoost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study
title	The advanced machine learner XGBoost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study
title_full	The advanced machine learner XGBoost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study
title_fullStr	The advanced machine learner XGBoost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study
title_full_unstemmed	The advanced machine learner XGBoost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study
title_short	The advanced machine learner XGBoost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study
title_sort	advanced machine learner xgboost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8215793/ https://www.ncbi.nlm.nih.gov/pubmed/34148560 http://dx.doi.org/10.1186/s12911-021-01558-y
work_keys_str_mv	AT larssonanna theadvancedmachinelearnerxgboostdidnotreduceprehospitaltraumamistriagecomparedwithlogisticregressionasimulationstudy AT bergjohanna theadvancedmachinelearnerxgboostdidnotreduceprehospitaltraumamistriagecomparedwithlogisticregressionasimulationstudy AT gellerforsmikael theadvancedmachinelearnerxgboostdidnotreduceprehospitaltraumamistriagecomparedwithlogisticregressionasimulationstudy AT gerdinwarnbergmartin theadvancedmachinelearnerxgboostdidnotreduceprehospitaltraumamistriagecomparedwithlogisticregressionasimulationstudy AT larssonanna advancedmachinelearnerxgboostdidnotreduceprehospitaltraumamistriagecomparedwithlogisticregressionasimulationstudy AT bergjohanna advancedmachinelearnerxgboostdidnotreduceprehospitaltraumamistriagecomparedwithlogisticregressionasimulationstudy AT gellerforsmikael advancedmachinelearnerxgboostdidnotreduceprehospitaltraumamistriagecomparedwithlogisticregressionasimulationstudy AT gerdinwarnbergmartin advancedmachinelearnerxgboostdidnotreduceprehospitaltraumamistriagecomparedwithlogisticregressionasimulationstudy

The advanced machine learner XGBoost did not reduce prehospital trauma mistriage compared with logistic regression: a simulation study

Ejemplares similares