Cargando…

Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU

BACKGROUND: Early and accurate identification of sepsis patients with high risk of in-hospital death can help physicians in intensive care units (ICUs) make optimal clinical decisions. This study aimed to develop machine learning-based tools to predict the risk of hospital death of patients with sep...

Descripción completa

Detalles Bibliográficos
Autores principales: Kong, Guilan, Lin, Ke, Hu, Yonghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7531110/
https://www.ncbi.nlm.nih.gov/pubmed/33008381
http://dx.doi.org/10.1186/s12911-020-01271-2
_version_ 1783589699534716928
author Kong, Guilan
Lin, Ke
Hu, Yonghua
author_facet Kong, Guilan
Lin, Ke
Hu, Yonghua
author_sort Kong, Guilan
collection PubMed
description BACKGROUND: Early and accurate identification of sepsis patients with high risk of in-hospital death can help physicians in intensive care units (ICUs) make optimal clinical decisions. This study aimed to develop machine learning-based tools to predict the risk of hospital death of patients with sepsis in ICUs. METHODS: The source database used for model development and validation is the medical information mart for intensive care (MIMIC) III. We identified adult sepsis patients using the new sepsis definition Sepsis-3. A total of 86 predictor variables consisting of demographics, laboratory tests and comorbidities were used. We employed the least absolute shrinkage and selection operator (LASSO), random forest (RF), gradient boosting machine (GBM) and the traditional logistic regression (LR) method to develop prediction models. In addition, the prediction performance of the four developed models was evaluated and compared with that of an existent scoring tool – simplified acute physiology score (SAPS) II – using five different performance measures: the area under the receiver operating characteristic curve (AUROC), Brier score, sensitivity, specificity and calibration plot. RESULTS: The records of 16,688 sepsis patients in MIMIC III were used for model training and test. Amongst them, 2949 (17.7%) patients had in-hospital death. The average AUROCs of the LASSO, RF, GBM, LR and SAPS II models were 0.829, 0.829, 0.845, 0.833 and 0.77, respectively. The Brier scores of the LASSO, RF, GBM, LR and SAPS II models were 0.108, 0.109, 0.104, 0.107 and 0.146, respectively. The calibration plots showed that the GBM, LASSO and LR models had good calibration; the RF model underestimated high-risk patients; and SAPS II had the poorest calibration. CONCLUSION: The machine learning-based models developed in this study had good prediction performance. Amongst them, the GBM model showed the best performance in predicting the risk of in-hospital death. It has the potential to assist physicians in the ICU to perform appropriate clinical interventions for critically ill sepsis patients and thus may help improve the prognoses of sepsis patients in the ICU.
format Online
Article
Text
id pubmed-7531110
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75311102020-10-05 Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU Kong, Guilan Lin, Ke Hu, Yonghua BMC Med Inform Decis Mak Research Article BACKGROUND: Early and accurate identification of sepsis patients with high risk of in-hospital death can help physicians in intensive care units (ICUs) make optimal clinical decisions. This study aimed to develop machine learning-based tools to predict the risk of hospital death of patients with sepsis in ICUs. METHODS: The source database used for model development and validation is the medical information mart for intensive care (MIMIC) III. We identified adult sepsis patients using the new sepsis definition Sepsis-3. A total of 86 predictor variables consisting of demographics, laboratory tests and comorbidities were used. We employed the least absolute shrinkage and selection operator (LASSO), random forest (RF), gradient boosting machine (GBM) and the traditional logistic regression (LR) method to develop prediction models. In addition, the prediction performance of the four developed models was evaluated and compared with that of an existent scoring tool – simplified acute physiology score (SAPS) II – using five different performance measures: the area under the receiver operating characteristic curve (AUROC), Brier score, sensitivity, specificity and calibration plot. RESULTS: The records of 16,688 sepsis patients in MIMIC III were used for model training and test. Amongst them, 2949 (17.7%) patients had in-hospital death. The average AUROCs of the LASSO, RF, GBM, LR and SAPS II models were 0.829, 0.829, 0.845, 0.833 and 0.77, respectively. The Brier scores of the LASSO, RF, GBM, LR and SAPS II models were 0.108, 0.109, 0.104, 0.107 and 0.146, respectively. The calibration plots showed that the GBM, LASSO and LR models had good calibration; the RF model underestimated high-risk patients; and SAPS II had the poorest calibration. CONCLUSION: The machine learning-based models developed in this study had good prediction performance. Amongst them, the GBM model showed the best performance in predicting the risk of in-hospital death. It has the potential to assist physicians in the ICU to perform appropriate clinical interventions for critically ill sepsis patients and thus may help improve the prognoses of sepsis patients in the ICU. BioMed Central 2020-10-02 /pmc/articles/PMC7531110/ /pubmed/33008381 http://dx.doi.org/10.1186/s12911-020-01271-2 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Kong, Guilan
Lin, Ke
Hu, Yonghua
Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU
title Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU
title_full Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU
title_fullStr Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU
title_full_unstemmed Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU
title_short Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU
title_sort using machine learning methods to predict in-hospital mortality of sepsis patients in the icu
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7531110/
https://www.ncbi.nlm.nih.gov/pubmed/33008381
http://dx.doi.org/10.1186/s12911-020-01271-2
work_keys_str_mv AT kongguilan usingmachinelearningmethodstopredictinhospitalmortalityofsepsispatientsintheicu
AT linke usingmachinelearningmethodstopredictinhospitalmortalityofsepsispatientsintheicu
AT huyonghua usingmachinelearningmethodstopredictinhospitalmortalityofsepsispatientsintheicu