Cargando…

Predicting Sepsis Mortality in a Population-Based National Database: Machine Learning Approach

BACKGROUND: Although machine learning (ML) algorithms have been applied to point-of-care sepsis prognostication, ML has not been used to predict sepsis mortality in an administrative database. Therefore, we examined the performance of common ML algorithms in predicting sepsis mortality in adult pati...

Descripción completa

Detalles Bibliográficos
Autores principales:	Park, James Yeongjun, Hsu, Tzu-Chun, Hu, Jiun-Ruey, Chen, Chun-Yuan, Hsu, Wan-Ting, Lee, Matthew, Ho, Joshua, Lee, Chien-Chang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2022
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9047761/ https://www.ncbi.nlm.nih.gov/pubmed/35416785 http://dx.doi.org/10.2196/29982

_version_	1784695791405760512
author	Park, James Yeongjun Hsu, Tzu-Chun Hu, Jiun-Ruey Chen, Chun-Yuan Hsu, Wan-Ting Lee, Matthew Ho, Joshua Lee, Chien-Chang
author_facet	Park, James Yeongjun Hsu, Tzu-Chun Hu, Jiun-Ruey Chen, Chun-Yuan Hsu, Wan-Ting Lee, Matthew Ho, Joshua Lee, Chien-Chang
author_sort	Park, James Yeongjun
collection	PubMed
description	BACKGROUND: Although machine learning (ML) algorithms have been applied to point-of-care sepsis prognostication, ML has not been used to predict sepsis mortality in an administrative database. Therefore, we examined the performance of common ML algorithms in predicting sepsis mortality in adult patients with sepsis and compared it with that of the conventional context knowledge–based logistic regression approach. OBJECTIVE: The aim of this study is to examine the performance of common ML algorithms in predicting sepsis mortality in adult patients with sepsis and compare it with that of the conventional context knowledge–based logistic regression approach. METHODS: We examined inpatient admissions for sepsis in the US National Inpatient Sample using hospitalizations in 2010-2013 as the training data set. We developed four ML models to predict in-hospital mortality: logistic regression with least absolute shrinkage and selection operator regularization, random forest, gradient-boosted decision tree, and deep neural network. To estimate their performance, we compared our models with the Super Learner model. Using hospitalizations in 2014 as the testing data set, we examined the models’ area under the receiver operating characteristic curve (AUC), confusion matrix results, and net reclassification improvement. RESULTS: Hospitalizations of 923,759 adults were included in the analysis. Compared with the reference logistic regression (AUC: 0.786, 95% CI 0.783-0.788), all ML models showed superior discriminative ability (P<.001), including logistic regression with least absolute shrinkage and selection operator regularization (AUC: 0.878, 95% CI 0.876-0.879), random forest (AUC: 0.878, 95% CI 0.877-0.880), xgboost (AUC: 0.888, 95% CI 0.886-0.889), and neural network (AUC: 0.893, 95% CI 0.891-0.895). All 4 ML models showed higher sensitivity, specificity, positive predictive value, and negative predictive value compared with the reference logistic regression model (P<.001). We obtained similar results from the Super Learner model (AUC: 0.883, 95% CI 0.881-0.885). CONCLUSIONS: ML approaches can improve sensitivity, specificity, positive predictive value, negative predictive value, discrimination, and calibration in predicting in-hospital mortality in patients hospitalized with sepsis in the United States. These models need further validation and could be applied to develop more accurate models to compare risk-standardized mortality rates across hospitals and geographic regions, paving the way for research and policy initiatives studying disparities in sepsis care.
format	Online Article Text
id	pubmed-9047761
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-90477612022-04-29 Predicting Sepsis Mortality in a Population-Based National Database: Machine Learning Approach Park, James Yeongjun Hsu, Tzu-Chun Hu, Jiun-Ruey Chen, Chun-Yuan Hsu, Wan-Ting Lee, Matthew Ho, Joshua Lee, Chien-Chang J Med Internet Res Original Paper BACKGROUND: Although machine learning (ML) algorithms have been applied to point-of-care sepsis prognostication, ML has not been used to predict sepsis mortality in an administrative database. Therefore, we examined the performance of common ML algorithms in predicting sepsis mortality in adult patients with sepsis and compared it with that of the conventional context knowledge–based logistic regression approach. OBJECTIVE: The aim of this study is to examine the performance of common ML algorithms in predicting sepsis mortality in adult patients with sepsis and compare it with that of the conventional context knowledge–based logistic regression approach. METHODS: We examined inpatient admissions for sepsis in the US National Inpatient Sample using hospitalizations in 2010-2013 as the training data set. We developed four ML models to predict in-hospital mortality: logistic regression with least absolute shrinkage and selection operator regularization, random forest, gradient-boosted decision tree, and deep neural network. To estimate their performance, we compared our models with the Super Learner model. Using hospitalizations in 2014 as the testing data set, we examined the models’ area under the receiver operating characteristic curve (AUC), confusion matrix results, and net reclassification improvement. RESULTS: Hospitalizations of 923,759 adults were included in the analysis. Compared with the reference logistic regression (AUC: 0.786, 95% CI 0.783-0.788), all ML models showed superior discriminative ability (P<.001), including logistic regression with least absolute shrinkage and selection operator regularization (AUC: 0.878, 95% CI 0.876-0.879), random forest (AUC: 0.878, 95% CI 0.877-0.880), xgboost (AUC: 0.888, 95% CI 0.886-0.889), and neural network (AUC: 0.893, 95% CI 0.891-0.895). All 4 ML models showed higher sensitivity, specificity, positive predictive value, and negative predictive value compared with the reference logistic regression model (P<.001). We obtained similar results from the Super Learner model (AUC: 0.883, 95% CI 0.881-0.885). CONCLUSIONS: ML approaches can improve sensitivity, specificity, positive predictive value, negative predictive value, discrimination, and calibration in predicting in-hospital mortality in patients hospitalized with sepsis in the United States. These models need further validation and could be applied to develop more accurate models to compare risk-standardized mortality rates across hospitals and geographic regions, paving the way for research and policy initiatives studying disparities in sepsis care. JMIR Publications 2022-04-13 /pmc/articles/PMC9047761/ /pubmed/35416785 http://dx.doi.org/10.2196/29982 Text en ©James Yeongjun Park, Tzu-Chun Hsu, Jiun-Ruey Hu, Chun-Yuan Chen, Wan-Ting Hsu, Matthew Lee, Joshua Ho, Chien-Chang Lee. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 13.04.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Park, James Yeongjun Hsu, Tzu-Chun Hu, Jiun-Ruey Chen, Chun-Yuan Hsu, Wan-Ting Lee, Matthew Ho, Joshua Lee, Chien-Chang Predicting Sepsis Mortality in a Population-Based National Database: Machine Learning Approach
title	Predicting Sepsis Mortality in a Population-Based National Database: Machine Learning Approach
title_full	Predicting Sepsis Mortality in a Population-Based National Database: Machine Learning Approach
title_fullStr	Predicting Sepsis Mortality in a Population-Based National Database: Machine Learning Approach
title_full_unstemmed	Predicting Sepsis Mortality in a Population-Based National Database: Machine Learning Approach
title_short	Predicting Sepsis Mortality in a Population-Based National Database: Machine Learning Approach
title_sort	predicting sepsis mortality in a population-based national database: machine learning approach
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9047761/ https://www.ncbi.nlm.nih.gov/pubmed/35416785 http://dx.doi.org/10.2196/29982
work_keys_str_mv	AT parkjamesyeongjun predictingsepsismortalityinapopulationbasednationaldatabasemachinelearningapproach AT hsutzuchun predictingsepsismortalityinapopulationbasednationaldatabasemachinelearningapproach AT hujiunruey predictingsepsismortalityinapopulationbasednationaldatabasemachinelearningapproach AT chenchunyuan predictingsepsismortalityinapopulationbasednationaldatabasemachinelearningapproach AT hsuwanting predictingsepsismortalityinapopulationbasednationaldatabasemachinelearningapproach AT leematthew predictingsepsismortalityinapopulationbasednationaldatabasemachinelearningapproach AT hojoshua predictingsepsismortalityinapopulationbasednationaldatabasemachinelearningapproach AT leechienchang predictingsepsismortalityinapopulationbasednationaldatabasemachinelearningapproach

Predicting Sepsis Mortality in a Population-Based National Database: Machine Learning Approach

Ejemplares similares