Cargando…

Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants

BACKGROUND: Sepsis is one of the most life-threatening circumstances for critically ill patients in the United States, while diagnosis of sepsis is challenging as a standardized criteria for sepsis identification is still under development. Disparities in social determinants of sepsis patients can i...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Hanyin, Li, Yikuan, Naidech, Andrew, Luo, Yuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204861/
https://www.ncbi.nlm.nih.gov/pubmed/35710407
http://dx.doi.org/10.1186/s12911-022-01871-0
_version_ 1784729010086871040
author Wang, Hanyin
Li, Yikuan
Naidech, Andrew
Luo, Yuan
author_facet Wang, Hanyin
Li, Yikuan
Naidech, Andrew
Luo, Yuan
author_sort Wang, Hanyin
collection PubMed
description BACKGROUND: Sepsis is one of the most life-threatening circumstances for critically ill patients in the United States, while diagnosis of sepsis is challenging as a standardized criteria for sepsis identification is still under development. Disparities in social determinants of sepsis patients can interfere with the risk prediction performances using machine learning. METHODS: We analyzed a cohort of critical care patients from the Medical Information Mart for Intensive Care (MIMIC)-III database. Disparities in social determinants, including race, sex, marital status, insurance types and languages, among patients identified by six available sepsis criteria were revealed by forest plots with 95% confidence intervals. Sepsis patients were then identified by the Sepsis-3 criteria. Sixteen machine learning classifiers were trained to predict in-hospital mortality for sepsis patients on a training set constructed by random selection. The performance was measured by area under the receiver operating characteristic curve (AUC). The performance of the trained model was tested on the entire randomly conducted test set and each sub-population built based on each of the following social determinants: race, sex, marital status, insurance type, and language. The fluctuations in performances were further examined by permutation tests. RESULTS: We analyzed a total of 11,791 critical care patients from the MIMIC-III database. Within the population identified by each sepsis identification method, significant differences were observed among sub-populations regarding race, marital status, insurance type, and language. On the 5783 sepsis patients identified by the Sepsis-3 criteria statistically significant performance decreases for mortality prediction were observed when applying the trained machine learning model on Asian and Hispanic patients, as well as the Spanish-speaking patients. With pairwise comparison, we detected performance discrepancies in mortality prediction between Asian and White patients, Asians and patients of other races, as well as English-speaking and Spanish-speaking patients. CONCLUSIONS: Disparities in proportions of patients identified by various sepsis criteria were detected among the different social determinant groups. The performances of mortality prediction for sepsis patients can be compromised when applying a universally trained model for each subpopulation. To achieve accurate diagnosis, a versatile diagnostic system for sepsis is needed to overcome the social determinant disparities of patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01871-0.
format Online
Article
Text
id pubmed-9204861
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-92048612022-06-18 Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants Wang, Hanyin Li, Yikuan Naidech, Andrew Luo, Yuan BMC Med Inform Decis Mak Research BACKGROUND: Sepsis is one of the most life-threatening circumstances for critically ill patients in the United States, while diagnosis of sepsis is challenging as a standardized criteria for sepsis identification is still under development. Disparities in social determinants of sepsis patients can interfere with the risk prediction performances using machine learning. METHODS: We analyzed a cohort of critical care patients from the Medical Information Mart for Intensive Care (MIMIC)-III database. Disparities in social determinants, including race, sex, marital status, insurance types and languages, among patients identified by six available sepsis criteria were revealed by forest plots with 95% confidence intervals. Sepsis patients were then identified by the Sepsis-3 criteria. Sixteen machine learning classifiers were trained to predict in-hospital mortality for sepsis patients on a training set constructed by random selection. The performance was measured by area under the receiver operating characteristic curve (AUC). The performance of the trained model was tested on the entire randomly conducted test set and each sub-population built based on each of the following social determinants: race, sex, marital status, insurance type, and language. The fluctuations in performances were further examined by permutation tests. RESULTS: We analyzed a total of 11,791 critical care patients from the MIMIC-III database. Within the population identified by each sepsis identification method, significant differences were observed among sub-populations regarding race, marital status, insurance type, and language. On the 5783 sepsis patients identified by the Sepsis-3 criteria statistically significant performance decreases for mortality prediction were observed when applying the trained machine learning model on Asian and Hispanic patients, as well as the Spanish-speaking patients. With pairwise comparison, we detected performance discrepancies in mortality prediction between Asian and White patients, Asians and patients of other races, as well as English-speaking and Spanish-speaking patients. CONCLUSIONS: Disparities in proportions of patients identified by various sepsis criteria were detected among the different social determinant groups. The performances of mortality prediction for sepsis patients can be compromised when applying a universally trained model for each subpopulation. To achieve accurate diagnosis, a versatile diagnostic system for sepsis is needed to overcome the social determinant disparities of patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01871-0. BioMed Central 2022-06-16 /pmc/articles/PMC9204861/ /pubmed/35710407 http://dx.doi.org/10.1186/s12911-022-01871-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Wang, Hanyin
Li, Yikuan
Naidech, Andrew
Luo, Yuan
Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants
title Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants
title_full Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants
title_fullStr Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants
title_full_unstemmed Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants
title_short Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants
title_sort comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204861/
https://www.ncbi.nlm.nih.gov/pubmed/35710407
http://dx.doi.org/10.1186/s12911-022-01871-0
work_keys_str_mv AT wanghanyin comparisonbetweenmachinelearningmethodsformortalitypredictionforsepsispatientswithdifferentsocialdeterminants
AT liyikuan comparisonbetweenmachinelearningmethodsformortalitypredictionforsepsispatientswithdifferentsocialdeterminants
AT naidechandrew comparisonbetweenmachinelearningmethodsformortalitypredictionforsepsispatientswithdifferentsocialdeterminants
AT luoyuan comparisonbetweenmachinelearningmethodsformortalitypredictionforsepsispatientswithdifferentsocialdeterminants