Cargando…
Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants
BACKGROUND: Sepsis is one of the most life-threatening circumstances for critically ill patients in the United States, while diagnosis of sepsis is challenging as a standardized criteria for sepsis identification is still under development. Disparities in social determinants of sepsis patients can i...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204861/ https://www.ncbi.nlm.nih.gov/pubmed/35710407 http://dx.doi.org/10.1186/s12911-022-01871-0 |
_version_ | 1784729010086871040 |
---|---|
author | Wang, Hanyin Li, Yikuan Naidech, Andrew Luo, Yuan |
author_facet | Wang, Hanyin Li, Yikuan Naidech, Andrew Luo, Yuan |
author_sort | Wang, Hanyin |
collection | PubMed |
description | BACKGROUND: Sepsis is one of the most life-threatening circumstances for critically ill patients in the United States, while diagnosis of sepsis is challenging as a standardized criteria for sepsis identification is still under development. Disparities in social determinants of sepsis patients can interfere with the risk prediction performances using machine learning. METHODS: We analyzed a cohort of critical care patients from the Medical Information Mart for Intensive Care (MIMIC)-III database. Disparities in social determinants, including race, sex, marital status, insurance types and languages, among patients identified by six available sepsis criteria were revealed by forest plots with 95% confidence intervals. Sepsis patients were then identified by the Sepsis-3 criteria. Sixteen machine learning classifiers were trained to predict in-hospital mortality for sepsis patients on a training set constructed by random selection. The performance was measured by area under the receiver operating characteristic curve (AUC). The performance of the trained model was tested on the entire randomly conducted test set and each sub-population built based on each of the following social determinants: race, sex, marital status, insurance type, and language. The fluctuations in performances were further examined by permutation tests. RESULTS: We analyzed a total of 11,791 critical care patients from the MIMIC-III database. Within the population identified by each sepsis identification method, significant differences were observed among sub-populations regarding race, marital status, insurance type, and language. On the 5783 sepsis patients identified by the Sepsis-3 criteria statistically significant performance decreases for mortality prediction were observed when applying the trained machine learning model on Asian and Hispanic patients, as well as the Spanish-speaking patients. With pairwise comparison, we detected performance discrepancies in mortality prediction between Asian and White patients, Asians and patients of other races, as well as English-speaking and Spanish-speaking patients. CONCLUSIONS: Disparities in proportions of patients identified by various sepsis criteria were detected among the different social determinant groups. The performances of mortality prediction for sepsis patients can be compromised when applying a universally trained model for each subpopulation. To achieve accurate diagnosis, a versatile diagnostic system for sepsis is needed to overcome the social determinant disparities of patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01871-0. |
format | Online Article Text |
id | pubmed-9204861 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-92048612022-06-18 Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants Wang, Hanyin Li, Yikuan Naidech, Andrew Luo, Yuan BMC Med Inform Decis Mak Research BACKGROUND: Sepsis is one of the most life-threatening circumstances for critically ill patients in the United States, while diagnosis of sepsis is challenging as a standardized criteria for sepsis identification is still under development. Disparities in social determinants of sepsis patients can interfere with the risk prediction performances using machine learning. METHODS: We analyzed a cohort of critical care patients from the Medical Information Mart for Intensive Care (MIMIC)-III database. Disparities in social determinants, including race, sex, marital status, insurance types and languages, among patients identified by six available sepsis criteria were revealed by forest plots with 95% confidence intervals. Sepsis patients were then identified by the Sepsis-3 criteria. Sixteen machine learning classifiers were trained to predict in-hospital mortality for sepsis patients on a training set constructed by random selection. The performance was measured by area under the receiver operating characteristic curve (AUC). The performance of the trained model was tested on the entire randomly conducted test set and each sub-population built based on each of the following social determinants: race, sex, marital status, insurance type, and language. The fluctuations in performances were further examined by permutation tests. RESULTS: We analyzed a total of 11,791 critical care patients from the MIMIC-III database. Within the population identified by each sepsis identification method, significant differences were observed among sub-populations regarding race, marital status, insurance type, and language. On the 5783 sepsis patients identified by the Sepsis-3 criteria statistically significant performance decreases for mortality prediction were observed when applying the trained machine learning model on Asian and Hispanic patients, as well as the Spanish-speaking patients. With pairwise comparison, we detected performance discrepancies in mortality prediction between Asian and White patients, Asians and patients of other races, as well as English-speaking and Spanish-speaking patients. CONCLUSIONS: Disparities in proportions of patients identified by various sepsis criteria were detected among the different social determinant groups. The performances of mortality prediction for sepsis patients can be compromised when applying a universally trained model for each subpopulation. To achieve accurate diagnosis, a versatile diagnostic system for sepsis is needed to overcome the social determinant disparities of patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01871-0. BioMed Central 2022-06-16 /pmc/articles/PMC9204861/ /pubmed/35710407 http://dx.doi.org/10.1186/s12911-022-01871-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Wang, Hanyin Li, Yikuan Naidech, Andrew Luo, Yuan Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants |
title | Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants |
title_full | Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants |
title_fullStr | Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants |
title_full_unstemmed | Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants |
title_short | Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants |
title_sort | comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204861/ https://www.ncbi.nlm.nih.gov/pubmed/35710407 http://dx.doi.org/10.1186/s12911-022-01871-0 |
work_keys_str_mv | AT wanghanyin comparisonbetweenmachinelearningmethodsformortalitypredictionforsepsispatientswithdifferentsocialdeterminants AT liyikuan comparisonbetweenmachinelearningmethodsformortalitypredictionforsepsispatientswithdifferentsocialdeterminants AT naidechandrew comparisonbetweenmachinelearningmethodsformortalitypredictionforsepsispatientswithdifferentsocialdeterminants AT luoyuan comparisonbetweenmachinelearningmethodsformortalitypredictionforsepsispatientswithdifferentsocialdeterminants |