Cargando…

Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models

The triage process in emergency departments (EDs) relies on the subjective assessment of medical practitioners, making it unreliable in certain aspects. There is a need for a more accurate and objective algorithm to determine the urgency of patients. This paper explores the application of advanced d...

Descripción completa

Detalles Bibliográficos
Autores principales: Son, Byounghoon, Myung, Jinwoo, Shin, Younghwan, Kim, Sangdo, Kim, Sung Hyun, Chung, Jong-Moon, Noh, Jiyoung, Cho, Junho, Chung, Hyun Soo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10497596/
https://www.ncbi.nlm.nih.gov/pubmed/37699933
http://dx.doi.org/10.1038/s41598-023-41544-0
_version_ 1785105338744176640
author Son, Byounghoon
Myung, Jinwoo
Shin, Younghwan
Kim, Sangdo
Kim, Sung Hyun
Chung, Jong-Moon
Noh, Jiyoung
Cho, Junho
Chung, Hyun Soo
author_facet Son, Byounghoon
Myung, Jinwoo
Shin, Younghwan
Kim, Sangdo
Kim, Sung Hyun
Chung, Jong-Moon
Noh, Jiyoung
Cho, Junho
Chung, Hyun Soo
author_sort Son, Byounghoon
collection PubMed
description The triage process in emergency departments (EDs) relies on the subjective assessment of medical practitioners, making it unreliable in certain aspects. There is a need for a more accurate and objective algorithm to determine the urgency of patients. This paper explores the application of advanced data-synthesis algorithms, machine learning (ML) algorithms, and ensemble models to predict patient mortality. Patients predicted to be at risk of mortality are in a highly critical condition, signifying an urgent need for immediate medical intervention. This paper aims to determine the most effective method for predicting mortality by enhancing the F1 score while maintaining high area under the receiver operating characteristic curve (AUC) score. This study used a dataset of 7325 patients who visited the Yonsei Severance Hospital’s ED, located in Seoul, South Korea. The patients were divided into two groups: patients who deceased in the ED and patients who didn’t. Various data-synthesis techniques, such as SMOTE, ADASYN, CTGAN, TVAE, CopulaGAN, and Gaussian Copula, were deployed to generate synthetic patient data. Twenty two ML models were then utilized, including tree-based algorithms like Decision tree, AdaBoost, LightGBM, CatBoost, XGBoost, NGBoost, TabNet, which are deep neural network algorithms, and statistical algorithms such as Support Vector Machine, Logistic Regression, Random Forest, k-nearest neighbors, and Gaussian Naive Bayes, as well as Ensemble Models which use the results from the ML models. Based on 21 patient information features used in the pandemic influenza triage algorithm (PITA), the models explained previously were applied to aim for the prediction of patient mortality. In evaluating ML algorithms using an imbalanced medical dataset, conventional metrics like accuracy scores or AUC can be misleading. This paper emphasizes the importance of using the F1 score as the primary performance measure, focusing on recall and specificity in detecting patient mortality. The highest-ranked model for predicting mortality utilized the Gaussian Copula data-synthesis technique and the CatBoost classifier, achieving an AUC of 0.9731 and an F1 score of 0.7059. These findings highlight the effectiveness of machine learning algorithms and data-synthesis techniques in improving the prediction performance of mortality in EDs.
format Online
Article
Text
id pubmed-10497596
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-104975962023-09-14 Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models Son, Byounghoon Myung, Jinwoo Shin, Younghwan Kim, Sangdo Kim, Sung Hyun Chung, Jong-Moon Noh, Jiyoung Cho, Junho Chung, Hyun Soo Sci Rep Article The triage process in emergency departments (EDs) relies on the subjective assessment of medical practitioners, making it unreliable in certain aspects. There is a need for a more accurate and objective algorithm to determine the urgency of patients. This paper explores the application of advanced data-synthesis algorithms, machine learning (ML) algorithms, and ensemble models to predict patient mortality. Patients predicted to be at risk of mortality are in a highly critical condition, signifying an urgent need for immediate medical intervention. This paper aims to determine the most effective method for predicting mortality by enhancing the F1 score while maintaining high area under the receiver operating characteristic curve (AUC) score. This study used a dataset of 7325 patients who visited the Yonsei Severance Hospital’s ED, located in Seoul, South Korea. The patients were divided into two groups: patients who deceased in the ED and patients who didn’t. Various data-synthesis techniques, such as SMOTE, ADASYN, CTGAN, TVAE, CopulaGAN, and Gaussian Copula, were deployed to generate synthetic patient data. Twenty two ML models were then utilized, including tree-based algorithms like Decision tree, AdaBoost, LightGBM, CatBoost, XGBoost, NGBoost, TabNet, which are deep neural network algorithms, and statistical algorithms such as Support Vector Machine, Logistic Regression, Random Forest, k-nearest neighbors, and Gaussian Naive Bayes, as well as Ensemble Models which use the results from the ML models. Based on 21 patient information features used in the pandemic influenza triage algorithm (PITA), the models explained previously were applied to aim for the prediction of patient mortality. In evaluating ML algorithms using an imbalanced medical dataset, conventional metrics like accuracy scores or AUC can be misleading. This paper emphasizes the importance of using the F1 score as the primary performance measure, focusing on recall and specificity in detecting patient mortality. The highest-ranked model for predicting mortality utilized the Gaussian Copula data-synthesis technique and the CatBoost classifier, achieving an AUC of 0.9731 and an F1 score of 0.7059. These findings highlight the effectiveness of machine learning algorithms and data-synthesis techniques in improving the prediction performance of mortality in EDs. Nature Publishing Group UK 2023-09-12 /pmc/articles/PMC10497596/ /pubmed/37699933 http://dx.doi.org/10.1038/s41598-023-41544-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Son, Byounghoon
Myung, Jinwoo
Shin, Younghwan
Kim, Sangdo
Kim, Sung Hyun
Chung, Jong-Moon
Noh, Jiyoung
Cho, Junho
Chung, Hyun Soo
Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models
title Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models
title_full Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models
title_fullStr Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models
title_full_unstemmed Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models
title_short Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models
title_sort improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10497596/
https://www.ncbi.nlm.nih.gov/pubmed/37699933
http://dx.doi.org/10.1038/s41598-023-41544-0
work_keys_str_mv AT sonbyounghoon improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels
AT myungjinwoo improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels
AT shinyounghwan improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels
AT kimsangdo improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels
AT kimsunghyun improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels
AT chungjongmoon improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels
AT nohjiyoung improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels
AT chojunho improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels
AT chunghyunsoo improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels