Cargando…
Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models
The triage process in emergency departments (EDs) relies on the subjective assessment of medical practitioners, making it unreliable in certain aspects. There is a need for a more accurate and objective algorithm to determine the urgency of patients. This paper explores the application of advanced d...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10497596/ https://www.ncbi.nlm.nih.gov/pubmed/37699933 http://dx.doi.org/10.1038/s41598-023-41544-0 |
_version_ | 1785105338744176640 |
---|---|
author | Son, Byounghoon Myung, Jinwoo Shin, Younghwan Kim, Sangdo Kim, Sung Hyun Chung, Jong-Moon Noh, Jiyoung Cho, Junho Chung, Hyun Soo |
author_facet | Son, Byounghoon Myung, Jinwoo Shin, Younghwan Kim, Sangdo Kim, Sung Hyun Chung, Jong-Moon Noh, Jiyoung Cho, Junho Chung, Hyun Soo |
author_sort | Son, Byounghoon |
collection | PubMed |
description | The triage process in emergency departments (EDs) relies on the subjective assessment of medical practitioners, making it unreliable in certain aspects. There is a need for a more accurate and objective algorithm to determine the urgency of patients. This paper explores the application of advanced data-synthesis algorithms, machine learning (ML) algorithms, and ensemble models to predict patient mortality. Patients predicted to be at risk of mortality are in a highly critical condition, signifying an urgent need for immediate medical intervention. This paper aims to determine the most effective method for predicting mortality by enhancing the F1 score while maintaining high area under the receiver operating characteristic curve (AUC) score. This study used a dataset of 7325 patients who visited the Yonsei Severance Hospital’s ED, located in Seoul, South Korea. The patients were divided into two groups: patients who deceased in the ED and patients who didn’t. Various data-synthesis techniques, such as SMOTE, ADASYN, CTGAN, TVAE, CopulaGAN, and Gaussian Copula, were deployed to generate synthetic patient data. Twenty two ML models were then utilized, including tree-based algorithms like Decision tree, AdaBoost, LightGBM, CatBoost, XGBoost, NGBoost, TabNet, which are deep neural network algorithms, and statistical algorithms such as Support Vector Machine, Logistic Regression, Random Forest, k-nearest neighbors, and Gaussian Naive Bayes, as well as Ensemble Models which use the results from the ML models. Based on 21 patient information features used in the pandemic influenza triage algorithm (PITA), the models explained previously were applied to aim for the prediction of patient mortality. In evaluating ML algorithms using an imbalanced medical dataset, conventional metrics like accuracy scores or AUC can be misleading. This paper emphasizes the importance of using the F1 score as the primary performance measure, focusing on recall and specificity in detecting patient mortality. The highest-ranked model for predicting mortality utilized the Gaussian Copula data-synthesis technique and the CatBoost classifier, achieving an AUC of 0.9731 and an F1 score of 0.7059. These findings highlight the effectiveness of machine learning algorithms and data-synthesis techniques in improving the prediction performance of mortality in EDs. |
format | Online Article Text |
id | pubmed-10497596 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-104975962023-09-14 Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models Son, Byounghoon Myung, Jinwoo Shin, Younghwan Kim, Sangdo Kim, Sung Hyun Chung, Jong-Moon Noh, Jiyoung Cho, Junho Chung, Hyun Soo Sci Rep Article The triage process in emergency departments (EDs) relies on the subjective assessment of medical practitioners, making it unreliable in certain aspects. There is a need for a more accurate and objective algorithm to determine the urgency of patients. This paper explores the application of advanced data-synthesis algorithms, machine learning (ML) algorithms, and ensemble models to predict patient mortality. Patients predicted to be at risk of mortality are in a highly critical condition, signifying an urgent need for immediate medical intervention. This paper aims to determine the most effective method for predicting mortality by enhancing the F1 score while maintaining high area under the receiver operating characteristic curve (AUC) score. This study used a dataset of 7325 patients who visited the Yonsei Severance Hospital’s ED, located in Seoul, South Korea. The patients were divided into two groups: patients who deceased in the ED and patients who didn’t. Various data-synthesis techniques, such as SMOTE, ADASYN, CTGAN, TVAE, CopulaGAN, and Gaussian Copula, were deployed to generate synthetic patient data. Twenty two ML models were then utilized, including tree-based algorithms like Decision tree, AdaBoost, LightGBM, CatBoost, XGBoost, NGBoost, TabNet, which are deep neural network algorithms, and statistical algorithms such as Support Vector Machine, Logistic Regression, Random Forest, k-nearest neighbors, and Gaussian Naive Bayes, as well as Ensemble Models which use the results from the ML models. Based on 21 patient information features used in the pandemic influenza triage algorithm (PITA), the models explained previously were applied to aim for the prediction of patient mortality. In evaluating ML algorithms using an imbalanced medical dataset, conventional metrics like accuracy scores or AUC can be misleading. This paper emphasizes the importance of using the F1 score as the primary performance measure, focusing on recall and specificity in detecting patient mortality. The highest-ranked model for predicting mortality utilized the Gaussian Copula data-synthesis technique and the CatBoost classifier, achieving an AUC of 0.9731 and an F1 score of 0.7059. These findings highlight the effectiveness of machine learning algorithms and data-synthesis techniques in improving the prediction performance of mortality in EDs. Nature Publishing Group UK 2023-09-12 /pmc/articles/PMC10497596/ /pubmed/37699933 http://dx.doi.org/10.1038/s41598-023-41544-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Son, Byounghoon Myung, Jinwoo Shin, Younghwan Kim, Sangdo Kim, Sung Hyun Chung, Jong-Moon Noh, Jiyoung Cho, Junho Chung, Hyun Soo Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models |
title | Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models |
title_full | Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models |
title_fullStr | Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models |
title_full_unstemmed | Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models |
title_short | Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models |
title_sort | improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10497596/ https://www.ncbi.nlm.nih.gov/pubmed/37699933 http://dx.doi.org/10.1038/s41598-023-41544-0 |
work_keys_str_mv | AT sonbyounghoon improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels AT myungjinwoo improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels AT shinyounghwan improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels AT kimsangdo improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels AT kimsunghyun improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels AT chungjongmoon improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels AT nohjiyoung improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels AT chojunho improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels AT chunghyunsoo improvedpatientmortalitypredictionsinemergencydepartmentswithdeeplearningdatasynthesisandensemblemodels |