Cargando…
Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset
OBJECTIVES: Machine learning has been widely used to predict diseases, and it is used to derive impressive knowledge in the healthcare domain. Our objective was to predict in-hospital mortality from hospital-acquired infections in trauma patients on an unbalanced dataset. METHODS: Our study was a cr...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Korean Society of Medical Informatics
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7674815/ https://www.ncbi.nlm.nih.gov/pubmed/33190462 http://dx.doi.org/10.4258/hir.2020.26.4.284 |
_version_ | 1783611587428352000 |
---|---|
author | Karajizadeh, Mehrdad Nasiri, Mahdi Yadollahi, Mahnaz Zolfaghari, Amir Hussain Pakdam, Ali |
author_facet | Karajizadeh, Mehrdad Nasiri, Mahdi Yadollahi, Mahnaz Zolfaghari, Amir Hussain Pakdam, Ali |
author_sort | Karajizadeh, Mehrdad |
collection | PubMed |
description | OBJECTIVES: Machine learning has been widely used to predict diseases, and it is used to derive impressive knowledge in the healthcare domain. Our objective was to predict in-hospital mortality from hospital-acquired infections in trauma patients on an unbalanced dataset. METHODS: Our study was a cross-sectional analysis on trauma patients with hospital-acquired infections who were admitted to Shiraz Trauma Hospital from March 20, 2017, to March 21, 2018. The study data was obtained from the surveillance hospital infection database. The data included sex, age, mechanism of injury, body region injured, severity score, type of intervention, infection day after admission, and microorganism causes of infections. We developed our mortality prediction model by random under-sampling, random over-sampling, clustering (k-mean)-C5.0, SMOTE-C5.0, ADASYN-C5.5, SMOTE-SVM, ADASYN-SVM, SMOTE-ANN, and ADASYN-ANN among hospital-acquired infections in trauma patients. All mortality predictions were conducted by IBM SPSS Modeler 18. RESULTS: We studied 549 individuals with hospital-acquired infections in a trauma hospital in Shiraz during 2017 and 2018. Prediction accuracy before balancing of the dataset was 86.16%. In contrast, the prediction accuracy for the balanced dataset achieved by random under-sampling, random over-sampling, clustering (k-mean)-C5.0, SMOTE-C5.0, ADASYN-C5.5, and SMOTE-SVM was 70.69%, 94.74%, 93.02%, 93.66%, 90.93%, and 100%, respectively. CONCLUSIONS: Our findings demonstrate that cleaning an unbalanced dataset increases the accuracy of the classification model. Also, predicting mortality by a clustered under-sampling approach was more precise in comparison to random under-sampling and random over-sampling methods. |
format | Online Article Text |
id | pubmed-7674815 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Korean Society of Medical Informatics |
record_format | MEDLINE/PubMed |
spelling | pubmed-76748152020-11-19 Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset Karajizadeh, Mehrdad Nasiri, Mahdi Yadollahi, Mahnaz Zolfaghari, Amir Hussain Pakdam, Ali Healthc Inform Res Original Article OBJECTIVES: Machine learning has been widely used to predict diseases, and it is used to derive impressive knowledge in the healthcare domain. Our objective was to predict in-hospital mortality from hospital-acquired infections in trauma patients on an unbalanced dataset. METHODS: Our study was a cross-sectional analysis on trauma patients with hospital-acquired infections who were admitted to Shiraz Trauma Hospital from March 20, 2017, to March 21, 2018. The study data was obtained from the surveillance hospital infection database. The data included sex, age, mechanism of injury, body region injured, severity score, type of intervention, infection day after admission, and microorganism causes of infections. We developed our mortality prediction model by random under-sampling, random over-sampling, clustering (k-mean)-C5.0, SMOTE-C5.0, ADASYN-C5.5, SMOTE-SVM, ADASYN-SVM, SMOTE-ANN, and ADASYN-ANN among hospital-acquired infections in trauma patients. All mortality predictions were conducted by IBM SPSS Modeler 18. RESULTS: We studied 549 individuals with hospital-acquired infections in a trauma hospital in Shiraz during 2017 and 2018. Prediction accuracy before balancing of the dataset was 86.16%. In contrast, the prediction accuracy for the balanced dataset achieved by random under-sampling, random over-sampling, clustering (k-mean)-C5.0, SMOTE-C5.0, ADASYN-C5.5, and SMOTE-SVM was 70.69%, 94.74%, 93.02%, 93.66%, 90.93%, and 100%, respectively. CONCLUSIONS: Our findings demonstrate that cleaning an unbalanced dataset increases the accuracy of the classification model. Also, predicting mortality by a clustered under-sampling approach was more precise in comparison to random under-sampling and random over-sampling methods. Korean Society of Medical Informatics 2020-10 2020-10-31 /pmc/articles/PMC7674815/ /pubmed/33190462 http://dx.doi.org/10.4258/hir.2020.26.4.284 Text en © 2020 The Korean Society of Medical Informatics This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Karajizadeh, Mehrdad Nasiri, Mahdi Yadollahi, Mahnaz Zolfaghari, Amir Hussain Pakdam, Ali Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset |
title | Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset |
title_full | Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset |
title_fullStr | Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset |
title_full_unstemmed | Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset |
title_short | Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset |
title_sort | mortality prediction from hospital-acquired infections in trauma patients using an unbalanced dataset |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7674815/ https://www.ncbi.nlm.nih.gov/pubmed/33190462 http://dx.doi.org/10.4258/hir.2020.26.4.284 |
work_keys_str_mv | AT karajizadehmehrdad mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset AT nasirimahdi mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset AT yadollahimahnaz mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset AT zolfaghariamirhussain mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset AT pakdamali mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset |