Cargando…

Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset

OBJECTIVES: Machine learning has been widely used to predict diseases, and it is used to derive impressive knowledge in the healthcare domain. Our objective was to predict in-hospital mortality from hospital-acquired infections in trauma patients on an unbalanced dataset. METHODS: Our study was a cr...

Descripción completa

Detalles Bibliográficos
Autores principales: Karajizadeh, Mehrdad, Nasiri, Mahdi, Yadollahi, Mahnaz, Zolfaghari, Amir Hussain, Pakdam, Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society of Medical Informatics 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7674815/
https://www.ncbi.nlm.nih.gov/pubmed/33190462
http://dx.doi.org/10.4258/hir.2020.26.4.284
_version_ 1783611587428352000
author Karajizadeh, Mehrdad
Nasiri, Mahdi
Yadollahi, Mahnaz
Zolfaghari, Amir Hussain
Pakdam, Ali
author_facet Karajizadeh, Mehrdad
Nasiri, Mahdi
Yadollahi, Mahnaz
Zolfaghari, Amir Hussain
Pakdam, Ali
author_sort Karajizadeh, Mehrdad
collection PubMed
description OBJECTIVES: Machine learning has been widely used to predict diseases, and it is used to derive impressive knowledge in the healthcare domain. Our objective was to predict in-hospital mortality from hospital-acquired infections in trauma patients on an unbalanced dataset. METHODS: Our study was a cross-sectional analysis on trauma patients with hospital-acquired infections who were admitted to Shiraz Trauma Hospital from March 20, 2017, to March 21, 2018. The study data was obtained from the surveillance hospital infection database. The data included sex, age, mechanism of injury, body region injured, severity score, type of intervention, infection day after admission, and microorganism causes of infections. We developed our mortality prediction model by random under-sampling, random over-sampling, clustering (k-mean)-C5.0, SMOTE-C5.0, ADASYN-C5.5, SMOTE-SVM, ADASYN-SVM, SMOTE-ANN, and ADASYN-ANN among hospital-acquired infections in trauma patients. All mortality predictions were conducted by IBM SPSS Modeler 18. RESULTS: We studied 549 individuals with hospital-acquired infections in a trauma hospital in Shiraz during 2017 and 2018. Prediction accuracy before balancing of the dataset was 86.16%. In contrast, the prediction accuracy for the balanced dataset achieved by random under-sampling, random over-sampling, clustering (k-mean)-C5.0, SMOTE-C5.0, ADASYN-C5.5, and SMOTE-SVM was 70.69%, 94.74%, 93.02%, 93.66%, 90.93%, and 100%, respectively. CONCLUSIONS: Our findings demonstrate that cleaning an unbalanced dataset increases the accuracy of the classification model. Also, predicting mortality by a clustered under-sampling approach was more precise in comparison to random under-sampling and random over-sampling methods.
format Online
Article
Text
id pubmed-7674815
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Korean Society of Medical Informatics
record_format MEDLINE/PubMed
spelling pubmed-76748152020-11-19 Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset Karajizadeh, Mehrdad Nasiri, Mahdi Yadollahi, Mahnaz Zolfaghari, Amir Hussain Pakdam, Ali Healthc Inform Res Original Article OBJECTIVES: Machine learning has been widely used to predict diseases, and it is used to derive impressive knowledge in the healthcare domain. Our objective was to predict in-hospital mortality from hospital-acquired infections in trauma patients on an unbalanced dataset. METHODS: Our study was a cross-sectional analysis on trauma patients with hospital-acquired infections who were admitted to Shiraz Trauma Hospital from March 20, 2017, to March 21, 2018. The study data was obtained from the surveillance hospital infection database. The data included sex, age, mechanism of injury, body region injured, severity score, type of intervention, infection day after admission, and microorganism causes of infections. We developed our mortality prediction model by random under-sampling, random over-sampling, clustering (k-mean)-C5.0, SMOTE-C5.0, ADASYN-C5.5, SMOTE-SVM, ADASYN-SVM, SMOTE-ANN, and ADASYN-ANN among hospital-acquired infections in trauma patients. All mortality predictions were conducted by IBM SPSS Modeler 18. RESULTS: We studied 549 individuals with hospital-acquired infections in a trauma hospital in Shiraz during 2017 and 2018. Prediction accuracy before balancing of the dataset was 86.16%. In contrast, the prediction accuracy for the balanced dataset achieved by random under-sampling, random over-sampling, clustering (k-mean)-C5.0, SMOTE-C5.0, ADASYN-C5.5, and SMOTE-SVM was 70.69%, 94.74%, 93.02%, 93.66%, 90.93%, and 100%, respectively. CONCLUSIONS: Our findings demonstrate that cleaning an unbalanced dataset increases the accuracy of the classification model. Also, predicting mortality by a clustered under-sampling approach was more precise in comparison to random under-sampling and random over-sampling methods. Korean Society of Medical Informatics 2020-10 2020-10-31 /pmc/articles/PMC7674815/ /pubmed/33190462 http://dx.doi.org/10.4258/hir.2020.26.4.284 Text en © 2020 The Korean Society of Medical Informatics This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Karajizadeh, Mehrdad
Nasiri, Mahdi
Yadollahi, Mahnaz
Zolfaghari, Amir Hussain
Pakdam, Ali
Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset
title Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset
title_full Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset
title_fullStr Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset
title_full_unstemmed Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset
title_short Mortality Prediction from Hospital-Acquired Infections in Trauma Patients Using an Unbalanced Dataset
title_sort mortality prediction from hospital-acquired infections in trauma patients using an unbalanced dataset
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7674815/
https://www.ncbi.nlm.nih.gov/pubmed/33190462
http://dx.doi.org/10.4258/hir.2020.26.4.284
work_keys_str_mv AT karajizadehmehrdad mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset
AT nasirimahdi mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset
AT yadollahimahnaz mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset
AT zolfaghariamirhussain mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset
AT pakdamali mortalitypredictionfromhospitalacquiredinfectionsintraumapatientsusinganunbalanceddataset