Cargando…
Machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study
BACKGROUND: Although several risk factors for nosocomial diarrhea have been identified, the detail of association between these factors and onset of nosocomial diarrhea, such as degree of importance or temporal pattern of influence, remains unclear. We aimed to determine the association between risk...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6825409/ https://www.ncbi.nlm.nih.gov/pubmed/31687281 http://dx.doi.org/10.7717/peerj.7969 |
_version_ | 1783464895394611200 |
---|---|
author | Kurisu, Ken Yoshiuchi, Kazuhiro Ogino, Kei Oda, Toshimi |
author_facet | Kurisu, Ken Yoshiuchi, Kazuhiro Ogino, Kei Oda, Toshimi |
author_sort | Kurisu, Ken |
collection | PubMed |
description | BACKGROUND: Although several risk factors for nosocomial diarrhea have been identified, the detail of association between these factors and onset of nosocomial diarrhea, such as degree of importance or temporal pattern of influence, remains unclear. We aimed to determine the association between risk factors and onset of nosocomial diarrhea using machine learning algorithms. METHODS: We retrospectively collected data of patients with acute cerebral infarction. Seven variables, including age, sex, modified Rankin Scale (mRS) score, and number of days of antibiotics, tube feeding, proton pump inhibitors, and histamine 2-receptor antagonist use, were used in the analysis. We split the data into a training dataset and independant test dataset. Based on the training dataset, we developed a random forest, support vector machine (SVM), and radial basis function (RBF) network model. By calculating an area under the curve (AUC) of the receiver operating characteristic curve using 5-fold cross-validation, we performed feature selection and hyperparameter optimization in each model. According to their final performances, we selected the optimal model and also validated it in the independent test dataset. Based on the selected model, we visualized the variable importance and the association between each variable and the outcome using partial dependence plots. RESULTS: Two-hundred and eighteen patients were included. In the cross-validation within the training dataset, the random forest model achieved an AUC of 0.944, which was higher than in the SVM and RBF network models. The random forest model also achieved an AUC of 0.832 in the independent test dataset. Tube feeding use days, mRS score, antibiotic use days, age and sex were strongly associated with the onset of nosocomial diarrhea, in this order. Tube feeding use had an inverse U-shaped association with the outcome. The mRS score and age had a convex downward and increasing association, while antibiotic use had a convex upward association with the outcome. CONCLUSION: We revealed the degree of importance and temporal pattern of the influence of several risk factors for nosocomial diarrhea, which could help clinicians manage nosocomial diarrhea. |
format | Online Article Text |
id | pubmed-6825409 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-68254092019-11-04 Machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study Kurisu, Ken Yoshiuchi, Kazuhiro Ogino, Kei Oda, Toshimi PeerJ Gastroenterology and Hepatology BACKGROUND: Although several risk factors for nosocomial diarrhea have been identified, the detail of association between these factors and onset of nosocomial diarrhea, such as degree of importance or temporal pattern of influence, remains unclear. We aimed to determine the association between risk factors and onset of nosocomial diarrhea using machine learning algorithms. METHODS: We retrospectively collected data of patients with acute cerebral infarction. Seven variables, including age, sex, modified Rankin Scale (mRS) score, and number of days of antibiotics, tube feeding, proton pump inhibitors, and histamine 2-receptor antagonist use, were used in the analysis. We split the data into a training dataset and independant test dataset. Based on the training dataset, we developed a random forest, support vector machine (SVM), and radial basis function (RBF) network model. By calculating an area under the curve (AUC) of the receiver operating characteristic curve using 5-fold cross-validation, we performed feature selection and hyperparameter optimization in each model. According to their final performances, we selected the optimal model and also validated it in the independent test dataset. Based on the selected model, we visualized the variable importance and the association between each variable and the outcome using partial dependence plots. RESULTS: Two-hundred and eighteen patients were included. In the cross-validation within the training dataset, the random forest model achieved an AUC of 0.944, which was higher than in the SVM and RBF network models. The random forest model also achieved an AUC of 0.832 in the independent test dataset. Tube feeding use days, mRS score, antibiotic use days, age and sex were strongly associated with the onset of nosocomial diarrhea, in this order. Tube feeding use had an inverse U-shaped association with the outcome. The mRS score and age had a convex downward and increasing association, while antibiotic use had a convex upward association with the outcome. CONCLUSION: We revealed the degree of importance and temporal pattern of the influence of several risk factors for nosocomial diarrhea, which could help clinicians manage nosocomial diarrhea. PeerJ Inc. 2019-10-30 /pmc/articles/PMC6825409/ /pubmed/31687281 http://dx.doi.org/10.7717/peerj.7969 Text en ©2019 Kurisu et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Gastroenterology and Hepatology Kurisu, Ken Yoshiuchi, Kazuhiro Ogino, Kei Oda, Toshimi Machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study |
title | Machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study |
title_full | Machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study |
title_fullStr | Machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study |
title_full_unstemmed | Machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study |
title_short | Machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study |
title_sort | machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study |
topic | Gastroenterology and Hepatology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6825409/ https://www.ncbi.nlm.nih.gov/pubmed/31687281 http://dx.doi.org/10.7717/peerj.7969 |
work_keys_str_mv | AT kurisuken machinelearninganalysistoidentifytheassociationbetweenriskfactorsandonsetofnosocomialdiarrheaaretrospectivecohortstudy AT yoshiuchikazuhiro machinelearninganalysistoidentifytheassociationbetweenriskfactorsandonsetofnosocomialdiarrheaaretrospectivecohortstudy AT oginokei machinelearninganalysistoidentifytheassociationbetweenriskfactorsandonsetofnosocomialdiarrheaaretrospectivecohortstudy AT odatoshimi machinelearninganalysistoidentifytheassociationbetweenriskfactorsandonsetofnosocomialdiarrheaaretrospectivecohortstudy |