Cargando…

2107. Decision Trees vs. Neural Networks for Supervised Machine Learning-Based Prediction of Healthcare-Associated Urinary Tract Infections

BACKGROUND: Supervised machine learning (SML)-based methods could facilitate early prediction of healthcare-related adverse events. The role of SML in stratifying patient-risk of infectious events during hospitalization and their performance using limited subsets of standardized and widely available...

Descripción completa

Detalles Bibliográficos
Autores principales: Zachariah, Philip, Buenaventura, Elioth Mirsha Sanabria, Liu, Jianfang, Cohen, Bevin, Yao, David, Larson, Elaine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6253432/
http://dx.doi.org/10.1093/ofid/ofy210.1763
_version_ 1783373493003354112
author Zachariah, Philip
Buenaventura, Elioth Mirsha Sanabria
Liu, Jianfang
Cohen, Bevin
Yao, David
Larson, Elaine
author_facet Zachariah, Philip
Buenaventura, Elioth Mirsha Sanabria
Liu, Jianfang
Cohen, Bevin
Yao, David
Larson, Elaine
author_sort Zachariah, Philip
collection PubMed
description BACKGROUND: Supervised machine learning (SML)-based methods could facilitate early prediction of healthcare-related adverse events. The role of SML in stratifying patient-risk of infectious events during hospitalization and their performance using limited subsets of standardized and widely available predictors is less known. Using a large cohort of adult inpatients, we use SML techniques to predict a diagnosis of urinary tract infection (UTI) during hospitalization. METHODS: We used previously validated data from adults (≥18 years old) hospitalized between 2009 and 2016 in a healthcare system as part of a federally funded study. The outcome was a UTI detected >2 days after admission. Predictors measured clinical complexity, history of healthcare-associated complications and specific risk factors for UTI. Predictors were restricted to those standardized and readily obtainable across facilities (e.g., ICD codes). Two SML methods, neural networks (NN) and decision trees (DT) were used. The NN used two hidden layers and a sigmoid output function. The DT used binary recursive portioning and Gini coefficient to measure node impurity. 60% of available hospitalizations were the training set, and 40% used as test set for validation. Cross validation was used to refine the model. Oversampling was used to adjust for the rare outcome. The area under the curve (AUC) for the test set measured model performance. RESULTS: From a total of 897,344 hospitalizations there were 16,069 UTIs identified from the data set during the study period. Applying NN and DT to the raw dataset, AUC’s of 0.55 and 0.69 were achieved respectively with the test set. Model performance for DNN and DT improved with oversampling to 0.77 and 0.78, outperforming traditional logistic regression (Figure 1). The optimal DT is presented (Figure 2). [Image: see text] [Image: see text] CONCLUSION: Reasonable prediction performance for an infectious event during hospitalization was achieved using a limited set of routinely available and standardized variables. While both SML methods had comparable performance, the DT was more interpretable. Further work will extend these methods to other infectious events, use more specific EHR data and link these predictions to interventions in real time. DISCLOSURES: All authors: No reported disclosures.
format Online
Article
Text
id pubmed-6253432
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-62534322018-11-28 2107. Decision Trees vs. Neural Networks for Supervised Machine Learning-Based Prediction of Healthcare-Associated Urinary Tract Infections Zachariah, Philip Buenaventura, Elioth Mirsha Sanabria Liu, Jianfang Cohen, Bevin Yao, David Larson, Elaine Open Forum Infect Dis Abstracts BACKGROUND: Supervised machine learning (SML)-based methods could facilitate early prediction of healthcare-related adverse events. The role of SML in stratifying patient-risk of infectious events during hospitalization and their performance using limited subsets of standardized and widely available predictors is less known. Using a large cohort of adult inpatients, we use SML techniques to predict a diagnosis of urinary tract infection (UTI) during hospitalization. METHODS: We used previously validated data from adults (≥18 years old) hospitalized between 2009 and 2016 in a healthcare system as part of a federally funded study. The outcome was a UTI detected >2 days after admission. Predictors measured clinical complexity, history of healthcare-associated complications and specific risk factors for UTI. Predictors were restricted to those standardized and readily obtainable across facilities (e.g., ICD codes). Two SML methods, neural networks (NN) and decision trees (DT) were used. The NN used two hidden layers and a sigmoid output function. The DT used binary recursive portioning and Gini coefficient to measure node impurity. 60% of available hospitalizations were the training set, and 40% used as test set for validation. Cross validation was used to refine the model. Oversampling was used to adjust for the rare outcome. The area under the curve (AUC) for the test set measured model performance. RESULTS: From a total of 897,344 hospitalizations there were 16,069 UTIs identified from the data set during the study period. Applying NN and DT to the raw dataset, AUC’s of 0.55 and 0.69 were achieved respectively with the test set. Model performance for DNN and DT improved with oversampling to 0.77 and 0.78, outperforming traditional logistic regression (Figure 1). The optimal DT is presented (Figure 2). [Image: see text] [Image: see text] CONCLUSION: Reasonable prediction performance for an infectious event during hospitalization was achieved using a limited set of routinely available and standardized variables. While both SML methods had comparable performance, the DT was more interpretable. Further work will extend these methods to other infectious events, use more specific EHR data and link these predictions to interventions in real time. DISCLOSURES: All authors: No reported disclosures. Oxford University Press 2018-11-26 /pmc/articles/PMC6253432/ http://dx.doi.org/10.1093/ofid/ofy210.1763 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Infectious Diseases Society of America. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Abstracts
Zachariah, Philip
Buenaventura, Elioth Mirsha Sanabria
Liu, Jianfang
Cohen, Bevin
Yao, David
Larson, Elaine
2107. Decision Trees vs. Neural Networks for Supervised Machine Learning-Based Prediction of Healthcare-Associated Urinary Tract Infections
title 2107. Decision Trees vs. Neural Networks for Supervised Machine Learning-Based Prediction of Healthcare-Associated Urinary Tract Infections
title_full 2107. Decision Trees vs. Neural Networks for Supervised Machine Learning-Based Prediction of Healthcare-Associated Urinary Tract Infections
title_fullStr 2107. Decision Trees vs. Neural Networks for Supervised Machine Learning-Based Prediction of Healthcare-Associated Urinary Tract Infections
title_full_unstemmed 2107. Decision Trees vs. Neural Networks for Supervised Machine Learning-Based Prediction of Healthcare-Associated Urinary Tract Infections
title_short 2107. Decision Trees vs. Neural Networks for Supervised Machine Learning-Based Prediction of Healthcare-Associated Urinary Tract Infections
title_sort 2107. decision trees vs. neural networks for supervised machine learning-based prediction of healthcare-associated urinary tract infections
topic Abstracts
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6253432/
http://dx.doi.org/10.1093/ofid/ofy210.1763
work_keys_str_mv AT zachariahphilip 2107decisiontreesvsneuralnetworksforsupervisedmachinelearningbasedpredictionofhealthcareassociatedurinarytractinfections
AT buenaventuraeliothmirshasanabria 2107decisiontreesvsneuralnetworksforsupervisedmachinelearningbasedpredictionofhealthcareassociatedurinarytractinfections
AT liujianfang 2107decisiontreesvsneuralnetworksforsupervisedmachinelearningbasedpredictionofhealthcareassociatedurinarytractinfections
AT cohenbevin 2107decisiontreesvsneuralnetworksforsupervisedmachinelearningbasedpredictionofhealthcareassociatedurinarytractinfections
AT yaodavid 2107decisiontreesvsneuralnetworksforsupervisedmachinelearningbasedpredictionofhealthcareassociatedurinarytractinfections
AT larsonelaine 2107decisiontreesvsneuralnetworksforsupervisedmachinelearningbasedpredictionofhealthcareassociatedurinarytractinfections