Cargando…

Detection of Patients with Influenza Syndrome Using Machine-Learning Models Learned from Emergency Department Reports

OBJECTIVE: Compare 7 machine learning algorithms with an expert constructed Bayesian network on detection of patients with influenza syndrome. INTRODUCTION: Early detection of influenza outbreaks is critical to public health officials. Case detection is the foundation for outbreak detection. Previou...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pineda, Arturo López, Tsui, Fu-Chiang, Visweswaran, Shyam, Cooper, Gregory F.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	University of Illinois at Chicago Library 2013
Materias:	ISDS 2012 Conference Abstracts
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692886/

_version_	1782274679176691712
author	Pineda, Arturo López Tsui, Fu-Chiang Visweswaran, Shyam Cooper, Gregory F.
author_facet	Pineda, Arturo López Tsui, Fu-Chiang Visweswaran, Shyam Cooper, Gregory F.
author_sort	Pineda, Arturo López
collection	PubMed
description	OBJECTIVE: Compare 7 machine learning algorithms with an expert constructed Bayesian network on detection of patients with influenza syndrome. INTRODUCTION: Early detection of influenza outbreaks is critical to public health officials. Case detection is the foundation for outbreak detection. Previous study by Elkin el al. demonstrated that using individual emergency department (ED) reports can better detect influenza cases than using chief complaints [1]. Our recent study using ED reports processed by Bayesian networks (using expert constructed network structure) showed high detection accuracy on detection of influenza cases [2]. METHODS: The dataset used in this study includes 182 ED reports with confirmed PCR influenza tests (Jan 1, 2007–Dec 31, 2009) and 40853 ED reports as control cases from 8 EDs in UPMC (Jul 1, 2010–Aug 31, 2010). All ED reports were deidentified by De-ID software with IRB approval. An NLP system, Topaz, was used to extract relevant findings and symptoms from the reports and encoded them with the UMLS concept unique identifier codes [2]. Two subsets were created: DS1-train (67% of cases) and DS1-test (remaining 33%). The algorithms used for training the models are: Naïve Bayes Classifier, Efficient Bayesian Multivariate Classification (EBMC) [3], Bayesian Network with K2 algorithm, Logistic Regression (LR), Support Vector Machine (SVM), Artificial Neural Networks (ANN) and Random Forest (RF). The predictive performance of each method was evaluated using the area under the receiver operator characteristic (AUROC) and the Hosmer-Lemeshow (HL) statistical significance testing, that describes the lack-of-fit of the model to the dataset. RESULTS: The evaluation results of all the models using DS1-test, including the AUROC, its confidence interval, p-value (between each algorithm and the expert) and the calibration with HL are shown in Table 1. CONCLUSIONS: All models achieved high AUROC values. The pairwise comparison of p-values in Table 1 demonstrates that the AUROCs of all the machine-learning models and the expert model were not significantly different. Nevertheless, EBMC is the best fitted. The model created by EBMC is shown in Figure 1. One limitation of the study is that the test dataset has low influenza prevalence, which may bias the detection algorithm performance. We are in the process of testing the algorithms using higher prevalence rate. The same process could also be applied to other diseases to further research the generalizability of our method. [Table: see text] [Figure: see text]
format	Online Article Text
id	pubmed-3692886
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	University of Illinois at Chicago Library
record_format	MEDLINE/PubMed
spelling	pubmed-36928862013-06-26 Detection of Patients with Influenza Syndrome Using Machine-Learning Models Learned from Emergency Department Reports Pineda, Arturo López Tsui, Fu-Chiang Visweswaran, Shyam Cooper, Gregory F. Online J Public Health Inform ISDS 2012 Conference Abstracts OBJECTIVE: Compare 7 machine learning algorithms with an expert constructed Bayesian network on detection of patients with influenza syndrome. INTRODUCTION: Early detection of influenza outbreaks is critical to public health officials. Case detection is the foundation for outbreak detection. Previous study by Elkin el al. demonstrated that using individual emergency department (ED) reports can better detect influenza cases than using chief complaints [1]. Our recent study using ED reports processed by Bayesian networks (using expert constructed network structure) showed high detection accuracy on detection of influenza cases [2]. METHODS: The dataset used in this study includes 182 ED reports with confirmed PCR influenza tests (Jan 1, 2007–Dec 31, 2009) and 40853 ED reports as control cases from 8 EDs in UPMC (Jul 1, 2010–Aug 31, 2010). All ED reports were deidentified by De-ID software with IRB approval. An NLP system, Topaz, was used to extract relevant findings and symptoms from the reports and encoded them with the UMLS concept unique identifier codes [2]. Two subsets were created: DS1-train (67% of cases) and DS1-test (remaining 33%). The algorithms used for training the models are: Naïve Bayes Classifier, Efficient Bayesian Multivariate Classification (EBMC) [3], Bayesian Network with K2 algorithm, Logistic Regression (LR), Support Vector Machine (SVM), Artificial Neural Networks (ANN) and Random Forest (RF). The predictive performance of each method was evaluated using the area under the receiver operator characteristic (AUROC) and the Hosmer-Lemeshow (HL) statistical significance testing, that describes the lack-of-fit of the model to the dataset. RESULTS: The evaluation results of all the models using DS1-test, including the AUROC, its confidence interval, p-value (between each algorithm and the expert) and the calibration with HL are shown in Table 1. CONCLUSIONS: All models achieved high AUROC values. The pairwise comparison of p-values in Table 1 demonstrates that the AUROCs of all the machine-learning models and the expert model were not significantly different. Nevertheless, EBMC is the best fitted. The model created by EBMC is shown in Figure 1. One limitation of the study is that the test dataset has low influenza prevalence, which may bias the detection algorithm performance. We are in the process of testing the algorithms using higher prevalence rate. The same process could also be applied to other diseases to further research the generalizability of our method. [Table: see text] [Figure: see text] University of Illinois at Chicago Library 2013-04-04 /pmc/articles/PMC3692886/ Text en ©2013 the author(s) http://www.uic.edu/htbin/cgiwrap/bin/ojs/index.php/ojphi/about/submissions#copyrightNotice This is an Open Access article. Authors own copyright of their articles appearing in the Online Journal of Public Health Informatics. Readers may copy articles without permission of the copyright owner(s), as long as the author and OJPHI are acknowledged in the copy and the copy is used for educational, not-for-profit purposes.
spellingShingle	ISDS 2012 Conference Abstracts Pineda, Arturo López Tsui, Fu-Chiang Visweswaran, Shyam Cooper, Gregory F. Detection of Patients with Influenza Syndrome Using Machine-Learning Models Learned from Emergency Department Reports
title	Detection of Patients with Influenza Syndrome Using Machine-Learning Models Learned from Emergency Department Reports
title_full	Detection of Patients with Influenza Syndrome Using Machine-Learning Models Learned from Emergency Department Reports
title_fullStr	Detection of Patients with Influenza Syndrome Using Machine-Learning Models Learned from Emergency Department Reports
title_full_unstemmed	Detection of Patients with Influenza Syndrome Using Machine-Learning Models Learned from Emergency Department Reports
title_short	Detection of Patients with Influenza Syndrome Using Machine-Learning Models Learned from Emergency Department Reports
title_sort	detection of patients with influenza syndrome using machine-learning models learned from emergency department reports
topic	ISDS 2012 Conference Abstracts
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692886/
work_keys_str_mv	AT pinedaarturolopez detectionofpatientswithinfluenzasyndromeusingmachinelearningmodelslearnedfromemergencydepartmentreports AT tsuifuchiang detectionofpatientswithinfluenzasyndromeusingmachinelearningmodelslearnedfromemergencydepartmentreports AT visweswaranshyam detectionofpatientswithinfluenzasyndromeusingmachinelearningmodelslearnedfromemergencydepartmentreports AT coopergregoryf detectionofpatientswithinfluenzasyndromeusingmachinelearningmodelslearnedfromemergencydepartmentreports

Detection of Patients with Influenza Syndrome Using Machine-Learning Models Learned from Emergency Department Reports

Ejemplares similares