Cargando…

Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study

BACKGROUND: Traditional surveillance systems produce estimates of influenza-like illness (ILI) incidence rates, but with 1- to 3-week delay. Accurate real-time monitoring systems for influenza outbreaks could be useful for making public health decisions. Several studies have investigated the possibi...

Descripción completa

Detalles Bibliográficos
Autores principales: Poirier, Canelle, Lavenu, Audrey, Bertaud, Valérie, Campillo-Gimenez, Boris, Chazard, Emmanuel, Cuggia, Marc, Bouzillé, Guillaume
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6320394/
https://www.ncbi.nlm.nih.gov/pubmed/30578212
http://dx.doi.org/10.2196/11361
_version_ 1783385216289603584
author Poirier, Canelle
Lavenu, Audrey
Bertaud, Valérie
Campillo-Gimenez, Boris
Chazard, Emmanuel
Cuggia, Marc
Bouzillé, Guillaume
author_facet Poirier, Canelle
Lavenu, Audrey
Bertaud, Valérie
Campillo-Gimenez, Boris
Chazard, Emmanuel
Cuggia, Marc
Bouzillé, Guillaume
author_sort Poirier, Canelle
collection PubMed
description BACKGROUND: Traditional surveillance systems produce estimates of influenza-like illness (ILI) incidence rates, but with 1- to 3-week delay. Accurate real-time monitoring systems for influenza outbreaks could be useful for making public health decisions. Several studies have investigated the possibility of using internet users’ activity data and different statistical models to predict influenza epidemics in near real time. However, very few studies have investigated hospital big data. OBJECTIVE: Here, we compared internet and electronic health records (EHRs) data and different statistical models to identify the best approach (data type and statistical model) for ILI estimates in real time. METHODS: We used Google data for internet data and the clinical data warehouse eHOP, which included all EHRs from Rennes University Hospital (France), for hospital data. We compared 3 statistical models—random forest, elastic net, and support vector machine (SVM). RESULTS: For national ILI incidence rate, the best correlation was 0.98 and the mean squared error (MSE) was 866 obtained with hospital data and the SVM model. For the Brittany region, the best correlation was 0.923 and MSE was 2364 obtained with hospital data and the SVM model. CONCLUSIONS: We found that EHR data together with historical epidemiological information (French Sentinelles network) allowed for accurately predicting ILI incidence rates for the entire France as well as for the Brittany region and outperformed the internet data whatever was the statistical model used. Moreover, the performance of the two statistical models, elastic net and SVM, was comparable.
format Online
Article
Text
id pubmed-6320394
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-63203942019-01-28 Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study Poirier, Canelle Lavenu, Audrey Bertaud, Valérie Campillo-Gimenez, Boris Chazard, Emmanuel Cuggia, Marc Bouzillé, Guillaume JMIR Public Health Surveill Original Paper BACKGROUND: Traditional surveillance systems produce estimates of influenza-like illness (ILI) incidence rates, but with 1- to 3-week delay. Accurate real-time monitoring systems for influenza outbreaks could be useful for making public health decisions. Several studies have investigated the possibility of using internet users’ activity data and different statistical models to predict influenza epidemics in near real time. However, very few studies have investigated hospital big data. OBJECTIVE: Here, we compared internet and electronic health records (EHRs) data and different statistical models to identify the best approach (data type and statistical model) for ILI estimates in real time. METHODS: We used Google data for internet data and the clinical data warehouse eHOP, which included all EHRs from Rennes University Hospital (France), for hospital data. We compared 3 statistical models—random forest, elastic net, and support vector machine (SVM). RESULTS: For national ILI incidence rate, the best correlation was 0.98 and the mean squared error (MSE) was 866 obtained with hospital data and the SVM model. For the Brittany region, the best correlation was 0.923 and MSE was 2364 obtained with hospital data and the SVM model. CONCLUSIONS: We found that EHR data together with historical epidemiological information (French Sentinelles network) allowed for accurately predicting ILI incidence rates for the entire France as well as for the Brittany region and outperformed the internet data whatever was the statistical model used. Moreover, the performance of the two statistical models, elastic net and SVM, was comparable. JMIR Publications 2018-12-21 /pmc/articles/PMC6320394/ /pubmed/30578212 http://dx.doi.org/10.2196/11361 Text en ©Canelle Poirier, Audrey Lavenu, Valérie Bertaud, Boris Campillo-Gimenez, Emmanuel Chazard, Marc Cuggia, Guillaume Bouzillé. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 21.12.2018. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on http://publichealth.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Poirier, Canelle
Lavenu, Audrey
Bertaud, Valérie
Campillo-Gimenez, Boris
Chazard, Emmanuel
Cuggia, Marc
Bouzillé, Guillaume
Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study
title Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study
title_full Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study
title_fullStr Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study
title_full_unstemmed Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study
title_short Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study
title_sort real time influenza monitoring using hospital big data in combination with machine learning methods: comparison study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6320394/
https://www.ncbi.nlm.nih.gov/pubmed/30578212
http://dx.doi.org/10.2196/11361
work_keys_str_mv AT poiriercanelle realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy
AT lavenuaudrey realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy
AT bertaudvalerie realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy
AT campillogimenezboris realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy
AT chazardemmanuel realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy
AT cuggiamarc realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy
AT bouzilleguillaume realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy