Cargando…
Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study
BACKGROUND: Traditional surveillance systems produce estimates of influenza-like illness (ILI) incidence rates, but with 1- to 3-week delay. Accurate real-time monitoring systems for influenza outbreaks could be useful for making public health decisions. Several studies have investigated the possibi...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6320394/ https://www.ncbi.nlm.nih.gov/pubmed/30578212 http://dx.doi.org/10.2196/11361 |
_version_ | 1783385216289603584 |
---|---|
author | Poirier, Canelle Lavenu, Audrey Bertaud, Valérie Campillo-Gimenez, Boris Chazard, Emmanuel Cuggia, Marc Bouzillé, Guillaume |
author_facet | Poirier, Canelle Lavenu, Audrey Bertaud, Valérie Campillo-Gimenez, Boris Chazard, Emmanuel Cuggia, Marc Bouzillé, Guillaume |
author_sort | Poirier, Canelle |
collection | PubMed |
description | BACKGROUND: Traditional surveillance systems produce estimates of influenza-like illness (ILI) incidence rates, but with 1- to 3-week delay. Accurate real-time monitoring systems for influenza outbreaks could be useful for making public health decisions. Several studies have investigated the possibility of using internet users’ activity data and different statistical models to predict influenza epidemics in near real time. However, very few studies have investigated hospital big data. OBJECTIVE: Here, we compared internet and electronic health records (EHRs) data and different statistical models to identify the best approach (data type and statistical model) for ILI estimates in real time. METHODS: We used Google data for internet data and the clinical data warehouse eHOP, which included all EHRs from Rennes University Hospital (France), for hospital data. We compared 3 statistical models—random forest, elastic net, and support vector machine (SVM). RESULTS: For national ILI incidence rate, the best correlation was 0.98 and the mean squared error (MSE) was 866 obtained with hospital data and the SVM model. For the Brittany region, the best correlation was 0.923 and MSE was 2364 obtained with hospital data and the SVM model. CONCLUSIONS: We found that EHR data together with historical epidemiological information (French Sentinelles network) allowed for accurately predicting ILI incidence rates for the entire France as well as for the Brittany region and outperformed the internet data whatever was the statistical model used. Moreover, the performance of the two statistical models, elastic net and SVM, was comparable. |
format | Online Article Text |
id | pubmed-6320394 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-63203942019-01-28 Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study Poirier, Canelle Lavenu, Audrey Bertaud, Valérie Campillo-Gimenez, Boris Chazard, Emmanuel Cuggia, Marc Bouzillé, Guillaume JMIR Public Health Surveill Original Paper BACKGROUND: Traditional surveillance systems produce estimates of influenza-like illness (ILI) incidence rates, but with 1- to 3-week delay. Accurate real-time monitoring systems for influenza outbreaks could be useful for making public health decisions. Several studies have investigated the possibility of using internet users’ activity data and different statistical models to predict influenza epidemics in near real time. However, very few studies have investigated hospital big data. OBJECTIVE: Here, we compared internet and electronic health records (EHRs) data and different statistical models to identify the best approach (data type and statistical model) for ILI estimates in real time. METHODS: We used Google data for internet data and the clinical data warehouse eHOP, which included all EHRs from Rennes University Hospital (France), for hospital data. We compared 3 statistical models—random forest, elastic net, and support vector machine (SVM). RESULTS: For national ILI incidence rate, the best correlation was 0.98 and the mean squared error (MSE) was 866 obtained with hospital data and the SVM model. For the Brittany region, the best correlation was 0.923 and MSE was 2364 obtained with hospital data and the SVM model. CONCLUSIONS: We found that EHR data together with historical epidemiological information (French Sentinelles network) allowed for accurately predicting ILI incidence rates for the entire France as well as for the Brittany region and outperformed the internet data whatever was the statistical model used. Moreover, the performance of the two statistical models, elastic net and SVM, was comparable. JMIR Publications 2018-12-21 /pmc/articles/PMC6320394/ /pubmed/30578212 http://dx.doi.org/10.2196/11361 Text en ©Canelle Poirier, Audrey Lavenu, Valérie Bertaud, Boris Campillo-Gimenez, Emmanuel Chazard, Marc Cuggia, Guillaume Bouzillé. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 21.12.2018. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on http://publichealth.jmir.org, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Poirier, Canelle Lavenu, Audrey Bertaud, Valérie Campillo-Gimenez, Boris Chazard, Emmanuel Cuggia, Marc Bouzillé, Guillaume Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study |
title | Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study |
title_full | Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study |
title_fullStr | Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study |
title_full_unstemmed | Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study |
title_short | Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study |
title_sort | real time influenza monitoring using hospital big data in combination with machine learning methods: comparison study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6320394/ https://www.ncbi.nlm.nih.gov/pubmed/30578212 http://dx.doi.org/10.2196/11361 |
work_keys_str_mv | AT poiriercanelle realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy AT lavenuaudrey realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy AT bertaudvalerie realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy AT campillogimenezboris realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy AT chazardemmanuel realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy AT cuggiamarc realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy AT bouzilleguillaume realtimeinfluenzamonitoringusinghospitalbigdataincombinationwithmachinelearningmethodscomparisonstudy |