Cargando…
Forecasting influenza in Hong Kong with Google search queries and statistical model fusion
BACKGROUND: The objective of this study is to investigate predictive utility of online social media and web search queries, particularly, Google search data, to forecast new cases of influenza-like-illness (ILI) in general outpatient clinics (GOPC) in Hong Kong. To mitigate the impact of sensitivity...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5413039/ https://www.ncbi.nlm.nih.gov/pubmed/28464015 http://dx.doi.org/10.1371/journal.pone.0176690 |
_version_ | 1783233124520427520 |
---|---|
author | Xu, Qinneng Gel, Yulia R. Ramirez Ramirez, L. Leticia Nezafati, Kusha Zhang, Qingpeng Tsui, Kwok-Leung |
author_facet | Xu, Qinneng Gel, Yulia R. Ramirez Ramirez, L. Leticia Nezafati, Kusha Zhang, Qingpeng Tsui, Kwok-Leung |
author_sort | Xu, Qinneng |
collection | PubMed |
description | BACKGROUND: The objective of this study is to investigate predictive utility of online social media and web search queries, particularly, Google search data, to forecast new cases of influenza-like-illness (ILI) in general outpatient clinics (GOPC) in Hong Kong. To mitigate the impact of sensitivity to self-excitement (i.e., fickle media interest) and other artifacts of online social media data, in our approach we fuse multiple offline and online data sources. METHODS: Four individual models: generalized linear model (GLM), least absolute shrinkage and selection operator (LASSO), autoregressive integrated moving average (ARIMA), and deep learning (DL) with Feedforward Neural Networks (FNN) are employed to forecast ILI-GOPC both one week and two weeks in advance. The covariates include Google search queries, meteorological data, and previously recorded offline ILI. To our knowledge, this is the first study that introduces deep learning methodology into surveillance of infectious diseases and investigates its predictive utility. Furthermore, to exploit the strength from each individual forecasting models, we use statistical model fusion, using Bayesian model averaging (BMA), which allows a systematic integration of multiple forecast scenarios. For each model, an adaptive approach is used to capture the recent relationship between ILI and covariates. RESULTS: DL with FNN appears to deliver the most competitive predictive performance among the four considered individual models. Combing all four models in a comprehensive BMA framework allows to further improve such predictive evaluation metrics as root mean squared error (RMSE) and mean absolute predictive error (MAPE). Nevertheless, DL with FNN remains the preferred method for predicting locations of influenza peaks. CONCLUSIONS: The proposed approach can be viewed a feasible alternative to forecast ILI in Hong Kong or other countries where ILI has no constant seasonal trend and influenza data resources are limited. The proposed methodology is easily tractable and computationally efficient. |
format | Online Article Text |
id | pubmed-5413039 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-54130392017-05-14 Forecasting influenza in Hong Kong with Google search queries and statistical model fusion Xu, Qinneng Gel, Yulia R. Ramirez Ramirez, L. Leticia Nezafati, Kusha Zhang, Qingpeng Tsui, Kwok-Leung PLoS One Research Article BACKGROUND: The objective of this study is to investigate predictive utility of online social media and web search queries, particularly, Google search data, to forecast new cases of influenza-like-illness (ILI) in general outpatient clinics (GOPC) in Hong Kong. To mitigate the impact of sensitivity to self-excitement (i.e., fickle media interest) and other artifacts of online social media data, in our approach we fuse multiple offline and online data sources. METHODS: Four individual models: generalized linear model (GLM), least absolute shrinkage and selection operator (LASSO), autoregressive integrated moving average (ARIMA), and deep learning (DL) with Feedforward Neural Networks (FNN) are employed to forecast ILI-GOPC both one week and two weeks in advance. The covariates include Google search queries, meteorological data, and previously recorded offline ILI. To our knowledge, this is the first study that introduces deep learning methodology into surveillance of infectious diseases and investigates its predictive utility. Furthermore, to exploit the strength from each individual forecasting models, we use statistical model fusion, using Bayesian model averaging (BMA), which allows a systematic integration of multiple forecast scenarios. For each model, an adaptive approach is used to capture the recent relationship between ILI and covariates. RESULTS: DL with FNN appears to deliver the most competitive predictive performance among the four considered individual models. Combing all four models in a comprehensive BMA framework allows to further improve such predictive evaluation metrics as root mean squared error (RMSE) and mean absolute predictive error (MAPE). Nevertheless, DL with FNN remains the preferred method for predicting locations of influenza peaks. CONCLUSIONS: The proposed approach can be viewed a feasible alternative to forecast ILI in Hong Kong or other countries where ILI has no constant seasonal trend and influenza data resources are limited. The proposed methodology is easily tractable and computationally efficient. Public Library of Science 2017-05-02 /pmc/articles/PMC5413039/ /pubmed/28464015 http://dx.doi.org/10.1371/journal.pone.0176690 Text en © 2017 Xu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Xu, Qinneng Gel, Yulia R. Ramirez Ramirez, L. Leticia Nezafati, Kusha Zhang, Qingpeng Tsui, Kwok-Leung Forecasting influenza in Hong Kong with Google search queries and statistical model fusion |
title | Forecasting influenza in Hong Kong with Google search queries and statistical model fusion |
title_full | Forecasting influenza in Hong Kong with Google search queries and statistical model fusion |
title_fullStr | Forecasting influenza in Hong Kong with Google search queries and statistical model fusion |
title_full_unstemmed | Forecasting influenza in Hong Kong with Google search queries and statistical model fusion |
title_short | Forecasting influenza in Hong Kong with Google search queries and statistical model fusion |
title_sort | forecasting influenza in hong kong with google search queries and statistical model fusion |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5413039/ https://www.ncbi.nlm.nih.gov/pubmed/28464015 http://dx.doi.org/10.1371/journal.pone.0176690 |
work_keys_str_mv | AT xuqinneng forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion AT gelyuliar forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion AT ramirezramirezlleticia forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion AT nezafatikusha forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion AT zhangqingpeng forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion AT tsuikwokleung forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion |