Cargando…

Forecasting influenza in Hong Kong with Google search queries and statistical model fusion

BACKGROUND: The objective of this study is to investigate predictive utility of online social media and web search queries, particularly, Google search data, to forecast new cases of influenza-like-illness (ILI) in general outpatient clinics (GOPC) in Hong Kong. To mitigate the impact of sensitivity...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Qinneng, Gel, Yulia R., Ramirez Ramirez, L. Leticia, Nezafati, Kusha, Zhang, Qingpeng, Tsui, Kwok-Leung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5413039/
https://www.ncbi.nlm.nih.gov/pubmed/28464015
http://dx.doi.org/10.1371/journal.pone.0176690
_version_ 1783233124520427520
author Xu, Qinneng
Gel, Yulia R.
Ramirez Ramirez, L. Leticia
Nezafati, Kusha
Zhang, Qingpeng
Tsui, Kwok-Leung
author_facet Xu, Qinneng
Gel, Yulia R.
Ramirez Ramirez, L. Leticia
Nezafati, Kusha
Zhang, Qingpeng
Tsui, Kwok-Leung
author_sort Xu, Qinneng
collection PubMed
description BACKGROUND: The objective of this study is to investigate predictive utility of online social media and web search queries, particularly, Google search data, to forecast new cases of influenza-like-illness (ILI) in general outpatient clinics (GOPC) in Hong Kong. To mitigate the impact of sensitivity to self-excitement (i.e., fickle media interest) and other artifacts of online social media data, in our approach we fuse multiple offline and online data sources. METHODS: Four individual models: generalized linear model (GLM), least absolute shrinkage and selection operator (LASSO), autoregressive integrated moving average (ARIMA), and deep learning (DL) with Feedforward Neural Networks (FNN) are employed to forecast ILI-GOPC both one week and two weeks in advance. The covariates include Google search queries, meteorological data, and previously recorded offline ILI. To our knowledge, this is the first study that introduces deep learning methodology into surveillance of infectious diseases and investigates its predictive utility. Furthermore, to exploit the strength from each individual forecasting models, we use statistical model fusion, using Bayesian model averaging (BMA), which allows a systematic integration of multiple forecast scenarios. For each model, an adaptive approach is used to capture the recent relationship between ILI and covariates. RESULTS: DL with FNN appears to deliver the most competitive predictive performance among the four considered individual models. Combing all four models in a comprehensive BMA framework allows to further improve such predictive evaluation metrics as root mean squared error (RMSE) and mean absolute predictive error (MAPE). Nevertheless, DL with FNN remains the preferred method for predicting locations of influenza peaks. CONCLUSIONS: The proposed approach can be viewed a feasible alternative to forecast ILI in Hong Kong or other countries where ILI has no constant seasonal trend and influenza data resources are limited. The proposed methodology is easily tractable and computationally efficient.
format Online
Article
Text
id pubmed-5413039
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54130392017-05-14 Forecasting influenza in Hong Kong with Google search queries and statistical model fusion Xu, Qinneng Gel, Yulia R. Ramirez Ramirez, L. Leticia Nezafati, Kusha Zhang, Qingpeng Tsui, Kwok-Leung PLoS One Research Article BACKGROUND: The objective of this study is to investigate predictive utility of online social media and web search queries, particularly, Google search data, to forecast new cases of influenza-like-illness (ILI) in general outpatient clinics (GOPC) in Hong Kong. To mitigate the impact of sensitivity to self-excitement (i.e., fickle media interest) and other artifacts of online social media data, in our approach we fuse multiple offline and online data sources. METHODS: Four individual models: generalized linear model (GLM), least absolute shrinkage and selection operator (LASSO), autoregressive integrated moving average (ARIMA), and deep learning (DL) with Feedforward Neural Networks (FNN) are employed to forecast ILI-GOPC both one week and two weeks in advance. The covariates include Google search queries, meteorological data, and previously recorded offline ILI. To our knowledge, this is the first study that introduces deep learning methodology into surveillance of infectious diseases and investigates its predictive utility. Furthermore, to exploit the strength from each individual forecasting models, we use statistical model fusion, using Bayesian model averaging (BMA), which allows a systematic integration of multiple forecast scenarios. For each model, an adaptive approach is used to capture the recent relationship between ILI and covariates. RESULTS: DL with FNN appears to deliver the most competitive predictive performance among the four considered individual models. Combing all four models in a comprehensive BMA framework allows to further improve such predictive evaluation metrics as root mean squared error (RMSE) and mean absolute predictive error (MAPE). Nevertheless, DL with FNN remains the preferred method for predicting locations of influenza peaks. CONCLUSIONS: The proposed approach can be viewed a feasible alternative to forecast ILI in Hong Kong or other countries where ILI has no constant seasonal trend and influenza data resources are limited. The proposed methodology is easily tractable and computationally efficient. Public Library of Science 2017-05-02 /pmc/articles/PMC5413039/ /pubmed/28464015 http://dx.doi.org/10.1371/journal.pone.0176690 Text en © 2017 Xu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Xu, Qinneng
Gel, Yulia R.
Ramirez Ramirez, L. Leticia
Nezafati, Kusha
Zhang, Qingpeng
Tsui, Kwok-Leung
Forecasting influenza in Hong Kong with Google search queries and statistical model fusion
title Forecasting influenza in Hong Kong with Google search queries and statistical model fusion
title_full Forecasting influenza in Hong Kong with Google search queries and statistical model fusion
title_fullStr Forecasting influenza in Hong Kong with Google search queries and statistical model fusion
title_full_unstemmed Forecasting influenza in Hong Kong with Google search queries and statistical model fusion
title_short Forecasting influenza in Hong Kong with Google search queries and statistical model fusion
title_sort forecasting influenza in hong kong with google search queries and statistical model fusion
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5413039/
https://www.ncbi.nlm.nih.gov/pubmed/28464015
http://dx.doi.org/10.1371/journal.pone.0176690
work_keys_str_mv AT xuqinneng forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion
AT gelyuliar forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion
AT ramirezramirezlleticia forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion
AT nezafatikusha forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion
AT zhangqingpeng forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion
AT tsuikwokleung forecastinginfluenzainhongkongwithgooglesearchqueriesandstatisticalmodelfusion