Cargando…

Developing a dengue forecast model using machine learning: A case study in China

BACKGROUND: In China, dengue remains an important public health issue with expanded areas and increased incidence recently. Accurate and timely forecasts of dengue incidence in China are still lacking. We aimed to use the state-of-the-art machine learning algorithms to develop an accurate predictive...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Pi, Liu, Tao, Zhang, Qin, Wang, Li, Xiao, Jianpeng, Zhang, Qingying, Luo, Ganfeng, Li, Zhihao, He, Jianfeng, Zhang, Yonghui, Ma, Wenjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5658193/
https://www.ncbi.nlm.nih.gov/pubmed/29036169
http://dx.doi.org/10.1371/journal.pntd.0005973
_version_ 1783273952220545024
author Guo, Pi
Liu, Tao
Zhang, Qin
Wang, Li
Xiao, Jianpeng
Zhang, Qingying
Luo, Ganfeng
Li, Zhihao
He, Jianfeng
Zhang, Yonghui
Ma, Wenjun
author_facet Guo, Pi
Liu, Tao
Zhang, Qin
Wang, Li
Xiao, Jianpeng
Zhang, Qingying
Luo, Ganfeng
Li, Zhihao
He, Jianfeng
Zhang, Yonghui
Ma, Wenjun
author_sort Guo, Pi
collection PubMed
description BACKGROUND: In China, dengue remains an important public health issue with expanded areas and increased incidence recently. Accurate and timely forecasts of dengue incidence in China are still lacking. We aimed to use the state-of-the-art machine learning algorithms to develop an accurate predictive model of dengue. METHODOLOGY/PRINCIPAL FINDINGS: Weekly dengue cases, Baidu search queries and climate factors (mean temperature, relative humidity and rainfall) during 2011–2014 in Guangdong were gathered. A dengue search index was constructed for developing the predictive models in combination with climate factors. The observed year and week were also included in the models to control for the long-term trend and seasonality. Several machine learning algorithms, including the support vector regression (SVR) algorithm, step-down linear regression model, gradient boosted regression tree algorithm (GBM), negative binomial regression model (NBM), least absolute shrinkage and selection operator (LASSO) linear regression model and generalized additive model (GAM), were used as candidate models to predict dengue incidence. Performance and goodness of fit of the models were assessed using the root-mean-square error (RMSE) and R-squared measures. The residuals of the models were examined using the autocorrelation and partial autocorrelation function analyses to check the validity of the models. The models were further validated using dengue surveillance data from five other provinces. The epidemics during the last 12 weeks and the peak of the 2014 large outbreak were accurately forecasted by the SVR model selected by a cross-validation technique. Moreover, the SVR model had the consistently smallest prediction error rates for tracking the dynamics of dengue and forecasting the outbreaks in other areas in China. CONCLUSION AND SIGNIFICANCE: The proposed SVR model achieved a superior performance in comparison with other forecasting techniques assessed in this study. The findings can help the government and community respond early to dengue epidemics.
format Online
Article
Text
id pubmed-5658193
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56581932017-11-09 Developing a dengue forecast model using machine learning: A case study in China Guo, Pi Liu, Tao Zhang, Qin Wang, Li Xiao, Jianpeng Zhang, Qingying Luo, Ganfeng Li, Zhihao He, Jianfeng Zhang, Yonghui Ma, Wenjun PLoS Negl Trop Dis Research Article BACKGROUND: In China, dengue remains an important public health issue with expanded areas and increased incidence recently. Accurate and timely forecasts of dengue incidence in China are still lacking. We aimed to use the state-of-the-art machine learning algorithms to develop an accurate predictive model of dengue. METHODOLOGY/PRINCIPAL FINDINGS: Weekly dengue cases, Baidu search queries and climate factors (mean temperature, relative humidity and rainfall) during 2011–2014 in Guangdong were gathered. A dengue search index was constructed for developing the predictive models in combination with climate factors. The observed year and week were also included in the models to control for the long-term trend and seasonality. Several machine learning algorithms, including the support vector regression (SVR) algorithm, step-down linear regression model, gradient boosted regression tree algorithm (GBM), negative binomial regression model (NBM), least absolute shrinkage and selection operator (LASSO) linear regression model and generalized additive model (GAM), were used as candidate models to predict dengue incidence. Performance and goodness of fit of the models were assessed using the root-mean-square error (RMSE) and R-squared measures. The residuals of the models were examined using the autocorrelation and partial autocorrelation function analyses to check the validity of the models. The models were further validated using dengue surveillance data from five other provinces. The epidemics during the last 12 weeks and the peak of the 2014 large outbreak were accurately forecasted by the SVR model selected by a cross-validation technique. Moreover, the SVR model had the consistently smallest prediction error rates for tracking the dynamics of dengue and forecasting the outbreaks in other areas in China. CONCLUSION AND SIGNIFICANCE: The proposed SVR model achieved a superior performance in comparison with other forecasting techniques assessed in this study. The findings can help the government and community respond early to dengue epidemics. Public Library of Science 2017-10-16 /pmc/articles/PMC5658193/ /pubmed/29036169 http://dx.doi.org/10.1371/journal.pntd.0005973 Text en © 2017 Guo et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Guo, Pi
Liu, Tao
Zhang, Qin
Wang, Li
Xiao, Jianpeng
Zhang, Qingying
Luo, Ganfeng
Li, Zhihao
He, Jianfeng
Zhang, Yonghui
Ma, Wenjun
Developing a dengue forecast model using machine learning: A case study in China
title Developing a dengue forecast model using machine learning: A case study in China
title_full Developing a dengue forecast model using machine learning: A case study in China
title_fullStr Developing a dengue forecast model using machine learning: A case study in China
title_full_unstemmed Developing a dengue forecast model using machine learning: A case study in China
title_short Developing a dengue forecast model using machine learning: A case study in China
title_sort developing a dengue forecast model using machine learning: a case study in china
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5658193/
https://www.ncbi.nlm.nih.gov/pubmed/29036169
http://dx.doi.org/10.1371/journal.pntd.0005973
work_keys_str_mv AT guopi developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT liutao developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT zhangqin developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT wangli developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT xiaojianpeng developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT zhangqingying developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT luoganfeng developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT lizhihao developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT hejianfeng developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT zhangyonghui developingadengueforecastmodelusingmachinelearningacasestudyinchina
AT mawenjun developingadengueforecastmodelusingmachinelearningacasestudyinchina