Cargando…

A novel model for malaria prediction based on ensemble algorithms

BACKGROUND AND OBJECTIVE: Most previous studies adopted single traditional time series models to predict incidences of malaria. A single model cannot effectively capture all the properties of the data structure. However, a stacking architecture can solve this problem by combining distinct algorithms...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Mengyang, Wang, Hui, Wang, Jiao, Liu, Hongwei, Lu, Rui, Duan, Tongqing, Gong, Xiaowen, Feng, Siyuan, Liu, Yuanyuan, Cui, Zhuang, Li, Changping, Ma, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6932799/
https://www.ncbi.nlm.nih.gov/pubmed/31877185
http://dx.doi.org/10.1371/journal.pone.0226910
_version_ 1783483079341375488
author Wang, Mengyang
Wang, Hui
Wang, Jiao
Liu, Hongwei
Lu, Rui
Duan, Tongqing
Gong, Xiaowen
Feng, Siyuan
Liu, Yuanyuan
Cui, Zhuang
Li, Changping
Ma, Jun
author_facet Wang, Mengyang
Wang, Hui
Wang, Jiao
Liu, Hongwei
Lu, Rui
Duan, Tongqing
Gong, Xiaowen
Feng, Siyuan
Liu, Yuanyuan
Cui, Zhuang
Li, Changping
Ma, Jun
author_sort Wang, Mengyang
collection PubMed
description BACKGROUND AND OBJECTIVE: Most previous studies adopted single traditional time series models to predict incidences of malaria. A single model cannot effectively capture all the properties of the data structure. However, a stacking architecture can solve this problem by combining distinct algorithms and models. This study compares the performance of traditional time series models and deep learning algorithms in malaria case prediction and explores the application value of stacking methods in the field of infectious disease prediction. METHODS: The ARIMA, STL+ARIMA, BP-ANN and LSTM network models were separately applied in simulations using malaria data and meteorological data in Yunnan Province from 2011 to 2017. We compared the predictive performance of each model through evaluation measures: RMSE, MASE, MAD. In addition, gradient-boosting regression trees (GBRTs) were used to combine the above four models. We also determined whether stacking structure improved the model prediction performance. RESULTS: The root mean square errors (RMSEs) of the four sub-models were 13.176, 14.543, 9.571 and 7.208; the mean absolute scaled errors (MASEs) were 0.469, 0.472, 0.296 and 0.266 and the mean absolute deviation (MAD) were 6.403, 7.658, 5.871 and 5.691. After using the stacking architecture combined with the above four models, the RMSE, MASE and MAD values of the ensemble model decreased to 6.810, 0.224 and 4.625, respectively. CONCLUSIONS: A novel ensemble model based on the robustness of structured prediction and model combination through stacking was developed. The findings suggest that the predictive performance of the final model is superior to that of the other four sub-models, indicating that stacking architecture may have significant implications in infectious disease prediction.
format Online
Article
Text
id pubmed-6932799
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69327992020-01-07 A novel model for malaria prediction based on ensemble algorithms Wang, Mengyang Wang, Hui Wang, Jiao Liu, Hongwei Lu, Rui Duan, Tongqing Gong, Xiaowen Feng, Siyuan Liu, Yuanyuan Cui, Zhuang Li, Changping Ma, Jun PLoS One Research Article BACKGROUND AND OBJECTIVE: Most previous studies adopted single traditional time series models to predict incidences of malaria. A single model cannot effectively capture all the properties of the data structure. However, a stacking architecture can solve this problem by combining distinct algorithms and models. This study compares the performance of traditional time series models and deep learning algorithms in malaria case prediction and explores the application value of stacking methods in the field of infectious disease prediction. METHODS: The ARIMA, STL+ARIMA, BP-ANN and LSTM network models were separately applied in simulations using malaria data and meteorological data in Yunnan Province from 2011 to 2017. We compared the predictive performance of each model through evaluation measures: RMSE, MASE, MAD. In addition, gradient-boosting regression trees (GBRTs) were used to combine the above four models. We also determined whether stacking structure improved the model prediction performance. RESULTS: The root mean square errors (RMSEs) of the four sub-models were 13.176, 14.543, 9.571 and 7.208; the mean absolute scaled errors (MASEs) were 0.469, 0.472, 0.296 and 0.266 and the mean absolute deviation (MAD) were 6.403, 7.658, 5.871 and 5.691. After using the stacking architecture combined with the above four models, the RMSE, MASE and MAD values of the ensemble model decreased to 6.810, 0.224 and 4.625, respectively. CONCLUSIONS: A novel ensemble model based on the robustness of structured prediction and model combination through stacking was developed. The findings suggest that the predictive performance of the final model is superior to that of the other four sub-models, indicating that stacking architecture may have significant implications in infectious disease prediction. Public Library of Science 2019-12-26 /pmc/articles/PMC6932799/ /pubmed/31877185 http://dx.doi.org/10.1371/journal.pone.0226910 Text en © 2019 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Mengyang
Wang, Hui
Wang, Jiao
Liu, Hongwei
Lu, Rui
Duan, Tongqing
Gong, Xiaowen
Feng, Siyuan
Liu, Yuanyuan
Cui, Zhuang
Li, Changping
Ma, Jun
A novel model for malaria prediction based on ensemble algorithms
title A novel model for malaria prediction based on ensemble algorithms
title_full A novel model for malaria prediction based on ensemble algorithms
title_fullStr A novel model for malaria prediction based on ensemble algorithms
title_full_unstemmed A novel model for malaria prediction based on ensemble algorithms
title_short A novel model for malaria prediction based on ensemble algorithms
title_sort novel model for malaria prediction based on ensemble algorithms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6932799/
https://www.ncbi.nlm.nih.gov/pubmed/31877185
http://dx.doi.org/10.1371/journal.pone.0226910
work_keys_str_mv AT wangmengyang anovelmodelformalariapredictionbasedonensemblealgorithms
AT wanghui anovelmodelformalariapredictionbasedonensemblealgorithms
AT wangjiao anovelmodelformalariapredictionbasedonensemblealgorithms
AT liuhongwei anovelmodelformalariapredictionbasedonensemblealgorithms
AT lurui anovelmodelformalariapredictionbasedonensemblealgorithms
AT duantongqing anovelmodelformalariapredictionbasedonensemblealgorithms
AT gongxiaowen anovelmodelformalariapredictionbasedonensemblealgorithms
AT fengsiyuan anovelmodelformalariapredictionbasedonensemblealgorithms
AT liuyuanyuan anovelmodelformalariapredictionbasedonensemblealgorithms
AT cuizhuang anovelmodelformalariapredictionbasedonensemblealgorithms
AT lichangping anovelmodelformalariapredictionbasedonensemblealgorithms
AT majun anovelmodelformalariapredictionbasedonensemblealgorithms
AT wangmengyang novelmodelformalariapredictionbasedonensemblealgorithms
AT wanghui novelmodelformalariapredictionbasedonensemblealgorithms
AT wangjiao novelmodelformalariapredictionbasedonensemblealgorithms
AT liuhongwei novelmodelformalariapredictionbasedonensemblealgorithms
AT lurui novelmodelformalariapredictionbasedonensemblealgorithms
AT duantongqing novelmodelformalariapredictionbasedonensemblealgorithms
AT gongxiaowen novelmodelformalariapredictionbasedonensemblealgorithms
AT fengsiyuan novelmodelformalariapredictionbasedonensemblealgorithms
AT liuyuanyuan novelmodelformalariapredictionbasedonensemblealgorithms
AT cuizhuang novelmodelformalariapredictionbasedonensemblealgorithms
AT lichangping novelmodelformalariapredictionbasedonensemblealgorithms
AT majun novelmodelformalariapredictionbasedonensemblealgorithms