Cargando…

A Variational Bayesian Deep Network with Data Self-Screening Layer for Massive Time-Series Data Forecasting

Compared with mechanism-based modeling methods, data-driven modeling based on big data has become a popular research field in recent years because of its applicability. However, it is not always better to have more data when building a forecasting model in practical areas. Due to the noise and confl...

Descripción completa

Detalles Bibliográficos
Autores principales: Jin, Xue-Bo, Gong, Wen-Tao, Kong, Jian-Lei, Bai, Yu-Ting, Su, Ting-Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947458/
https://www.ncbi.nlm.nih.gov/pubmed/35327846
http://dx.doi.org/10.3390/e24030335
_version_ 1784674444055150592
author Jin, Xue-Bo
Gong, Wen-Tao
Kong, Jian-Lei
Bai, Yu-Ting
Su, Ting-Li
author_facet Jin, Xue-Bo
Gong, Wen-Tao
Kong, Jian-Lei
Bai, Yu-Ting
Su, Ting-Li
author_sort Jin, Xue-Bo
collection PubMed
description Compared with mechanism-based modeling methods, data-driven modeling based on big data has become a popular research field in recent years because of its applicability. However, it is not always better to have more data when building a forecasting model in practical areas. Due to the noise and conflict, redundancy, and inconsistency of big time-series data, the forecasting accuracy may reduce on the contrary. This paper proposes a deep network by selecting and understanding data to improve performance. Firstly, a data self-screening layer (DSSL) with a maximal information distance coefficient (MIDC) is designed to filter input data with high correlation and low redundancy; then, a variational Bayesian gated recurrent unit (VBGRU) is used to improve the anti-noise ability and robustness of the model. Beijing’s air quality and meteorological data are conducted in a verification experiment of 24 h PM2.5 concentration forecasting, proving that the proposed model is superior to other models in accuracy.
format Online
Article
Text
id pubmed-8947458
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89474582022-03-25 A Variational Bayesian Deep Network with Data Self-Screening Layer for Massive Time-Series Data Forecasting Jin, Xue-Bo Gong, Wen-Tao Kong, Jian-Lei Bai, Yu-Ting Su, Ting-Li Entropy (Basel) Article Compared with mechanism-based modeling methods, data-driven modeling based on big data has become a popular research field in recent years because of its applicability. However, it is not always better to have more data when building a forecasting model in practical areas. Due to the noise and conflict, redundancy, and inconsistency of big time-series data, the forecasting accuracy may reduce on the contrary. This paper proposes a deep network by selecting and understanding data to improve performance. Firstly, a data self-screening layer (DSSL) with a maximal information distance coefficient (MIDC) is designed to filter input data with high correlation and low redundancy; then, a variational Bayesian gated recurrent unit (VBGRU) is used to improve the anti-noise ability and robustness of the model. Beijing’s air quality and meteorological data are conducted in a verification experiment of 24 h PM2.5 concentration forecasting, proving that the proposed model is superior to other models in accuracy. MDPI 2022-02-25 /pmc/articles/PMC8947458/ /pubmed/35327846 http://dx.doi.org/10.3390/e24030335 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jin, Xue-Bo
Gong, Wen-Tao
Kong, Jian-Lei
Bai, Yu-Ting
Su, Ting-Li
A Variational Bayesian Deep Network with Data Self-Screening Layer for Massive Time-Series Data Forecasting
title A Variational Bayesian Deep Network with Data Self-Screening Layer for Massive Time-Series Data Forecasting
title_full A Variational Bayesian Deep Network with Data Self-Screening Layer for Massive Time-Series Data Forecasting
title_fullStr A Variational Bayesian Deep Network with Data Self-Screening Layer for Massive Time-Series Data Forecasting
title_full_unstemmed A Variational Bayesian Deep Network with Data Self-Screening Layer for Massive Time-Series Data Forecasting
title_short A Variational Bayesian Deep Network with Data Self-Screening Layer for Massive Time-Series Data Forecasting
title_sort variational bayesian deep network with data self-screening layer for massive time-series data forecasting
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947458/
https://www.ncbi.nlm.nih.gov/pubmed/35327846
http://dx.doi.org/10.3390/e24030335
work_keys_str_mv AT jinxuebo avariationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting
AT gongwentao avariationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting
AT kongjianlei avariationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting
AT baiyuting avariationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting
AT sutingli avariationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting
AT jinxuebo variationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting
AT gongwentao variationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting
AT kongjianlei variationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting
AT baiyuting variationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting
AT sutingli variationalbayesiandeepnetworkwithdataselfscreeninglayerformassivetimeseriesdataforecasting