Cargando…
ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning
Passenger flow prediction has drawn increasing attention in the deep learning research field due to its great importance in traffic management and public safety. The major challenge of this essential task lies in multiple spatiotemporal correlations that exhibit complex non-linear correlations. Alth...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472615/ https://www.ncbi.nlm.nih.gov/pubmed/32824074 http://dx.doi.org/10.3390/s20164574 |
_version_ | 1783579020772769792 |
---|---|
author | Jia, Hongwei Luo, Haiyong Wang, Hao Zhao, Fang Ke, Qixue Wu, Mingyao Zhao, Yunyun |
author_facet | Jia, Hongwei Luo, Haiyong Wang, Hao Zhao, Fang Ke, Qixue Wu, Mingyao Zhao, Yunyun |
author_sort | Jia, Hongwei |
collection | PubMed |
description | Passenger flow prediction has drawn increasing attention in the deep learning research field due to its great importance in traffic management and public safety. The major challenge of this essential task lies in multiple spatiotemporal correlations that exhibit complex non-linear correlations. Although both the spatial and temporal perspectives have been considered in modeling, most existing works have ignored complex temporal correlations or underlying spatial similarity. In this paper, we identify the unique spatiotemporal correlation of urban metro flow, and propose an attention-based deep spatiotemporal network with multi-task learning (ADST-Net) at a citywide level to predict the future flow from historical observations. ADST-Net uses three independent channels with the same structure to model the recent, daily-periodic and weekly-periodic complicated spatiotemporal correlations, respectively. Specifically, each channel uses the framework of residual networks, the rectified block and the multi-scale convolutions to mine spatiotemporal correlations. The residual networks can effectively overcome the gradient vanishing problem. The rectified block adopts an attentional mechanism to automatically reweigh measurements at different time intervals, and the multi-scale convolutions are used to extract explicit spatial relationships. ADST-Net also introduces an external embedding mechanism to extract the influence of external factors on flow prediction, such as weather conditions. Furthermore, we enforce multi-task learning to utilize transition passenger flow volume prediction as an auxiliary task during the training process for generalization. Through this model, we can not only capture the steady trend, but also the sudden changes of passenger flow. Extensive experimental results on two real-world traffic flow datasets demonstrate the obvious improvement and superior performance of our proposed algorithm compared with state-of-the-art baselines. |
format | Online Article Text |
id | pubmed-7472615 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-74726152020-09-17 ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning Jia, Hongwei Luo, Haiyong Wang, Hao Zhao, Fang Ke, Qixue Wu, Mingyao Zhao, Yunyun Sensors (Basel) Article Passenger flow prediction has drawn increasing attention in the deep learning research field due to its great importance in traffic management and public safety. The major challenge of this essential task lies in multiple spatiotemporal correlations that exhibit complex non-linear correlations. Although both the spatial and temporal perspectives have been considered in modeling, most existing works have ignored complex temporal correlations or underlying spatial similarity. In this paper, we identify the unique spatiotemporal correlation of urban metro flow, and propose an attention-based deep spatiotemporal network with multi-task learning (ADST-Net) at a citywide level to predict the future flow from historical observations. ADST-Net uses three independent channels with the same structure to model the recent, daily-periodic and weekly-periodic complicated spatiotemporal correlations, respectively. Specifically, each channel uses the framework of residual networks, the rectified block and the multi-scale convolutions to mine spatiotemporal correlations. The residual networks can effectively overcome the gradient vanishing problem. The rectified block adopts an attentional mechanism to automatically reweigh measurements at different time intervals, and the multi-scale convolutions are used to extract explicit spatial relationships. ADST-Net also introduces an external embedding mechanism to extract the influence of external factors on flow prediction, such as weather conditions. Furthermore, we enforce multi-task learning to utilize transition passenger flow volume prediction as an auxiliary task during the training process for generalization. Through this model, we can not only capture the steady trend, but also the sudden changes of passenger flow. Extensive experimental results on two real-world traffic flow datasets demonstrate the obvious improvement and superior performance of our proposed algorithm compared with state-of-the-art baselines. MDPI 2020-08-14 /pmc/articles/PMC7472615/ /pubmed/32824074 http://dx.doi.org/10.3390/s20164574 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Jia, Hongwei Luo, Haiyong Wang, Hao Zhao, Fang Ke, Qixue Wu, Mingyao Zhao, Yunyun ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning |
title | ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning |
title_full | ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning |
title_fullStr | ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning |
title_full_unstemmed | ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning |
title_short | ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning |
title_sort | adst: forecasting metro flow using attention-based deep spatial-temporal networks with multi-task learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472615/ https://www.ncbi.nlm.nih.gov/pubmed/32824074 http://dx.doi.org/10.3390/s20164574 |
work_keys_str_mv | AT jiahongwei adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning AT luohaiyong adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning AT wanghao adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning AT zhaofang adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning AT keqixue adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning AT wumingyao adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning AT zhaoyunyun adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning |