Cargando…

ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning

Passenger flow prediction has drawn increasing attention in the deep learning research field due to its great importance in traffic management and public safety. The major challenge of this essential task lies in multiple spatiotemporal correlations that exhibit complex non-linear correlations. Alth...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Hongwei, Luo, Haiyong, Wang, Hao, Zhao, Fang, Ke, Qixue, Wu, Mingyao, Zhao, Yunyun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472615/
https://www.ncbi.nlm.nih.gov/pubmed/32824074
http://dx.doi.org/10.3390/s20164574
_version_ 1783579020772769792
author Jia, Hongwei
Luo, Haiyong
Wang, Hao
Zhao, Fang
Ke, Qixue
Wu, Mingyao
Zhao, Yunyun
author_facet Jia, Hongwei
Luo, Haiyong
Wang, Hao
Zhao, Fang
Ke, Qixue
Wu, Mingyao
Zhao, Yunyun
author_sort Jia, Hongwei
collection PubMed
description Passenger flow prediction has drawn increasing attention in the deep learning research field due to its great importance in traffic management and public safety. The major challenge of this essential task lies in multiple spatiotemporal correlations that exhibit complex non-linear correlations. Although both the spatial and temporal perspectives have been considered in modeling, most existing works have ignored complex temporal correlations or underlying spatial similarity. In this paper, we identify the unique spatiotemporal correlation of urban metro flow, and propose an attention-based deep spatiotemporal network with multi-task learning (ADST-Net) at a citywide level to predict the future flow from historical observations. ADST-Net uses three independent channels with the same structure to model the recent, daily-periodic and weekly-periodic complicated spatiotemporal correlations, respectively. Specifically, each channel uses the framework of residual networks, the rectified block and the multi-scale convolutions to mine spatiotemporal correlations. The residual networks can effectively overcome the gradient vanishing problem. The rectified block adopts an attentional mechanism to automatically reweigh measurements at different time intervals, and the multi-scale convolutions are used to extract explicit spatial relationships. ADST-Net also introduces an external embedding mechanism to extract the influence of external factors on flow prediction, such as weather conditions. Furthermore, we enforce multi-task learning to utilize transition passenger flow volume prediction as an auxiliary task during the training process for generalization. Through this model, we can not only capture the steady trend, but also the sudden changes of passenger flow. Extensive experimental results on two real-world traffic flow datasets demonstrate the obvious improvement and superior performance of our proposed algorithm compared with state-of-the-art baselines.
format Online
Article
Text
id pubmed-7472615
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-74726152020-09-17 ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning Jia, Hongwei Luo, Haiyong Wang, Hao Zhao, Fang Ke, Qixue Wu, Mingyao Zhao, Yunyun Sensors (Basel) Article Passenger flow prediction has drawn increasing attention in the deep learning research field due to its great importance in traffic management and public safety. The major challenge of this essential task lies in multiple spatiotemporal correlations that exhibit complex non-linear correlations. Although both the spatial and temporal perspectives have been considered in modeling, most existing works have ignored complex temporal correlations or underlying spatial similarity. In this paper, we identify the unique spatiotemporal correlation of urban metro flow, and propose an attention-based deep spatiotemporal network with multi-task learning (ADST-Net) at a citywide level to predict the future flow from historical observations. ADST-Net uses three independent channels with the same structure to model the recent, daily-periodic and weekly-periodic complicated spatiotemporal correlations, respectively. Specifically, each channel uses the framework of residual networks, the rectified block and the multi-scale convolutions to mine spatiotemporal correlations. The residual networks can effectively overcome the gradient vanishing problem. The rectified block adopts an attentional mechanism to automatically reweigh measurements at different time intervals, and the multi-scale convolutions are used to extract explicit spatial relationships. ADST-Net also introduces an external embedding mechanism to extract the influence of external factors on flow prediction, such as weather conditions. Furthermore, we enforce multi-task learning to utilize transition passenger flow volume prediction as an auxiliary task during the training process for generalization. Through this model, we can not only capture the steady trend, but also the sudden changes of passenger flow. Extensive experimental results on two real-world traffic flow datasets demonstrate the obvious improvement and superior performance of our proposed algorithm compared with state-of-the-art baselines. MDPI 2020-08-14 /pmc/articles/PMC7472615/ /pubmed/32824074 http://dx.doi.org/10.3390/s20164574 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jia, Hongwei
Luo, Haiyong
Wang, Hao
Zhao, Fang
Ke, Qixue
Wu, Mingyao
Zhao, Yunyun
ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning
title ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning
title_full ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning
title_fullStr ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning
title_full_unstemmed ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning
title_short ADST: Forecasting Metro Flow Using Attention-Based Deep Spatial-Temporal Networks with Multi-Task Learning
title_sort adst: forecasting metro flow using attention-based deep spatial-temporal networks with multi-task learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472615/
https://www.ncbi.nlm.nih.gov/pubmed/32824074
http://dx.doi.org/10.3390/s20164574
work_keys_str_mv AT jiahongwei adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning
AT luohaiyong adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning
AT wanghao adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning
AT zhaofang adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning
AT keqixue adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning
AT wumingyao adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning
AT zhaoyunyun adstforecastingmetroflowusingattentionbaseddeepspatialtemporalnetworkswithmultitasklearning