Cargando…

Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data

Individual daily travel activities (e.g., work, eating) are identified with various machine learning models (e.g., Bayesian Network, Random Forest) for understanding people’s frequent travel purposes. However, labor-intensive engineering work is often required to extract effective features. Addition...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Xinyi, Wu, Meiliu, Peng, Bo, Huang, Qunying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9492902/
https://www.ncbi.nlm.nih.gov/pubmed/36130956
http://dx.doi.org/10.1038/s41598-022-19441-9
_version_ 1784793577835986944
author Liu, Xinyi
Wu, Meiliu
Peng, Bo
Huang, Qunying
author_facet Liu, Xinyi
Wu, Meiliu
Peng, Bo
Huang, Qunying
author_sort Liu, Xinyi
collection PubMed
description Individual daily travel activities (e.g., work, eating) are identified with various machine learning models (e.g., Bayesian Network, Random Forest) for understanding people’s frequent travel purposes. However, labor-intensive engineering work is often required to extract effective features. Additionally, features and models are mostly calibrated for individual trajectories with regular daily travel routines and patterns, and therefore suffer from poor generalizability when applied to new trajectories with more irregular patterns. Meanwhile, most existing models cannot extract features to explicitly represent regular travel activity sequences. Therefore, this paper proposes a graph-based representation of spatiotemporal trajectories and point-of-interest (POI) data for travel activity type identification, defined as Gstp2Vec. Specifically, a weighted directed graph is constructed by connecting regular activity areas (i.e., zones) detected via clustering individual daily travel trajectories as graph nodes, with edges denoting trips between pairs of zones. Statistics of trajectories (e.g., visit frequency, activity duration) and POI distributions (e.g., percentage of restaurants) at each activity zone are encoded as node features. Next, trip frequency, average trip duration, and average trip distance are encoded as edge weights. Then a series of feedforward neural networks are trained to generate low-dimensional embeddings for activity nodes through sampling and aggregating spatiotemporal and POI features from their multihop neighborhoods. Activity type labels collected via travel surveys are used as ground truth for backpropagation. The experiment results with real-world GPS trajectories show that Gstp2Vec significantly reduces feature engineering efforts by automatically learning feature embeddings from raw trajectories with minimal prepossessing efforts. It not only enhances model generalizability to receive higher identification accuracy on test individual trajectories with diverse travel patterns, but also obtains better efficiency and robustness. In particular, our identification of the most common daily travel activities (e.g., Dwelling and Work) for people with diverse travel patterns outperforms state-of-the-art classification models.
format Online
Article
Text
id pubmed-9492902
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-94929022022-09-23 Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data Liu, Xinyi Wu, Meiliu Peng, Bo Huang, Qunying Sci Rep Article Individual daily travel activities (e.g., work, eating) are identified with various machine learning models (e.g., Bayesian Network, Random Forest) for understanding people’s frequent travel purposes. However, labor-intensive engineering work is often required to extract effective features. Additionally, features and models are mostly calibrated for individual trajectories with regular daily travel routines and patterns, and therefore suffer from poor generalizability when applied to new trajectories with more irregular patterns. Meanwhile, most existing models cannot extract features to explicitly represent regular travel activity sequences. Therefore, this paper proposes a graph-based representation of spatiotemporal trajectories and point-of-interest (POI) data for travel activity type identification, defined as Gstp2Vec. Specifically, a weighted directed graph is constructed by connecting regular activity areas (i.e., zones) detected via clustering individual daily travel trajectories as graph nodes, with edges denoting trips between pairs of zones. Statistics of trajectories (e.g., visit frequency, activity duration) and POI distributions (e.g., percentage of restaurants) at each activity zone are encoded as node features. Next, trip frequency, average trip duration, and average trip distance are encoded as edge weights. Then a series of feedforward neural networks are trained to generate low-dimensional embeddings for activity nodes through sampling and aggregating spatiotemporal and POI features from their multihop neighborhoods. Activity type labels collected via travel surveys are used as ground truth for backpropagation. The experiment results with real-world GPS trajectories show that Gstp2Vec significantly reduces feature engineering efforts by automatically learning feature embeddings from raw trajectories with minimal prepossessing efforts. It not only enhances model generalizability to receive higher identification accuracy on test individual trajectories with diverse travel patterns, but also obtains better efficiency and robustness. In particular, our identification of the most common daily travel activities (e.g., Dwelling and Work) for people with diverse travel patterns outperforms state-of-the-art classification models. Nature Publishing Group UK 2022-09-21 /pmc/articles/PMC9492902/ /pubmed/36130956 http://dx.doi.org/10.1038/s41598-022-19441-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Liu, Xinyi
Wu, Meiliu
Peng, Bo
Huang, Qunying
Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data
title Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data
title_full Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data
title_fullStr Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data
title_full_unstemmed Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data
title_short Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data
title_sort graph-based representation for identifying individual travel activities with spatiotemporal trajectories and poi data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9492902/
https://www.ncbi.nlm.nih.gov/pubmed/36130956
http://dx.doi.org/10.1038/s41598-022-19441-9
work_keys_str_mv AT liuxinyi graphbasedrepresentationforidentifyingindividualtravelactivitieswithspatiotemporaltrajectoriesandpoidata
AT wumeiliu graphbasedrepresentationforidentifyingindividualtravelactivitieswithspatiotemporaltrajectoriesandpoidata
AT pengbo graphbasedrepresentationforidentifyingindividualtravelactivitieswithspatiotemporaltrajectoriesandpoidata
AT huangqunying graphbasedrepresentationforidentifyingindividualtravelactivitieswithspatiotemporaltrajectoriesandpoidata