Cargando…
Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data
Individual daily travel activities (e.g., work, eating) are identified with various machine learning models (e.g., Bayesian Network, Random Forest) for understanding people’s frequent travel purposes. However, labor-intensive engineering work is often required to extract effective features. Addition...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9492902/ https://www.ncbi.nlm.nih.gov/pubmed/36130956 http://dx.doi.org/10.1038/s41598-022-19441-9 |
_version_ | 1784793577835986944 |
---|---|
author | Liu, Xinyi Wu, Meiliu Peng, Bo Huang, Qunying |
author_facet | Liu, Xinyi Wu, Meiliu Peng, Bo Huang, Qunying |
author_sort | Liu, Xinyi |
collection | PubMed |
description | Individual daily travel activities (e.g., work, eating) are identified with various machine learning models (e.g., Bayesian Network, Random Forest) for understanding people’s frequent travel purposes. However, labor-intensive engineering work is often required to extract effective features. Additionally, features and models are mostly calibrated for individual trajectories with regular daily travel routines and patterns, and therefore suffer from poor generalizability when applied to new trajectories with more irregular patterns. Meanwhile, most existing models cannot extract features to explicitly represent regular travel activity sequences. Therefore, this paper proposes a graph-based representation of spatiotemporal trajectories and point-of-interest (POI) data for travel activity type identification, defined as Gstp2Vec. Specifically, a weighted directed graph is constructed by connecting regular activity areas (i.e., zones) detected via clustering individual daily travel trajectories as graph nodes, with edges denoting trips between pairs of zones. Statistics of trajectories (e.g., visit frequency, activity duration) and POI distributions (e.g., percentage of restaurants) at each activity zone are encoded as node features. Next, trip frequency, average trip duration, and average trip distance are encoded as edge weights. Then a series of feedforward neural networks are trained to generate low-dimensional embeddings for activity nodes through sampling and aggregating spatiotemporal and POI features from their multihop neighborhoods. Activity type labels collected via travel surveys are used as ground truth for backpropagation. The experiment results with real-world GPS trajectories show that Gstp2Vec significantly reduces feature engineering efforts by automatically learning feature embeddings from raw trajectories with minimal prepossessing efforts. It not only enhances model generalizability to receive higher identification accuracy on test individual trajectories with diverse travel patterns, but also obtains better efficiency and robustness. In particular, our identification of the most common daily travel activities (e.g., Dwelling and Work) for people with diverse travel patterns outperforms state-of-the-art classification models. |
format | Online Article Text |
id | pubmed-9492902 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-94929022022-09-23 Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data Liu, Xinyi Wu, Meiliu Peng, Bo Huang, Qunying Sci Rep Article Individual daily travel activities (e.g., work, eating) are identified with various machine learning models (e.g., Bayesian Network, Random Forest) for understanding people’s frequent travel purposes. However, labor-intensive engineering work is often required to extract effective features. Additionally, features and models are mostly calibrated for individual trajectories with regular daily travel routines and patterns, and therefore suffer from poor generalizability when applied to new trajectories with more irregular patterns. Meanwhile, most existing models cannot extract features to explicitly represent regular travel activity sequences. Therefore, this paper proposes a graph-based representation of spatiotemporal trajectories and point-of-interest (POI) data for travel activity type identification, defined as Gstp2Vec. Specifically, a weighted directed graph is constructed by connecting regular activity areas (i.e., zones) detected via clustering individual daily travel trajectories as graph nodes, with edges denoting trips between pairs of zones. Statistics of trajectories (e.g., visit frequency, activity duration) and POI distributions (e.g., percentage of restaurants) at each activity zone are encoded as node features. Next, trip frequency, average trip duration, and average trip distance are encoded as edge weights. Then a series of feedforward neural networks are trained to generate low-dimensional embeddings for activity nodes through sampling and aggregating spatiotemporal and POI features from their multihop neighborhoods. Activity type labels collected via travel surveys are used as ground truth for backpropagation. The experiment results with real-world GPS trajectories show that Gstp2Vec significantly reduces feature engineering efforts by automatically learning feature embeddings from raw trajectories with minimal prepossessing efforts. It not only enhances model generalizability to receive higher identification accuracy on test individual trajectories with diverse travel patterns, but also obtains better efficiency and robustness. In particular, our identification of the most common daily travel activities (e.g., Dwelling and Work) for people with diverse travel patterns outperforms state-of-the-art classification models. Nature Publishing Group UK 2022-09-21 /pmc/articles/PMC9492902/ /pubmed/36130956 http://dx.doi.org/10.1038/s41598-022-19441-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Liu, Xinyi Wu, Meiliu Peng, Bo Huang, Qunying Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data |
title | Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data |
title_full | Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data |
title_fullStr | Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data |
title_full_unstemmed | Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data |
title_short | Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and POI data |
title_sort | graph-based representation for identifying individual travel activities with spatiotemporal trajectories and poi data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9492902/ https://www.ncbi.nlm.nih.gov/pubmed/36130956 http://dx.doi.org/10.1038/s41598-022-19441-9 |
work_keys_str_mv | AT liuxinyi graphbasedrepresentationforidentifyingindividualtravelactivitieswithspatiotemporaltrajectoriesandpoidata AT wumeiliu graphbasedrepresentationforidentifyingindividualtravelactivitieswithspatiotemporaltrajectoriesandpoidata AT pengbo graphbasedrepresentationforidentifyingindividualtravelactivitieswithspatiotemporaltrajectoriesandpoidata AT huangqunying graphbasedrepresentationforidentifyingindividualtravelactivitieswithspatiotemporaltrajectoriesandpoidata |