Cargando…
A Comparative Study of Frequent Pattern Mining with Trajectory Data
Sequential pattern mining (SPM) is a major class of data mining topics with a wide range of applications. The continuity and uncertain nature of trajectory data make it distinctively different from typical transactional data, which requires additional data transformation to prepare for SPM. However,...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571407/ https://www.ncbi.nlm.nih.gov/pubmed/36236703 http://dx.doi.org/10.3390/s22197608 |
_version_ | 1784810356988706816 |
---|---|
author | Ding, Shiting Li, Zhiheng Zhang, Kai Mao, Feng |
author_facet | Ding, Shiting Li, Zhiheng Zhang, Kai Mao, Feng |
author_sort | Ding, Shiting |
collection | PubMed |
description | Sequential pattern mining (SPM) is a major class of data mining topics with a wide range of applications. The continuity and uncertain nature of trajectory data make it distinctively different from typical transactional data, which requires additional data transformation to prepare for SPM. However, little research focuses on comparing the performance of SPM algorithms and their applications in the context of trajectory data. This study selected some representative sequential pattern mining algorithms and evaluated them with various parameters to understand the effect of the involved parameters on their performances. We studied the resultant sequential patterns, runtime, and RAM consumption in the context of the taxi trajectory dataset, the T-drive dataset. It was demonstrated in this work that a method to discretize trajectory data and different SPM algorithms were performed on trajectory databases. The results were visualized on actual Beijing road maps, reflecting traffic congestion conditions. Results demonstrated contiguous constraint-based algorithms could provide a concise representation of output sequences and functions at low [Formula: see text] with balanced RAM consumption and execution time. This study can be used as a guide for academics and professionals when determining the most suitable SPM algorithm for applications that involve trajectory data. |
format | Online Article Text |
id | pubmed-9571407 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95714072022-10-17 A Comparative Study of Frequent Pattern Mining with Trajectory Data Ding, Shiting Li, Zhiheng Zhang, Kai Mao, Feng Sensors (Basel) Article Sequential pattern mining (SPM) is a major class of data mining topics with a wide range of applications. The continuity and uncertain nature of trajectory data make it distinctively different from typical transactional data, which requires additional data transformation to prepare for SPM. However, little research focuses on comparing the performance of SPM algorithms and their applications in the context of trajectory data. This study selected some representative sequential pattern mining algorithms and evaluated them with various parameters to understand the effect of the involved parameters on their performances. We studied the resultant sequential patterns, runtime, and RAM consumption in the context of the taxi trajectory dataset, the T-drive dataset. It was demonstrated in this work that a method to discretize trajectory data and different SPM algorithms were performed on trajectory databases. The results were visualized on actual Beijing road maps, reflecting traffic congestion conditions. Results demonstrated contiguous constraint-based algorithms could provide a concise representation of output sequences and functions at low [Formula: see text] with balanced RAM consumption and execution time. This study can be used as a guide for academics and professionals when determining the most suitable SPM algorithm for applications that involve trajectory data. MDPI 2022-10-07 /pmc/articles/PMC9571407/ /pubmed/36236703 http://dx.doi.org/10.3390/s22197608 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ding, Shiting Li, Zhiheng Zhang, Kai Mao, Feng A Comparative Study of Frequent Pattern Mining with Trajectory Data |
title | A Comparative Study of Frequent Pattern Mining with Trajectory Data |
title_full | A Comparative Study of Frequent Pattern Mining with Trajectory Data |
title_fullStr | A Comparative Study of Frequent Pattern Mining with Trajectory Data |
title_full_unstemmed | A Comparative Study of Frequent Pattern Mining with Trajectory Data |
title_short | A Comparative Study of Frequent Pattern Mining with Trajectory Data |
title_sort | comparative study of frequent pattern mining with trajectory data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571407/ https://www.ncbi.nlm.nih.gov/pubmed/36236703 http://dx.doi.org/10.3390/s22197608 |
work_keys_str_mv | AT dingshiting acomparativestudyoffrequentpatternminingwithtrajectorydata AT lizhiheng acomparativestudyoffrequentpatternminingwithtrajectorydata AT zhangkai acomparativestudyoffrequentpatternminingwithtrajectorydata AT maofeng acomparativestudyoffrequentpatternminingwithtrajectorydata AT dingshiting comparativestudyoffrequentpatternminingwithtrajectorydata AT lizhiheng comparativestudyoffrequentpatternminingwithtrajectorydata AT zhangkai comparativestudyoffrequentpatternminingwithtrajectorydata AT maofeng comparativestudyoffrequentpatternminingwithtrajectorydata |