Cargando…

A Comparative Study of Frequent Pattern Mining with Trajectory Data

Sequential pattern mining (SPM) is a major class of data mining topics with a wide range of applications. The continuity and uncertain nature of trajectory data make it distinctively different from typical transactional data, which requires additional data transformation to prepare for SPM. However,...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, Shiting, Li, Zhiheng, Zhang, Kai, Mao, Feng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571407/
https://www.ncbi.nlm.nih.gov/pubmed/36236703
http://dx.doi.org/10.3390/s22197608
_version_ 1784810356988706816
author Ding, Shiting
Li, Zhiheng
Zhang, Kai
Mao, Feng
author_facet Ding, Shiting
Li, Zhiheng
Zhang, Kai
Mao, Feng
author_sort Ding, Shiting
collection PubMed
description Sequential pattern mining (SPM) is a major class of data mining topics with a wide range of applications. The continuity and uncertain nature of trajectory data make it distinctively different from typical transactional data, which requires additional data transformation to prepare for SPM. However, little research focuses on comparing the performance of SPM algorithms and their applications in the context of trajectory data. This study selected some representative sequential pattern mining algorithms and evaluated them with various parameters to understand the effect of the involved parameters on their performances. We studied the resultant sequential patterns, runtime, and RAM consumption in the context of the taxi trajectory dataset, the T-drive dataset. It was demonstrated in this work that a method to discretize trajectory data and different SPM algorithms were performed on trajectory databases. The results were visualized on actual Beijing road maps, reflecting traffic congestion conditions. Results demonstrated contiguous constraint-based algorithms could provide a concise representation of output sequences and functions at low [Formula: see text] with balanced RAM consumption and execution time. This study can be used as a guide for academics and professionals when determining the most suitable SPM algorithm for applications that involve trajectory data.
format Online
Article
Text
id pubmed-9571407
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95714072022-10-17 A Comparative Study of Frequent Pattern Mining with Trajectory Data Ding, Shiting Li, Zhiheng Zhang, Kai Mao, Feng Sensors (Basel) Article Sequential pattern mining (SPM) is a major class of data mining topics with a wide range of applications. The continuity and uncertain nature of trajectory data make it distinctively different from typical transactional data, which requires additional data transformation to prepare for SPM. However, little research focuses on comparing the performance of SPM algorithms and their applications in the context of trajectory data. This study selected some representative sequential pattern mining algorithms and evaluated them with various parameters to understand the effect of the involved parameters on their performances. We studied the resultant sequential patterns, runtime, and RAM consumption in the context of the taxi trajectory dataset, the T-drive dataset. It was demonstrated in this work that a method to discretize trajectory data and different SPM algorithms were performed on trajectory databases. The results were visualized on actual Beijing road maps, reflecting traffic congestion conditions. Results demonstrated contiguous constraint-based algorithms could provide a concise representation of output sequences and functions at low [Formula: see text] with balanced RAM consumption and execution time. This study can be used as a guide for academics and professionals when determining the most suitable SPM algorithm for applications that involve trajectory data. MDPI 2022-10-07 /pmc/articles/PMC9571407/ /pubmed/36236703 http://dx.doi.org/10.3390/s22197608 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ding, Shiting
Li, Zhiheng
Zhang, Kai
Mao, Feng
A Comparative Study of Frequent Pattern Mining with Trajectory Data
title A Comparative Study of Frequent Pattern Mining with Trajectory Data
title_full A Comparative Study of Frequent Pattern Mining with Trajectory Data
title_fullStr A Comparative Study of Frequent Pattern Mining with Trajectory Data
title_full_unstemmed A Comparative Study of Frequent Pattern Mining with Trajectory Data
title_short A Comparative Study of Frequent Pattern Mining with Trajectory Data
title_sort comparative study of frequent pattern mining with trajectory data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571407/
https://www.ncbi.nlm.nih.gov/pubmed/36236703
http://dx.doi.org/10.3390/s22197608
work_keys_str_mv AT dingshiting acomparativestudyoffrequentpatternminingwithtrajectorydata
AT lizhiheng acomparativestudyoffrequentpatternminingwithtrajectorydata
AT zhangkai acomparativestudyoffrequentpatternminingwithtrajectorydata
AT maofeng acomparativestudyoffrequentpatternminingwithtrajectorydata
AT dingshiting comparativestudyoffrequentpatternminingwithtrajectorydata
AT lizhiheng comparativestudyoffrequentpatternminingwithtrajectorydata
AT zhangkai comparativestudyoffrequentpatternminingwithtrajectorydata
AT maofeng comparativestudyoffrequentpatternminingwithtrajectorydata