An Efficient Method of Sharing Mass Spatio-Temporal Trajectory Data Based on Cloudera Impala for Traffic Distribution Mapping in an Urban City
The efficient sharing of spatio-temporal trajectory data is important to understand traffic congestion in mass data. However, the data volumes of bus networks in urban cities are growing rapidly, reaching daily volumes of one hundred million datapoints. Accessing and retrieving mass spatio-temporal...
Main Authors: | Zhou, Lianjie; Chen, Nengcheng; Yuan, Sai; Chen, Zeqiang |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2016 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5134472/ https://www.ncbi.nlm.nih.gov/pubmed/27801869 http://dx.doi.org/10.3390/s16111813 |
_version_ | 1782471460210606080 |
---|---|
author | Zhou, Lianjie Chen, Nengcheng Yuan, Sai Chen, Zeqiang |
author_facet | Zhou, Lianjie Chen, Nengcheng Yuan, Sai Chen, Zeqiang |
author_sort | Zhou, Lianjie |
collection | PubMed |
description | The efficient sharing of spatio-temporal trajectory data is important for understanding traffic congestion from mass data. However, the data volumes of bus networks in urban cities are growing rapidly, reaching daily volumes of one hundred million data points. Accessing and retrieving mass spatio-temporal trajectory data in any field is difficult and inefficient because of limited computational capabilities and incomplete data organization mechanisms. Therefore, we propose an optimized and efficient spatio-temporal trajectory data retrieval method based on the Cloudera Impala query engine, called ESTRI, to enhance the efficiency of mass data sharing. As a query tool well suited to mass data, Impala can be applied to mass spatio-temporal trajectory data sharing. In ESTRI, we extend the spatio-temporal trajectory data retrieval function of Impala and design a suitable data partitioning method. For our experiments, we selected the Taiyuan BeiDou (BD) bus network, which contains 2300 buses with BD positioning sensors and produces 20 million records every day, resulting in two difficulties as described in the Introduction section. Both ESTRI and MongoDB are applied in the experiments. The experiments show that ESTRI retrieves data more efficiently than MongoDB for data volumes of fifty million, one hundred million, one hundred and fifty million, and two hundred million, with performance approximately seven times higher than that of MongoDB, and that ESTRI is an effective method for retrieving mass spatio-temporal trajectory data. Finally, bus distribution mapping in Taiyuan city is achieved, describing the bus density in different regions at different times throughout the day, which can be applied in future transport studies, such as traffic scheduling, traffic planning and traffic behavior management in intelligent public transportation systems. |
format | Online Article Text |
id | pubmed-5134472 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-5134472 2017-01-03 An Efficient Method of Sharing Mass Spatio-Temporal Trajectory Data Based on Cloudera Impala for Traffic Distribution Mapping in an Urban City Zhou, Lianjie Chen, Nengcheng Yuan, Sai Chen, Zeqiang Sensors (Basel) Article MDPI 2016-10-29 /pmc/articles/PMC5134472/ /pubmed/27801869 http://dx.doi.org/10.3390/s16111813 Text en © 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhou, Lianjie Chen, Nengcheng Yuan, Sai Chen, Zeqiang An Efficient Method of Sharing Mass Spatio-Temporal Trajectory Data Based on Cloudera Impala for Traffic Distribution Mapping in an Urban City |
title | An Efficient Method of Sharing Mass Spatio-Temporal Trajectory Data Based on Cloudera Impala for Traffic Distribution Mapping in an Urban City |
title_sort | efficient method of sharing mass spatio-temporal trajectory data based on cloudera impala for traffic distribution mapping in an urban city |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5134472/ https://www.ncbi.nlm.nih.gov/pubmed/27801869 http://dx.doi.org/10.3390/s16111813 |
work_keys_str_mv | AT zhoulianjie anefficientmethodofsharingmassspatiotemporaltrajectorydatabasedonclouderaimpalafortrafficdistributionmappinginanurbancity AT chennengcheng anefficientmethodofsharingmassspatiotemporaltrajectorydatabasedonclouderaimpalafortrafficdistributionmappinginanurbancity AT yuansai anefficientmethodofsharingmassspatiotemporaltrajectorydatabasedonclouderaimpalafortrafficdistributionmappinginanurbancity AT chenzeqiang anefficientmethodofsharingmassspatiotemporaltrajectorydatabasedonclouderaimpalafortrafficdistributionmappinginanurbancity AT zhoulianjie efficientmethodofsharingmassspatiotemporaltrajectorydatabasedonclouderaimpalafortrafficdistributionmappinginanurbancity AT chennengcheng efficientmethodofsharingmassspatiotemporaltrajectorydatabasedonclouderaimpalafortrafficdistributionmappinginanurbancity AT yuansai efficientmethodofsharingmassspatiotemporaltrajectorydatabasedonclouderaimpalafortrafficdistributionmappinginanurbancity AT chenzeqiang efficientmethodofsharingmassspatiotemporaltrajectorydatabasedonclouderaimpalafortrafficdistributionmappinginanurbancity |
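
The description above mentions mapping bus density in different regions at different times of day. As a rough illustration of that idea only, the following minimal Python sketch counts distinct buses per grid cell and hour. It is not the authors' ESTRI method and does not use Impala; the record layout `(bus_id, timestamp, lon, lat)`, the `CELL_DEG` grid size, the function names, and the sample coordinates are all assumptions made for this example.

```python
from collections import defaultdict
from datetime import datetime
import math

# Hypothetical grid resolution in degrees; the record does not state one.
CELL_DEG = 0.01


def cell_of(lon, lat, cell_deg=CELL_DEG):
    """Map a longitude/latitude pair to an integer grid-cell index."""
    return (math.floor(lon / cell_deg), math.floor(lat / cell_deg))


def bus_density(records, cell_deg=CELL_DEG):
    """Count distinct buses seen in each (grid cell, hour of day) bucket.

    `records` is an iterable of (bus_id, timestamp, lon, lat) tuples,
    an assumed layout chosen for this sketch.
    """
    buses = defaultdict(set)
    for bus_id, ts, lon, lat in records:
        key = (cell_of(lon, lat, cell_deg), ts.hour)
        buses[key].add(bus_id)  # a set ignores repeated fixes from one bus
    return {key: len(ids) for key, ids in buses.items()}


# Made-up sample positions, not data from the article.
sample = [
    ("bus-1", datetime(2016, 1, 1, 8, 5), 112.551, 37.871),
    ("bus-1", datetime(2016, 1, 1, 8, 6), 112.552, 37.872),
    ("bus-2", datetime(2016, 1, 1, 8, 7), 112.553, 37.873),
    ("bus-2", datetime(2016, 1, 1, 9, 0), 112.601, 37.901),
]
density = bus_density(sample)
```

Counting distinct bus identifiers rather than raw records keeps a bus that reports several positions in the same cell and hour from inflating the density. At the volumes the description cites, the same grouping would presumably be pushed down into the query engine rather than done in application memory.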