Cargando…
Polygon Simplification for the Efficient Approximate Analytics of Georeferenced Big Data
The unprecedented availability of sensor networks and GPS-enabled devices has caused the accumulation of voluminous georeferenced data streams. These data streams offer an opportunity to derive valuable insights and facilitate decision making for urban planning. However, processing and managing such...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10575302/ https://www.ncbi.nlm.nih.gov/pubmed/37837008 http://dx.doi.org/10.3390/s23198178 |
_version_ | 1785120892923150336 |
---|---|
author | Al Jawarneh, Isam Mashhour Foschini, Luca Bellavista, Paolo |
author_facet | Al Jawarneh, Isam Mashhour Foschini, Luca Bellavista, Paolo |
author_sort | Al Jawarneh, Isam Mashhour |
collection | PubMed |
description | The unprecedented availability of sensor networks and GPS-enabled devices has caused the accumulation of voluminous georeferenced data streams. These data streams offer an opportunity to derive valuable insights and facilitate decision making for urban planning. However, processing and managing such data is challenging, given the size and multidimensionality of these data. Therefore, there is a growing interest in spatial approximate query processing depending on stratified-like sampling methods. However, in these solutions, as the number of strata increases, response time grows, thus counteracting the benefits of sampling. In this paper, we originally show the design and realization of a novel online geospatial approximate processing solution called GeoRAP. GeoRAP employs a front-stage filter based on the Ramer–Douglas–Peucker line simplification algorithm to reduce the size of study area coverage; thereafter, it employs a spatial stratified-like sampling method that minimizes the number of strata, thus increasing throughput and minimizing response time, while keeping the accuracy loss in check. Our method is applicable for various online and batch geospatial processing workloads, including complex geo-statistics, aggregation queries, and the generation of region-based aggregate geo-maps such as choropleth maps and heatmaps. We have extensively tested the performance of our prototyped solution with real-world big spatial data, and this paper shows that GeoRAP can outperform state-of-the-art baselines by an order of magnitude in terms of throughput while statistically obtaining results with good accuracy. |
format | Online Article Text |
id | pubmed-10575302 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-105753022023-10-14 Polygon Simplification for the Efficient Approximate Analytics of Georeferenced Big Data Al Jawarneh, Isam Mashhour Foschini, Luca Bellavista, Paolo Sensors (Basel) Article The unprecedented availability of sensor networks and GPS-enabled devices has caused the accumulation of voluminous georeferenced data streams. These data streams offer an opportunity to derive valuable insights and facilitate decision making for urban planning. However, processing and managing such data is challenging, given the size and multidimensionality of these data. Therefore, there is a growing interest in spatial approximate query processing depending on stratified-like sampling methods. However, in these solutions, as the number of strata increases, response time grows, thus counteracting the benefits of sampling. In this paper, we originally show the design and realization of a novel online geospatial approximate processing solution called GeoRAP. GeoRAP employs a front-stage filter based on the Ramer–Douglas–Peucker line simplification algorithm to reduce the size of study area coverage; thereafter, it employs a spatial stratified-like sampling method that minimizes the number of strata, thus increasing throughput and minimizing response time, while keeping the accuracy loss in check. Our method is applicable for various online and batch geospatial processing workloads, including complex geo-statistics, aggregation queries, and the generation of region-based aggregate geo-maps such as choropleth maps and heatmaps. We have extensively tested the performance of our prototyped solution with real-world big spatial data, and this paper shows that GeoRAP can outperform state-of-the-art baselines by an order of magnitude in terms of throughput while statistically obtaining results with good accuracy. MDPI 2023-09-29 /pmc/articles/PMC10575302/ /pubmed/37837008 http://dx.doi.org/10.3390/s23198178 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Al Jawarneh, Isam Mashhour Foschini, Luca Bellavista, Paolo Polygon Simplification for the Efficient Approximate Analytics of Georeferenced Big Data |
title | Polygon Simplification for the Efficient Approximate Analytics of Georeferenced Big Data |
title_full | Polygon Simplification for the Efficient Approximate Analytics of Georeferenced Big Data |
title_fullStr | Polygon Simplification for the Efficient Approximate Analytics of Georeferenced Big Data |
title_full_unstemmed | Polygon Simplification for the Efficient Approximate Analytics of Georeferenced Big Data |
title_short | Polygon Simplification for the Efficient Approximate Analytics of Georeferenced Big Data |
title_sort | polygon simplification for the efficient approximate analytics of georeferenced big data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10575302/ https://www.ncbi.nlm.nih.gov/pubmed/37837008 http://dx.doi.org/10.3390/s23198178 |
work_keys_str_mv | AT aljawarnehisammashhour polygonsimplificationfortheefficientapproximateanalyticsofgeoreferencedbigdata AT foschiniluca polygonsimplificationfortheefficientapproximateanalyticsofgeoreferencedbigdata AT bellavistapaolo polygonsimplificationfortheefficientapproximateanalyticsofgeoreferencedbigdata |