Cargando…

QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams

Large amounts of georeferenced data streams arrive daily to stream processing systems. This is attributable to the overabundance of affordable IoT devices. In addition, interested practitioners desire to exploit Internet of Things (IoT) data streams for strategic decision-making purposes. However, m...

Descripción completa

Detalles Bibliográficos
Autores principales: Al Jawarneh, Isam Mashhour, Bellavista, Paolo, Corradi, Antonio, Foschini, Luca, Montanari, Rebecca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8235266/
https://www.ncbi.nlm.nih.gov/pubmed/34204451
http://dx.doi.org/10.3390/s21124160
_version_ 1783714276380246016
author Al Jawarneh, Isam Mashhour
Bellavista, Paolo
Corradi, Antonio
Foschini, Luca
Montanari, Rebecca
author_facet Al Jawarneh, Isam Mashhour
Bellavista, Paolo
Corradi, Antonio
Foschini, Luca
Montanari, Rebecca
author_sort Al Jawarneh, Isam Mashhour
collection PubMed
description Large amounts of georeferenced data streams arrive daily to stream processing systems. This is attributable to the overabundance of affordable IoT devices. In addition, interested practitioners desire to exploit Internet of Things (IoT) data streams for strategic decision-making purposes. However, mobility data are highly skewed and their arrival rates fluctuate. This nature poses an extra challenge on data stream processing systems, which are required in order to achieve pre-specified latency and accuracy goals. In this paper, we propose ApproxSSPS, which is a system for approximate processing of geo-referenced mobility data, at scale with quality of service guarantees. We focus on stateful aggregations (e.g., means, counts) and top-N queries. ApproxSSPS features a controller that interactively learns the latency statistics and calculates proper sampling rates to meet latency or/and accuracy targets. An overarching trait of ApproxSSPS is its ability to strike a plausible balance between latency and accuracy targets. We evaluate ApproxSSPS on Apache Spark Structured Streaming with real mobility data. We also compared ApproxSSPS against a state-of-the-art online adaptive processing system. Our extensive experiments prove that ApproxSSPS can fulfill latency and accuracy targets with varying sets of parameter configurations and load intensities (i.e., transient peaks in data loads versus slow arriving streams). Moreover, our results show that ApproxSSPS outperforms the baseline counterpart by significant magnitudes. In short, ApproxSSPS is a novel spatial data stream processing system that can deliver real accurate results in a timely manner, by dynamically specifying the limits on data samples.
format Online
Article
Text
id pubmed-8235266
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-82352662021-06-27 QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams Al Jawarneh, Isam Mashhour Bellavista, Paolo Corradi, Antonio Foschini, Luca Montanari, Rebecca Sensors (Basel) Article Large amounts of georeferenced data streams arrive daily to stream processing systems. This is attributable to the overabundance of affordable IoT devices. In addition, interested practitioners desire to exploit Internet of Things (IoT) data streams for strategic decision-making purposes. However, mobility data are highly skewed and their arrival rates fluctuate. This nature poses an extra challenge on data stream processing systems, which are required in order to achieve pre-specified latency and accuracy goals. In this paper, we propose ApproxSSPS, which is a system for approximate processing of geo-referenced mobility data, at scale with quality of service guarantees. We focus on stateful aggregations (e.g., means, counts) and top-N queries. ApproxSSPS features a controller that interactively learns the latency statistics and calculates proper sampling rates to meet latency or/and accuracy targets. An overarching trait of ApproxSSPS is its ability to strike a plausible balance between latency and accuracy targets. We evaluate ApproxSSPS on Apache Spark Structured Streaming with real mobility data. We also compared ApproxSSPS against a state-of-the-art online adaptive processing system. Our extensive experiments prove that ApproxSSPS can fulfill latency and accuracy targets with varying sets of parameter configurations and load intensities (i.e., transient peaks in data loads versus slow arriving streams). Moreover, our results show that ApproxSSPS outperforms the baseline counterpart by significant magnitudes. In short, ApproxSSPS is a novel spatial data stream processing system that can deliver real accurate results in a timely manner, by dynamically specifying the limits on data samples. MDPI 2021-06-17 /pmc/articles/PMC8235266/ /pubmed/34204451 http://dx.doi.org/10.3390/s21124160 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Al Jawarneh, Isam Mashhour
Bellavista, Paolo
Corradi, Antonio
Foschini, Luca
Montanari, Rebecca
QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_full QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_fullStr QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_full_unstemmed QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_short QoS-Aware Approximate Query Processing for Smart Cities Spatial Data Streams
title_sort qos-aware approximate query processing for smart cities spatial data streams
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8235266/
https://www.ncbi.nlm.nih.gov/pubmed/34204451
http://dx.doi.org/10.3390/s21124160
work_keys_str_mv AT aljawarnehisammashhour qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams
AT bellavistapaolo qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams
AT corradiantonio qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams
AT foschiniluca qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams
AT montanarirebecca qosawareapproximatequeryprocessingforsmartcitiesspatialdatastreams