Cargando…

A geospatial source selector for federated GeoSPARQL querying

Background: Geospatial linked data brings into the scope of the Semantic Web and its technologies, a wealth of datasets that combine semantically-rich descriptions of resources with their geo-location. There are, however, various Semantic Web technologies where technical work is needed in order to a...

Descripción completa

Detalles Bibliográficos
Autores principales: Troumpoukis, Antonis, Konstantopoulos, Stasinos, Prokopaki-Kostopoulou, Nefeli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446020/
https://www.ncbi.nlm.nih.gov/pubmed/37645331
http://dx.doi.org/10.12688/openreseurope.14605.2
_version_ 1785094309550227456
author Troumpoukis, Antonis
Konstantopoulos, Stasinos
Prokopaki-Kostopoulou, Nefeli
author_facet Troumpoukis, Antonis
Konstantopoulos, Stasinos
Prokopaki-Kostopoulou, Nefeli
author_sort Troumpoukis, Antonis
collection PubMed
description Background: Geospatial linked data brings into the scope of the Semantic Web and its technologies, a wealth of datasets that combine semantically-rich descriptions of resources with their geo-location. There are, however, various Semantic Web technologies where technical work is needed in order to achieve the full integration of geospatial data, and federated query processing is one of these technologies. Methods: In this paper, we explore the idea of annotating data sources with a bounding polygon that summarizes the spatial extent of the resources in each data source, and of using such a summary as an (additional) source selection criterion in order to reduce the set of sources that will be tested as potentially holding relevant data. We present our source selection method, and we discuss its correctness and implementation. Results: We evaluate the proposed source selection using three different types of summaries with different degrees of accuracy, against not using geospatial summaries. We use datasets and queries from a practical use case that combines crop-type data with water availability data for food security. The experimental results suggest that more complex summaries lead to slower source selection times, but also to more precise exclusion of unneeded sources. Moreover, we observe the source selection runtime is (partially or fully) recovered by shorter planning and execution runtimes. As a result, the federated sources are not burdened by pointless querying from the federation engine. Conclusions: The evaluation draws on data and queries from the agroenvironmental domain and shows that our source selection method substantially improves the effectiveness of federated GeoSPARQL query processing.
format Online
Article
Text
id pubmed-10446020
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-104460202023-08-29 A geospatial source selector for federated GeoSPARQL querying Troumpoukis, Antonis Konstantopoulos, Stasinos Prokopaki-Kostopoulou, Nefeli Open Res Eur Method Article Background: Geospatial linked data brings into the scope of the Semantic Web and its technologies, a wealth of datasets that combine semantically-rich descriptions of resources with their geo-location. There are, however, various Semantic Web technologies where technical work is needed in order to achieve the full integration of geospatial data, and federated query processing is one of these technologies. Methods: In this paper, we explore the idea of annotating data sources with a bounding polygon that summarizes the spatial extent of the resources in each data source, and of using such a summary as an (additional) source selection criterion in order to reduce the set of sources that will be tested as potentially holding relevant data. We present our source selection method, and we discuss its correctness and implementation. Results: We evaluate the proposed source selection using three different types of summaries with different degrees of accuracy, against not using geospatial summaries. We use datasets and queries from a practical use case that combines crop-type data with water availability data for food security. The experimental results suggest that more complex summaries lead to slower source selection times, but also to more precise exclusion of unneeded sources. Moreover, we observe the source selection runtime is (partially or fully) recovered by shorter planning and execution runtimes. As a result, the federated sources are not burdened by pointless querying from the federation engine. Conclusions: The evaluation draws on data and queries from the agroenvironmental domain and shows that our source selection method substantially improves the effectiveness of federated GeoSPARQL query processing. F1000 Research Limited 2022-10-06 /pmc/articles/PMC10446020/ /pubmed/37645331 http://dx.doi.org/10.12688/openreseurope.14605.2 Text en Copyright: © 2022 Troumpoukis A et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Troumpoukis, Antonis
Konstantopoulos, Stasinos
Prokopaki-Kostopoulou, Nefeli
A geospatial source selector for federated GeoSPARQL querying
title A geospatial source selector for federated GeoSPARQL querying
title_full A geospatial source selector for federated GeoSPARQL querying
title_fullStr A geospatial source selector for federated GeoSPARQL querying
title_full_unstemmed A geospatial source selector for federated GeoSPARQL querying
title_short A geospatial source selector for federated GeoSPARQL querying
title_sort geospatial source selector for federated geosparql querying
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446020/
https://www.ncbi.nlm.nih.gov/pubmed/37645331
http://dx.doi.org/10.12688/openreseurope.14605.2
work_keys_str_mv AT troumpoukisantonis ageospatialsourceselectorforfederatedgeosparqlquerying
AT konstantopoulosstasinos ageospatialsourceselectorforfederatedgeosparqlquerying
AT prokopakikostopoulounefeli ageospatialsourceselectorforfederatedgeosparqlquerying
AT troumpoukisantonis geospatialsourceselectorforfederatedgeosparqlquerying
AT konstantopoulosstasinos geospatialsourceselectorforfederatedgeosparqlquerying
AT prokopakikostopoulounefeli geospatialsourceselectorforfederatedgeosparqlquerying