Cargando…
A geospatial source selector for federated GeoSPARQL querying
Background: Geospatial linked data brings into the scope of the Semantic Web and its technologies, a wealth of datasets that combine semantically-rich descriptions of resources with their geo-location. There are, however, various Semantic Web technologies where technical work is needed in order to a...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446020/ https://www.ncbi.nlm.nih.gov/pubmed/37645331 http://dx.doi.org/10.12688/openreseurope.14605.2 |
_version_ | 1785094309550227456 |
---|---|
author | Troumpoukis, Antonis Konstantopoulos, Stasinos Prokopaki-Kostopoulou, Nefeli |
author_facet | Troumpoukis, Antonis Konstantopoulos, Stasinos Prokopaki-Kostopoulou, Nefeli |
author_sort | Troumpoukis, Antonis |
collection | PubMed |
description | Background: Geospatial linked data brings into the scope of the Semantic Web and its technologies, a wealth of datasets that combine semantically-rich descriptions of resources with their geo-location. There are, however, various Semantic Web technologies where technical work is needed in order to achieve the full integration of geospatial data, and federated query processing is one of these technologies. Methods: In this paper, we explore the idea of annotating data sources with a bounding polygon that summarizes the spatial extent of the resources in each data source, and of using such a summary as an (additional) source selection criterion in order to reduce the set of sources that will be tested as potentially holding relevant data. We present our source selection method, and we discuss its correctness and implementation. Results: We evaluate the proposed source selection using three different types of summaries with different degrees of accuracy, against not using geospatial summaries. We use datasets and queries from a practical use case that combines crop-type data with water availability data for food security. The experimental results suggest that more complex summaries lead to slower source selection times, but also to more precise exclusion of unneeded sources. Moreover, we observe the source selection runtime is (partially or fully) recovered by shorter planning and execution runtimes. As a result, the federated sources are not burdened by pointless querying from the federation engine. Conclusions: The evaluation draws on data and queries from the agroenvironmental domain and shows that our source selection method substantially improves the effectiveness of federated GeoSPARQL query processing. |
format | Online Article Text |
id | pubmed-10446020 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-104460202023-08-29 A geospatial source selector for federated GeoSPARQL querying Troumpoukis, Antonis Konstantopoulos, Stasinos Prokopaki-Kostopoulou, Nefeli Open Res Eur Method Article Background: Geospatial linked data brings into the scope of the Semantic Web and its technologies, a wealth of datasets that combine semantically-rich descriptions of resources with their geo-location. There are, however, various Semantic Web technologies where technical work is needed in order to achieve the full integration of geospatial data, and federated query processing is one of these technologies. Methods: In this paper, we explore the idea of annotating data sources with a bounding polygon that summarizes the spatial extent of the resources in each data source, and of using such a summary as an (additional) source selection criterion in order to reduce the set of sources that will be tested as potentially holding relevant data. We present our source selection method, and we discuss its correctness and implementation. Results: We evaluate the proposed source selection using three different types of summaries with different degrees of accuracy, against not using geospatial summaries. We use datasets and queries from a practical use case that combines crop-type data with water availability data for food security. The experimental results suggest that more complex summaries lead to slower source selection times, but also to more precise exclusion of unneeded sources. Moreover, we observe the source selection runtime is (partially or fully) recovered by shorter planning and execution runtimes. As a result, the federated sources are not burdened by pointless querying from the federation engine. Conclusions: The evaluation draws on data and queries from the agroenvironmental domain and shows that our source selection method substantially improves the effectiveness of federated GeoSPARQL query processing. F1000 Research Limited 2022-10-06 /pmc/articles/PMC10446020/ /pubmed/37645331 http://dx.doi.org/10.12688/openreseurope.14605.2 Text en Copyright: © 2022 Troumpoukis A et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Method Article Troumpoukis, Antonis Konstantopoulos, Stasinos Prokopaki-Kostopoulou, Nefeli A geospatial source selector for federated GeoSPARQL querying |
title | A geospatial source selector for federated GeoSPARQL querying |
title_full | A geospatial source selector for federated GeoSPARQL querying |
title_fullStr | A geospatial source selector for federated GeoSPARQL querying |
title_full_unstemmed | A geospatial source selector for federated GeoSPARQL querying |
title_short | A geospatial source selector for federated GeoSPARQL querying |
title_sort | geospatial source selector for federated geosparql querying |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446020/ https://www.ncbi.nlm.nih.gov/pubmed/37645331 http://dx.doi.org/10.12688/openreseurope.14605.2 |
work_keys_str_mv | AT troumpoukisantonis ageospatialsourceselectorforfederatedgeosparqlquerying AT konstantopoulosstasinos ageospatialsourceselectorforfederatedgeosparqlquerying AT prokopakikostopoulounefeli ageospatialsourceselectorforfederatedgeosparqlquerying AT troumpoukisantonis geospatialsourceselectorforfederatedgeosparqlquerying AT konstantopoulosstasinos geospatialsourceselectorforfederatedgeosparqlquerying AT prokopakikostopoulounefeli geospatialsourceselectorforfederatedgeosparqlquerying |