Cargando…

Keyword Search over RDF Using Document-Centric Information Retrieval Systems

For ordinary users, the task of accessing knowledge graphs through structured query languages like SPARQL is rather demanding. As a result, various approaches exploit the simpler and widely used keyword-based search paradigm, either by translating keyword queries to structured queries, or by adoptin...

Descripción completa

Detalles Bibliográficos
Autores principales: Kadilierakis, Giorgos, Fafalios, Pavlos, Papadakos, Panagiotis, Tzitzikas, Yannis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250595/
http://dx.doi.org/10.1007/978-3-030-49461-2_8
_version_ 1783538792074838016
author Kadilierakis, Giorgos
Fafalios, Pavlos
Papadakos, Panagiotis
Tzitzikas, Yannis
author_facet Kadilierakis, Giorgos
Fafalios, Pavlos
Papadakos, Panagiotis
Tzitzikas, Yannis
author_sort Kadilierakis, Giorgos
collection PubMed
description For ordinary users, the task of accessing knowledge graphs through structured query languages like SPARQL is rather demanding. As a result, various approaches exploit the simpler and widely used keyword-based search paradigm, either by translating keyword queries to structured queries, or by adopting classical information retrieval (IR) techniques. In this paper, we study and adapt Elasticsearch, an out-of-the-box document-centric IR system, for supporting keyword search over RDF datasets. Contrary to other works that mainly retrieve entities, we opt for retrieving triples, due to their expressiveness and informativeness. We specify the set of functional requirements and study the emerging questions related to the selection and weighting of the triple data to index, and the structuring and ranking of the retrieved results. Finally, we perform an extensive evaluation of the different factors that affect the IR performance for four different query types. The reported results are promising and offer useful insights on how different Elasticsearch configurations affect the retrieval effectiveness and efficiency.
format Online
Article
Text
id pubmed-7250595
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72505952020-05-27 Keyword Search over RDF Using Document-Centric Information Retrieval Systems Kadilierakis, Giorgos Fafalios, Pavlos Papadakos, Panagiotis Tzitzikas, Yannis The Semantic Web Article For ordinary users, the task of accessing knowledge graphs through structured query languages like SPARQL is rather demanding. As a result, various approaches exploit the simpler and widely used keyword-based search paradigm, either by translating keyword queries to structured queries, or by adopting classical information retrieval (IR) techniques. In this paper, we study and adapt Elasticsearch, an out-of-the-box document-centric IR system, for supporting keyword search over RDF datasets. Contrary to other works that mainly retrieve entities, we opt for retrieving triples, due to their expressiveness and informativeness. We specify the set of functional requirements and study the emerging questions related to the selection and weighting of the triple data to index, and the structuring and ranking of the retrieved results. Finally, we perform an extensive evaluation of the different factors that affect the IR performance for four different query types. The reported results are promising and offer useful insights on how different Elasticsearch configurations affect the retrieval effectiveness and efficiency. 2020-05-07 /pmc/articles/PMC7250595/ http://dx.doi.org/10.1007/978-3-030-49461-2_8 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Kadilierakis, Giorgos
Fafalios, Pavlos
Papadakos, Panagiotis
Tzitzikas, Yannis
Keyword Search over RDF Using Document-Centric Information Retrieval Systems
title Keyword Search over RDF Using Document-Centric Information Retrieval Systems
title_full Keyword Search over RDF Using Document-Centric Information Retrieval Systems
title_fullStr Keyword Search over RDF Using Document-Centric Information Retrieval Systems
title_full_unstemmed Keyword Search over RDF Using Document-Centric Information Retrieval Systems
title_short Keyword Search over RDF Using Document-Centric Information Retrieval Systems
title_sort keyword search over rdf using document-centric information retrieval systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250595/
http://dx.doi.org/10.1007/978-3-030-49461-2_8
work_keys_str_mv AT kadilierakisgiorgos keywordsearchoverrdfusingdocumentcentricinformationretrievalsystems
AT fafaliospavlos keywordsearchoverrdfusingdocumentcentricinformationretrievalsystems
AT papadakospanagiotis keywordsearchoverrdfusingdocumentcentricinformationretrievalsystems
AT tzitzikasyannis keywordsearchoverrdfusingdocumentcentricinformationretrievalsystems