Cargando…

Querying knowledge graphs in natural language

Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of th...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liang, Shiqi, Stockinger, Kurt, de Farias, Tarcisio Mendes, Anisimova, Maria, Gil, Manuel
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2021
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7799375/ https://www.ncbi.nlm.nih.gov/pubmed/33489717 http://dx.doi.org/10.1186/s40537-020-00383-w

_version_	1783635121105010688
author	Liang, Shiqi Stockinger, Kurt de Farias, Tarcisio Mendes Anisimova, Maria Gil, Manuel
author_facet	Liang, Shiqi Stockinger, Kurt de Farias, Tarcisio Mendes Anisimova, Maria Gil, Manuel
author_sort	Liang, Shiqi
collection	PubMed
description	Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available.
format	Online Article Text
id	pubmed-7799375
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-77993752021-01-21 Querying knowledge graphs in natural language Liang, Shiqi Stockinger, Kurt de Farias, Tarcisio Mendes Anisimova, Maria Gil, Manuel J Big Data Research Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available. Springer International Publishing 2021-01-06 2021 /pmc/articles/PMC7799375/ /pubmed/33489717 http://dx.doi.org/10.1186/s40537-020-00383-w Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Research Liang, Shiqi Stockinger, Kurt de Farias, Tarcisio Mendes Anisimova, Maria Gil, Manuel Querying knowledge graphs in natural language
title	Querying knowledge graphs in natural language
title_full	Querying knowledge graphs in natural language
title_fullStr	Querying knowledge graphs in natural language
title_full_unstemmed	Querying knowledge graphs in natural language
title_short	Querying knowledge graphs in natural language
title_sort	querying knowledge graphs in natural language
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7799375/ https://www.ncbi.nlm.nih.gov/pubmed/33489717 http://dx.doi.org/10.1186/s40537-020-00383-w
work_keys_str_mv	AT liangshiqi queryingknowledgegraphsinnaturallanguage AT stockingerkurt queryingknowledgegraphsinnaturallanguage AT defariastarcisiomendes queryingknowledgegraphsinnaturallanguage AT anisimovamaria queryingknowledgegraphsinnaturallanguage AT gilmanuel queryingknowledgegraphsinnaturallanguage

Querying knowledge graphs in natural language

Ejemplares similares