
Detecting Synonymous Properties by Shared Data-Driven Definitions

Knowledge graphs have become an essential source of entity-centric information for modern applications. Today’s KGs have reached a size of billions of RDF triples extracted from a variety of sources, including structured sources and text. While this definitely improves completeness, the inherent var...


Bibliographic Details
Main Authors: Kalo, Jan-Christoph; Mennicke, Stephan; Ehler, Philipp; Balke, Wolf-Tilo
Format: Online Article Text
Language: English
Published: 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250622/
http://dx.doi.org/10.1007/978-3-030-49461-2_21
_version_ 1783538798697644032
author Kalo, Jan-Christoph
Mennicke, Stephan
Ehler, Philipp
Balke, Wolf-Tilo
collection PubMed
description Knowledge graphs have become an essential source of entity-centric information for modern applications. Today’s KGs have reached a size of billions of RDF triples extracted from a variety of sources, including structured sources and text. While this definitely improves completeness, the inherent variety of sources leads to severe heterogeneity, negatively affecting data quality by introducing duplicate information. We present a novel technique for detecting synonymous properties in large knowledge graphs by mining interpretable definitions of properties using association rule mining. Relying on such shared definitions, our technique is able to mine even synonym rules that have only little support in the data. In particular, our extensive experiments on DBpedia and Wikidata show that our rule-based approach can outperform state-of-the-art knowledge graph embedding techniques, while offering good interpretability through shared logical rules.
format Online
Article
Text
id pubmed-7250622
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72506222020-05-27 Detecting Synonymous Properties by Shared Data-Driven Definitions Kalo, Jan-Christoph Mennicke, Stephan Ehler, Philipp Balke, Wolf-Tilo The Semantic Web Article Knowledge graphs have become an essential source of entity-centric information for modern applications. Today’s KGs have reached a size of billions of RDF triples extracted from a variety of sources, including structured sources and text. While this definitely improves completeness, the inherent variety of sources leads to severe heterogeneity, negatively affecting data quality by introducing duplicate information. We present a novel technique for detecting synonymous properties in large knowledge graphs by mining interpretable definitions of properties using association rule mining. Relying on such shared definitions, our technique is able to mine even synonym rules that have only little support in the data. In particular, our extensive experiments on DBpedia and Wikidata show that our rule-based approach can outperform state-of-the-art knowledge graph embedding techniques, while offering good interpretability through shared logical rules. 2020-05-07 /pmc/articles/PMC7250622/ http://dx.doi.org/10.1007/978-3-030-49461-2_21 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
title Detecting Synonymous Properties by Shared Data-Driven Definitions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250622/
http://dx.doi.org/10.1007/978-3-030-49461-2_21