Detecting Synonymous Properties by Shared Data-Driven Definitions
Knowledge graphs have become an essential source of entity-centric information for modern applications. Today’s KGs have reached a size of billions of RDF triples extracted from a variety of sources, including structured sources and text. While this definitely improves completeness, the inherent var...
Main authors: | Kalo, Jan-Christoph, Mennicke, Stephan, Ehler, Philipp, Balke, Wolf-Tilo |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | 2020 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250622/ http://dx.doi.org/10.1007/978-3-030-49461-2_21 |
_version_ | 1783538798697644032 |
---|---|
author | Kalo, Jan-Christoph Mennicke, Stephan Ehler, Philipp Balke, Wolf-Tilo |
collection | PubMed |
description | Knowledge graphs have become an essential source of entity-centric information for modern applications. Today’s KGs have reached a size of billions of RDF triples extracted from a variety of sources, including structured sources and text. While this definitely improves completeness, the inherent variety of sources leads to severe heterogeneity, negatively affecting data quality by introducing duplicate information. We present a novel technique for detecting synonymous properties in large knowledge graphs by mining interpretable definitions of properties using association rule mining. Relying on such shared definitions, our technique is able to mine even synonym rules that have only little support in the data. In particular, our extensive experiments on DBpedia and Wikidata show that our rule-based approach can outperform state-of-the-art knowledge graph embedding techniques, while offering good interpretability through shared logical rules. |
format | Online Article Text |
id | pubmed-7250622 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-7250622 2020-05-27 Detecting Synonymous Properties by Shared Data-Driven Definitions Kalo, Jan-Christoph Mennicke, Stephan Ehler, Philipp Balke, Wolf-Tilo The Semantic Web Article 2020-05-07 /pmc/articles/PMC7250622/ http://dx.doi.org/10.1007/978-3-030-49461-2_21 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
title | Detecting Synonymous Properties by Shared Data-Driven Definitions |
topic | Article |