Cargando…
An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs
Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8634049/ https://www.ncbi.nlm.nih.gov/pubmed/34900233 http://dx.doi.org/10.12688/f1000research.72843.2 |
_version_ | 1784608057273090048 |
---|---|
author | Govindapillai, Sini Soon, Lay-Ki Haw, Su-Cheng |
author_facet | Govindapillai, Sini Soon, Lay-Ki Haw, Su-Cheng |
author_sort | Govindapillai, Sini |
collection | PubMed |
description | Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. RDF reification increases the magnitude of data as several statements are required to represent a single fact. Another limitation for applications that uses provenance data like in the medical domain and in cyber security is that not all facts in these knowledge graphs are annotated with provenance data. In this paper, we have provided an overview of prominent reification approaches together with the analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. However, facts in Wikidata and YAGO4 can be fetched without using reification to cater for applications that do not require metadata. To the best of our knowledge, this is the first paper that investigates the method and the extent of metadata covered by two prominent KGs, Wikidata and YAGO4. |
format | Online Article Text |
id | pubmed-8634049 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-86340492021-12-09 An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs Govindapillai, Sini Soon, Lay-Ki Haw, Su-Cheng F1000Res Research Article Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. RDF reification increases the magnitude of data as several statements are required to represent a single fact. Another limitation for applications that uses provenance data like in the medical domain and in cyber security is that not all facts in these knowledge graphs are annotated with provenance data. In this paper, we have provided an overview of prominent reification approaches together with the analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. However, facts in Wikidata and YAGO4 can be fetched without using reification to cater for applications that do not require metadata. To the best of our knowledge, this is the first paper that investigates the method and the extent of metadata covered by two prominent KGs, Wikidata and YAGO4. F1000 Research Limited 2021-11-29 /pmc/articles/PMC8634049/ /pubmed/34900233 http://dx.doi.org/10.12688/f1000research.72843.2 Text en Copyright: © 2021 Govindapillai S et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Govindapillai, Sini Soon, Lay-Ki Haw, Su-Cheng An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs |
title | An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs |
title_full | An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs |
title_fullStr | An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs |
title_full_unstemmed | An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs |
title_short | An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs |
title_sort | empirical study on resource description framework reification for trustworthiness in knowledge graphs |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8634049/ https://www.ncbi.nlm.nih.gov/pubmed/34900233 http://dx.doi.org/10.12688/f1000research.72843.2 |
work_keys_str_mv | AT govindapillaisini anempiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs AT soonlayki anempiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs AT hawsucheng anempiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs AT govindapillaisini empiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs AT soonlayki empiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs AT hawsucheng empiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs |