Cargando…

An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs

Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated o...

Descripción completa

Detalles Bibliográficos
Autores principales: Govindapillai, Sini, Soon, Lay-Ki, Haw, Su-Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8634049/
https://www.ncbi.nlm.nih.gov/pubmed/34900233
http://dx.doi.org/10.12688/f1000research.72843.2
_version_ 1784608057273090048
author Govindapillai, Sini
Soon, Lay-Ki
Haw, Su-Cheng
author_facet Govindapillai, Sini
Soon, Lay-Ki
Haw, Su-Cheng
author_sort Govindapillai, Sini
collection PubMed
description Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. RDF reification increases the magnitude of data as several statements are required to represent a single fact. Another limitation for applications that uses provenance data like in the medical domain and in cyber security is that not all facts in these knowledge graphs are annotated with provenance data. In this paper, we have provided an overview of prominent reification approaches together with the analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. However, facts in Wikidata and YAGO4 can be fetched without using reification to cater for applications that do not require metadata. To the best of our knowledge, this is the first paper that investigates the method and the extent of metadata covered by two prominent KGs, Wikidata and YAGO4.
format Online
Article
Text
id pubmed-8634049
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-86340492021-12-09 An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs Govindapillai, Sini Soon, Lay-Ki Haw, Su-Cheng F1000Res Research Article Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. RDF reification increases the magnitude of data as several statements are required to represent a single fact. Another limitation for applications that uses provenance data like in the medical domain and in cyber security is that not all facts in these knowledge graphs are annotated with provenance data. In this paper, we have provided an overview of prominent reification approaches together with the analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. However, facts in Wikidata and YAGO4 can be fetched without using reification to cater for applications that do not require metadata. To the best of our knowledge, this is the first paper that investigates the method and the extent of metadata covered by two prominent KGs, Wikidata and YAGO4. F1000 Research Limited 2021-11-29 /pmc/articles/PMC8634049/ /pubmed/34900233 http://dx.doi.org/10.12688/f1000research.72843.2 Text en Copyright: © 2021 Govindapillai S et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Govindapillai, Sini
Soon, Lay-Ki
Haw, Su-Cheng
An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs
title An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs
title_full An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs
title_fullStr An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs
title_full_unstemmed An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs
title_short An empirical study on Resource Description Framework reification for trustworthiness in knowledge graphs
title_sort empirical study on resource description framework reification for trustworthiness in knowledge graphs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8634049/
https://www.ncbi.nlm.nih.gov/pubmed/34900233
http://dx.doi.org/10.12688/f1000research.72843.2
work_keys_str_mv AT govindapillaisini anempiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs
AT soonlayki anempiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs
AT hawsucheng anempiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs
AT govindapillaisini empiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs
AT soonlayki empiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs
AT hawsucheng empiricalstudyonresourcedescriptionframeworkreificationfortrustworthinessinknowledgegraphs