Cargando…

Approaches to measure class importance in Knowledge Graphs

The amount, size, complexity, and importance of Knowledge Graphs (KGs) have increased during the last decade. Many different communities have chosen to publish their datasets using Linked Data principles, which favors the integration of this information with many other sources published using the sa...

Descripción completa

Detalles Bibliográficos
Autores principales: Fernández-Álvarez, Daniel, Frey, Johannes, Labra Gayo, Jose Emilio, Gayo-Avello, Daniel, Hellmann, Sebastian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191918/
https://www.ncbi.nlm.nih.gov/pubmed/34111187
http://dx.doi.org/10.1371/journal.pone.0252862
_version_ 1783705950889181184
author Fernández-Álvarez, Daniel
Frey, Johannes
Labra Gayo, Jose Emilio
Gayo-Avello, Daniel
Hellmann, Sebastian
author_facet Fernández-Álvarez, Daniel
Frey, Johannes
Labra Gayo, Jose Emilio
Gayo-Avello, Daniel
Hellmann, Sebastian
author_sort Fernández-Álvarez, Daniel
collection PubMed
description The amount, size, complexity, and importance of Knowledge Graphs (KGs) have increased during the last decade. Many different communities have chosen to publish their datasets using Linked Data principles, which favors the integration of this information with many other sources published using the same principles and technologies. Such a scenario requires to develop techniques of Linked Data Summarization. The concept of a class is one of the core elements used to define the ontologies which sustain most of the existing KGs. Moreover, classes are an excellent tool to refer to an abstract idea which groups many individuals (or instances) in the context of a given KG, which is handy to use when producing summaries of its content. Rankings of class importance are a powerful summarization tool that can be used both to obtain a superficial view of the content of a given KG and to prioritize many different actions over the data (data quality checking, visualization, relevance for search engines…). In this paper, we analyze existing techniques to measure class importance and propose a novel approach called ClassRank. We compare the class usage in SPARQL logs of different KGs with the importance ranking produced by the approaches evaluated. Then, we discuss the strengths and weaknesses of the evaluated techniques. Our experimentation suggests that ClassRank outperforms state-of-the-art approaches measuring class importance.
format Online
Article
Text
id pubmed-8191918
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-81919182021-06-10 Approaches to measure class importance in Knowledge Graphs Fernández-Álvarez, Daniel Frey, Johannes Labra Gayo, Jose Emilio Gayo-Avello, Daniel Hellmann, Sebastian PLoS One Research Article The amount, size, complexity, and importance of Knowledge Graphs (KGs) have increased during the last decade. Many different communities have chosen to publish their datasets using Linked Data principles, which favors the integration of this information with many other sources published using the same principles and technologies. Such a scenario requires to develop techniques of Linked Data Summarization. The concept of a class is one of the core elements used to define the ontologies which sustain most of the existing KGs. Moreover, classes are an excellent tool to refer to an abstract idea which groups many individuals (or instances) in the context of a given KG, which is handy to use when producing summaries of its content. Rankings of class importance are a powerful summarization tool that can be used both to obtain a superficial view of the content of a given KG and to prioritize many different actions over the data (data quality checking, visualization, relevance for search engines…). In this paper, we analyze existing techniques to measure class importance and propose a novel approach called ClassRank. We compare the class usage in SPARQL logs of different KGs with the importance ranking produced by the approaches evaluated. Then, we discuss the strengths and weaknesses of the evaluated techniques. Our experimentation suggests that ClassRank outperforms state-of-the-art approaches measuring class importance. Public Library of Science 2021-06-10 /pmc/articles/PMC8191918/ /pubmed/34111187 http://dx.doi.org/10.1371/journal.pone.0252862 Text en © 2021 Fernández-Álvarez et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Fernández-Álvarez, Daniel
Frey, Johannes
Labra Gayo, Jose Emilio
Gayo-Avello, Daniel
Hellmann, Sebastian
Approaches to measure class importance in Knowledge Graphs
title Approaches to measure class importance in Knowledge Graphs
title_full Approaches to measure class importance in Knowledge Graphs
title_fullStr Approaches to measure class importance in Knowledge Graphs
title_full_unstemmed Approaches to measure class importance in Knowledge Graphs
title_short Approaches to measure class importance in Knowledge Graphs
title_sort approaches to measure class importance in knowledge graphs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191918/
https://www.ncbi.nlm.nih.gov/pubmed/34111187
http://dx.doi.org/10.1371/journal.pone.0252862
work_keys_str_mv AT fernandezalvarezdaniel approachestomeasureclassimportanceinknowledgegraphs
AT freyjohannes approachestomeasureclassimportanceinknowledgegraphs
AT labragayojoseemilio approachestomeasureclassimportanceinknowledgegraphs
AT gayoavellodaniel approachestomeasureclassimportanceinknowledgegraphs
AT hellmannsebastian approachestomeasureclassimportanceinknowledgegraphs