Cargando…
Approaches to measure class importance in Knowledge Graphs
The amount, size, complexity, and importance of Knowledge Graphs (KGs) have increased during the last decade. Many different communities have chosen to publish their datasets using Linked Data principles, which favors the integration of this information with many other sources published using the sa...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191918/ https://www.ncbi.nlm.nih.gov/pubmed/34111187 http://dx.doi.org/10.1371/journal.pone.0252862 |
_version_ | 1783705950889181184 |
---|---|
author | Fernández-Álvarez, Daniel Frey, Johannes Labra Gayo, Jose Emilio Gayo-Avello, Daniel Hellmann, Sebastian |
author_facet | Fernández-Álvarez, Daniel Frey, Johannes Labra Gayo, Jose Emilio Gayo-Avello, Daniel Hellmann, Sebastian |
author_sort | Fernández-Álvarez, Daniel |
collection | PubMed |
description | The amount, size, complexity, and importance of Knowledge Graphs (KGs) have increased during the last decade. Many different communities have chosen to publish their datasets using Linked Data principles, which favors the integration of this information with many other sources published using the same principles and technologies. Such a scenario requires to develop techniques of Linked Data Summarization. The concept of a class is one of the core elements used to define the ontologies which sustain most of the existing KGs. Moreover, classes are an excellent tool to refer to an abstract idea which groups many individuals (or instances) in the context of a given KG, which is handy to use when producing summaries of its content. Rankings of class importance are a powerful summarization tool that can be used both to obtain a superficial view of the content of a given KG and to prioritize many different actions over the data (data quality checking, visualization, relevance for search engines…). In this paper, we analyze existing techniques to measure class importance and propose a novel approach called ClassRank. We compare the class usage in SPARQL logs of different KGs with the importance ranking produced by the approaches evaluated. Then, we discuss the strengths and weaknesses of the evaluated techniques. Our experimentation suggests that ClassRank outperforms state-of-the-art approaches measuring class importance. |
format | Online Article Text |
id | pubmed-8191918 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-81919182021-06-10 Approaches to measure class importance in Knowledge Graphs Fernández-Álvarez, Daniel Frey, Johannes Labra Gayo, Jose Emilio Gayo-Avello, Daniel Hellmann, Sebastian PLoS One Research Article The amount, size, complexity, and importance of Knowledge Graphs (KGs) have increased during the last decade. Many different communities have chosen to publish their datasets using Linked Data principles, which favors the integration of this information with many other sources published using the same principles and technologies. Such a scenario requires to develop techniques of Linked Data Summarization. The concept of a class is one of the core elements used to define the ontologies which sustain most of the existing KGs. Moreover, classes are an excellent tool to refer to an abstract idea which groups many individuals (or instances) in the context of a given KG, which is handy to use when producing summaries of its content. Rankings of class importance are a powerful summarization tool that can be used both to obtain a superficial view of the content of a given KG and to prioritize many different actions over the data (data quality checking, visualization, relevance for search engines…). In this paper, we analyze existing techniques to measure class importance and propose a novel approach called ClassRank. We compare the class usage in SPARQL logs of different KGs with the importance ranking produced by the approaches evaluated. Then, we discuss the strengths and weaknesses of the evaluated techniques. Our experimentation suggests that ClassRank outperforms state-of-the-art approaches measuring class importance. Public Library of Science 2021-06-10 /pmc/articles/PMC8191918/ /pubmed/34111187 http://dx.doi.org/10.1371/journal.pone.0252862 Text en © 2021 Fernández-Álvarez et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Fernández-Álvarez, Daniel Frey, Johannes Labra Gayo, Jose Emilio Gayo-Avello, Daniel Hellmann, Sebastian Approaches to measure class importance in Knowledge Graphs |
title | Approaches to measure class importance in Knowledge Graphs |
title_full | Approaches to measure class importance in Knowledge Graphs |
title_fullStr | Approaches to measure class importance in Knowledge Graphs |
title_full_unstemmed | Approaches to measure class importance in Knowledge Graphs |
title_short | Approaches to measure class importance in Knowledge Graphs |
title_sort | approaches to measure class importance in knowledge graphs |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191918/ https://www.ncbi.nlm.nih.gov/pubmed/34111187 http://dx.doi.org/10.1371/journal.pone.0252862 |
work_keys_str_mv | AT fernandezalvarezdaniel approachestomeasureclassimportanceinknowledgegraphs AT freyjohannes approachestomeasureclassimportanceinknowledgegraphs AT labragayojoseemilio approachestomeasureclassimportanceinknowledgegraphs AT gayoavellodaniel approachestomeasureclassimportanceinknowledgegraphs AT hellmannsebastian approachestomeasureclassimportanceinknowledgegraphs |