Cargando…

EAPB: entropy-aware path-based metric for ontology quality

BACKGROUND: Entropy has become increasingly popular in computer science and information theory because it can be used to measure the predictability and redundancy of knowledge bases, especially ontologies. However, current entropy applications that evaluate ontologies consider only single-point conn...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Ying, Chen, Daoyuan, Tang, Buzhou, Yang, Min, Lei, Kai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6086046/
https://www.ncbi.nlm.nih.gov/pubmed/30097014
http://dx.doi.org/10.1186/s13326-018-0188-7
_version_ 1783346447581708288
author Shen, Ying
Chen, Daoyuan
Tang, Buzhou
Yang, Min
Lei, Kai
author_facet Shen, Ying
Chen, Daoyuan
Tang, Buzhou
Yang, Min
Lei, Kai
author_sort Shen, Ying
collection PubMed
description BACKGROUND: Entropy has become increasingly popular in computer science and information theory because it can be used to measure the predictability and redundancy of knowledge bases, especially ontologies. However, current entropy applications that evaluate ontologies consider only single-point connectivity rather than path connectivity, and they assign equal weights to each entity and path. RESULTS: We propose an Entropy-Aware Path-Based (EAPB) metric for ontology quality by considering the path information between different vertices and textual information included in the path to calculate the connectivity path of the whole network and dynamic weights between different nodes. The information obtained from structure-based embedding and text-based embedding is multiplied by the connectivity matrix of the entropy computation. EAPB is analytically evaluated against the state-of-the-art criteria. We have performed empirical analysis on real-world medical ontologies and a synthetic ontology based on the following three aspects: ontology statistical information (data quantity), entropy evaluation (data quality), and a case study (ontology structure and text visualization). These aspects mutually demonstrate the reliability of the proposed metric. The experimental results show that the proposed EAPB can effectively evaluate ontologies, especially those in the medical informatics field. CONCLUSIONS: We leverage path information and textual information to enrich the network representational learning and aid in entropy computation. The analytics and assessments of semantic web can benefit from the structure information but also the text information. We believe that EAPB is helpful for managing ontology development and evaluation projects. Our results are reproducible and we will release the source code and ontology of this work after publication. (Source code and ontology: https://github.com/AnonymousResearcher1/ontologyEvaluate).
format Online
Article
Text
id pubmed-6086046
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60860462018-08-16 EAPB: entropy-aware path-based metric for ontology quality Shen, Ying Chen, Daoyuan Tang, Buzhou Yang, Min Lei, Kai J Biomed Semantics Research BACKGROUND: Entropy has become increasingly popular in computer science and information theory because it can be used to measure the predictability and redundancy of knowledge bases, especially ontologies. However, current entropy applications that evaluate ontologies consider only single-point connectivity rather than path connectivity, and they assign equal weights to each entity and path. RESULTS: We propose an Entropy-Aware Path-Based (EAPB) metric for ontology quality by considering the path information between different vertices and textual information included in the path to calculate the connectivity path of the whole network and dynamic weights between different nodes. The information obtained from structure-based embedding and text-based embedding is multiplied by the connectivity matrix of the entropy computation. EAPB is analytically evaluated against the state-of-the-art criteria. We have performed empirical analysis on real-world medical ontologies and a synthetic ontology based on the following three aspects: ontology statistical information (data quantity), entropy evaluation (data quality), and a case study (ontology structure and text visualization). These aspects mutually demonstrate the reliability of the proposed metric. The experimental results show that the proposed EAPB can effectively evaluate ontologies, especially those in the medical informatics field. CONCLUSIONS: We leverage path information and textual information to enrich the network representational learning and aid in entropy computation. The analytics and assessments of semantic web can benefit from the structure information but also the text information. We believe that EAPB is helpful for managing ontology development and evaluation projects. Our results are reproducible and we will release the source code and ontology of this work after publication. (Source code and ontology: https://github.com/AnonymousResearcher1/ontologyEvaluate). BioMed Central 2018-08-10 /pmc/articles/PMC6086046/ /pubmed/30097014 http://dx.doi.org/10.1186/s13326-018-0188-7 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Shen, Ying
Chen, Daoyuan
Tang, Buzhou
Yang, Min
Lei, Kai
EAPB: entropy-aware path-based metric for ontology quality
title EAPB: entropy-aware path-based metric for ontology quality
title_full EAPB: entropy-aware path-based metric for ontology quality
title_fullStr EAPB: entropy-aware path-based metric for ontology quality
title_full_unstemmed EAPB: entropy-aware path-based metric for ontology quality
title_short EAPB: entropy-aware path-based metric for ontology quality
title_sort eapb: entropy-aware path-based metric for ontology quality
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6086046/
https://www.ncbi.nlm.nih.gov/pubmed/30097014
http://dx.doi.org/10.1186/s13326-018-0188-7
work_keys_str_mv AT shenying eapbentropyawarepathbasedmetricforontologyquality
AT chendaoyuan eapbentropyawarepathbasedmetricforontologyquality
AT tangbuzhou eapbentropyawarepathbasedmetricforontologyquality
AT yangmin eapbentropyawarepathbasedmetricforontologyquality
AT leikai eapbentropyawarepathbasedmetricforontologyquality