Cargando…
EAPB: entropy-aware path-based metric for ontology quality
BACKGROUND: Entropy has become increasingly popular in computer science and information theory because it can be used to measure the predictability and redundancy of knowledge bases, especially ontologies. However, current entropy applications that evaluate ontologies consider only single-point conn...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6086046/ https://www.ncbi.nlm.nih.gov/pubmed/30097014 http://dx.doi.org/10.1186/s13326-018-0188-7 |
_version_ | 1783346447581708288 |
---|---|
author | Shen, Ying Chen, Daoyuan Tang, Buzhou Yang, Min Lei, Kai |
author_facet | Shen, Ying Chen, Daoyuan Tang, Buzhou Yang, Min Lei, Kai |
author_sort | Shen, Ying |
collection | PubMed |
description | BACKGROUND: Entropy has become increasingly popular in computer science and information theory because it can be used to measure the predictability and redundancy of knowledge bases, especially ontologies. However, current entropy applications that evaluate ontologies consider only single-point connectivity rather than path connectivity, and they assign equal weights to each entity and path. RESULTS: We propose an Entropy-Aware Path-Based (EAPB) metric for ontology quality by considering the path information between different vertices and textual information included in the path to calculate the connectivity path of the whole network and dynamic weights between different nodes. The information obtained from structure-based embedding and text-based embedding is multiplied by the connectivity matrix of the entropy computation. EAPB is analytically evaluated against the state-of-the-art criteria. We have performed empirical analysis on real-world medical ontologies and a synthetic ontology based on the following three aspects: ontology statistical information (data quantity), entropy evaluation (data quality), and a case study (ontology structure and text visualization). These aspects mutually demonstrate the reliability of the proposed metric. The experimental results show that the proposed EAPB can effectively evaluate ontologies, especially those in the medical informatics field. CONCLUSIONS: We leverage path information and textual information to enrich the network representational learning and aid in entropy computation. The analytics and assessments of semantic web can benefit from the structure information but also the text information. We believe that EAPB is helpful for managing ontology development and evaluation projects. Our results are reproducible and we will release the source code and ontology of this work after publication. (Source code and ontology: https://github.com/AnonymousResearcher1/ontologyEvaluate). |
format | Online Article Text |
id | pubmed-6086046 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-60860462018-08-16 EAPB: entropy-aware path-based metric for ontology quality Shen, Ying Chen, Daoyuan Tang, Buzhou Yang, Min Lei, Kai J Biomed Semantics Research BACKGROUND: Entropy has become increasingly popular in computer science and information theory because it can be used to measure the predictability and redundancy of knowledge bases, especially ontologies. However, current entropy applications that evaluate ontologies consider only single-point connectivity rather than path connectivity, and they assign equal weights to each entity and path. RESULTS: We propose an Entropy-Aware Path-Based (EAPB) metric for ontology quality by considering the path information between different vertices and textual information included in the path to calculate the connectivity path of the whole network and dynamic weights between different nodes. The information obtained from structure-based embedding and text-based embedding is multiplied by the connectivity matrix of the entropy computation. EAPB is analytically evaluated against the state-of-the-art criteria. We have performed empirical analysis on real-world medical ontologies and a synthetic ontology based on the following three aspects: ontology statistical information (data quantity), entropy evaluation (data quality), and a case study (ontology structure and text visualization). These aspects mutually demonstrate the reliability of the proposed metric. The experimental results show that the proposed EAPB can effectively evaluate ontologies, especially those in the medical informatics field. CONCLUSIONS: We leverage path information and textual information to enrich the network representational learning and aid in entropy computation. The analytics and assessments of semantic web can benefit from the structure information but also the text information. We believe that EAPB is helpful for managing ontology development and evaluation projects. Our results are reproducible and we will release the source code and ontology of this work after publication. (Source code and ontology: https://github.com/AnonymousResearcher1/ontologyEvaluate). BioMed Central 2018-08-10 /pmc/articles/PMC6086046/ /pubmed/30097014 http://dx.doi.org/10.1186/s13326-018-0188-7 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Shen, Ying Chen, Daoyuan Tang, Buzhou Yang, Min Lei, Kai EAPB: entropy-aware path-based metric for ontology quality |
title | EAPB: entropy-aware path-based metric for ontology quality |
title_full | EAPB: entropy-aware path-based metric for ontology quality |
title_fullStr | EAPB: entropy-aware path-based metric for ontology quality |
title_full_unstemmed | EAPB: entropy-aware path-based metric for ontology quality |
title_short | EAPB: entropy-aware path-based metric for ontology quality |
title_sort | eapb: entropy-aware path-based metric for ontology quality |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6086046/ https://www.ncbi.nlm.nih.gov/pubmed/30097014 http://dx.doi.org/10.1186/s13326-018-0188-7 |
work_keys_str_mv | AT shenying eapbentropyawarepathbasedmetricforontologyquality AT chendaoyuan eapbentropyawarepathbasedmetricforontologyquality AT tangbuzhou eapbentropyawarepathbasedmetricforontologyquality AT yangmin eapbentropyawarepathbasedmetricforontologyquality AT leikai eapbentropyawarepathbasedmetricforontologyquality |