Cargando…

Missing lateral relationships in top-level concepts of an ontology

BACKGROUND: Ontologies house various kinds of domain knowledge in formal structures, primarily in the form of concepts and the associative relationships between them. Ontologies have become integral components of many health information processing environments. Hence, quality assurance of the concep...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Ling, Chen, Yan, Min, Hua, Hildebrand, P. Lloyd, Liu, Hao, Halper, Michael, Geller, James, de Coronado, Sherri, Perl, Yehoshua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7737264/
https://www.ncbi.nlm.nih.gov/pubmed/33319709
http://dx.doi.org/10.1186/s12911-020-01319-3
_version_ 1783622911705219072
author Zheng, Ling
Chen, Yan
Min, Hua
Hildebrand, P. Lloyd
Liu, Hao
Halper, Michael
Geller, James
de Coronado, Sherri
Perl, Yehoshua
author_facet Zheng, Ling
Chen, Yan
Min, Hua
Hildebrand, P. Lloyd
Liu, Hao
Halper, Michael
Geller, James
de Coronado, Sherri
Perl, Yehoshua
author_sort Zheng, Ling
collection PubMed
description BACKGROUND: Ontologies house various kinds of domain knowledge in formal structures, primarily in the form of concepts and the associative relationships between them. Ontologies have become integral components of many health information processing environments. Hence, quality assurance of the conceptual content of any ontology is critical. Relationships are foundational to the definition of concepts. Missing relationship errors (i.e., unintended omissions of important definitional relationships) can have a deleterious effect on the quality of an ontology. An abstraction network is a structure that overlays an ontology and provides an alternate, summarization view of its contents. One kind of abstraction network is called an area taxonomy, and a variation of it is called a subtaxonomy. A methodology based on these taxonomies for more readily finding missing relationship errors is explored. METHODS: The area taxonomy and the subtaxonomy are deployed to help reveal concepts that have a high likelihood of exhibiting missing relationship errors. A specific top-level grouping unit found within the area taxonomy and subtaxonomy, when deemed to be anomalous, is used as an indicator that missing relationship errors are likely to be found among certain concepts. Two hypotheses pertaining to the effectiveness of our Quality Assurance approach are studied. RESULTS: Our Quality Assurance methodology was applied to the Biological Process hierarchy of the National Cancer Institute thesaurus (NCIt) and SNOMED CT’s Eye/vision finding subhierarchy within its Clinical finding hierarchy. Many missing relationship errors were discovered and confirmed in our analysis. For both test-bed hierarchies, our Quality Assurance methodology yielded a statistically significantly higher number of concepts with missing relationship errors in comparison to a control sample of concepts. Two hypotheses are confirmed by these findings. CONCLUSIONS: Quality assurance is a critical part of an ontology’s lifecycle, and automated or semi-automated tools for supporting this process are invaluable. We introduced a Quality Assurance methodology targeted at missing relationship errors. Its successful application to the NCIt’s Biological Process hierarchy and SNOMED CT’s Eye/vision finding subhierarchy indicates that it can be a useful addition to the arsenal of tools available to ontology maintenance personnel.
format Online
Article
Text
id pubmed-7737264
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77372642020-12-17 Missing lateral relationships in top-level concepts of an ontology Zheng, Ling Chen, Yan Min, Hua Hildebrand, P. Lloyd Liu, Hao Halper, Michael Geller, James de Coronado, Sherri Perl, Yehoshua BMC Med Inform Decis Mak Research BACKGROUND: Ontologies house various kinds of domain knowledge in formal structures, primarily in the form of concepts and the associative relationships between them. Ontologies have become integral components of many health information processing environments. Hence, quality assurance of the conceptual content of any ontology is critical. Relationships are foundational to the definition of concepts. Missing relationship errors (i.e., unintended omissions of important definitional relationships) can have a deleterious effect on the quality of an ontology. An abstraction network is a structure that overlays an ontology and provides an alternate, summarization view of its contents. One kind of abstraction network is called an area taxonomy, and a variation of it is called a subtaxonomy. A methodology based on these taxonomies for more readily finding missing relationship errors is explored. METHODS: The area taxonomy and the subtaxonomy are deployed to help reveal concepts that have a high likelihood of exhibiting missing relationship errors. A specific top-level grouping unit found within the area taxonomy and subtaxonomy, when deemed to be anomalous, is used as an indicator that missing relationship errors are likely to be found among certain concepts. Two hypotheses pertaining to the effectiveness of our Quality Assurance approach are studied. RESULTS: Our Quality Assurance methodology was applied to the Biological Process hierarchy of the National Cancer Institute thesaurus (NCIt) and SNOMED CT’s Eye/vision finding subhierarchy within its Clinical finding hierarchy. Many missing relationship errors were discovered and confirmed in our analysis. For both test-bed hierarchies, our Quality Assurance methodology yielded a statistically significantly higher number of concepts with missing relationship errors in comparison to a control sample of concepts. Two hypotheses are confirmed by these findings. CONCLUSIONS: Quality assurance is a critical part of an ontology’s lifecycle, and automated or semi-automated tools for supporting this process are invaluable. We introduced a Quality Assurance methodology targeted at missing relationship errors. Its successful application to the NCIt’s Biological Process hierarchy and SNOMED CT’s Eye/vision finding subhierarchy indicates that it can be a useful addition to the arsenal of tools available to ontology maintenance personnel. BioMed Central 2020-12-15 /pmc/articles/PMC7737264/ /pubmed/33319709 http://dx.doi.org/10.1186/s12911-020-01319-3 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zheng, Ling
Chen, Yan
Min, Hua
Hildebrand, P. Lloyd
Liu, Hao
Halper, Michael
Geller, James
de Coronado, Sherri
Perl, Yehoshua
Missing lateral relationships in top-level concepts of an ontology
title Missing lateral relationships in top-level concepts of an ontology
title_full Missing lateral relationships in top-level concepts of an ontology
title_fullStr Missing lateral relationships in top-level concepts of an ontology
title_full_unstemmed Missing lateral relationships in top-level concepts of an ontology
title_short Missing lateral relationships in top-level concepts of an ontology
title_sort missing lateral relationships in top-level concepts of an ontology
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7737264/
https://www.ncbi.nlm.nih.gov/pubmed/33319709
http://dx.doi.org/10.1186/s12911-020-01319-3
work_keys_str_mv AT zhengling missinglateralrelationshipsintoplevelconceptsofanontology
AT chenyan missinglateralrelationshipsintoplevelconceptsofanontology
AT minhua missinglateralrelationshipsintoplevelconceptsofanontology
AT hildebrandplloyd missinglateralrelationshipsintoplevelconceptsofanontology
AT liuhao missinglateralrelationshipsintoplevelconceptsofanontology
AT halpermichael missinglateralrelationshipsintoplevelconceptsofanontology
AT gellerjames missinglateralrelationshipsintoplevelconceptsofanontology
AT decoronadosherri missinglateralrelationshipsintoplevelconceptsofanontology
AT perlyehoshua missinglateralrelationshipsintoplevelconceptsofanontology