Cargando…
Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs
BACKGROUND: The Vaccine Ontology (VO) is a biomedical ontology that standardizes vaccine annotation. Errors in VO will affect a multitude of applications that it is being used in. Quality assurance of VO is imperative to ensure that it provides accurate domain knowledge to these downstream tasks. Ma...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9375092/ https://www.ncbi.nlm.nih.gov/pubmed/35964149 http://dx.doi.org/10.1186/s13326-022-00276-2 |
_version_ | 1784767886513930240 |
---|---|
author | Manuel, Warren Abeysinghe, Rashmie He, Yongqun Tao, Cui Cui, Licong |
author_facet | Manuel, Warren Abeysinghe, Rashmie He, Yongqun Tao, Cui Cui, Licong |
author_sort | Manuel, Warren |
collection | PubMed |
description | BACKGROUND: The Vaccine Ontology (VO) is a biomedical ontology that standardizes vaccine annotation. Errors in VO will affect a multitude of applications that it is being used in. Quality assurance of VO is imperative to ensure that it provides accurate domain knowledge to these downstream tasks. Manual review to identify and fix quality issues (such as missing hierarchical is-a relations) is challenging given the complexity of the ontology. Automated approaches are highly desirable to facilitate the quality assurance of VO. METHODS: We developed an automated lexical approach that identifies potentially missing is-a relations in VO. First, we construct two types of VO concept-pairs: (1) linked; and (2) unlinked. Each concept-pair further derives an Acquired Term Pair (ATP) based on their lexical features. If the same ATP is obtained by a linked concept-pair and an unlinked concept-pair, this is considered to indicate a potentially missing is-a relation between the unlinked pair of concepts. RESULTS: Applying this approach on the 1.1.192 version of VO, we were able to identify 232 potentially missing is-a relations. A manual review by a VO domain expert on a random sample of 70 potentially missing is-a relations revealed that 65 of the cases were valid missing is-a relations in VO (a precision of 92.86%). CONCLUSIONS: The results indicate that our approach is highly effective in identifying missing is-a relation in VO. |
format | Online Article Text |
id | pubmed-9375092 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-93750922022-08-14 Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs Manuel, Warren Abeysinghe, Rashmie He, Yongqun Tao, Cui Cui, Licong J Biomed Semantics Research BACKGROUND: The Vaccine Ontology (VO) is a biomedical ontology that standardizes vaccine annotation. Errors in VO will affect a multitude of applications that it is being used in. Quality assurance of VO is imperative to ensure that it provides accurate domain knowledge to these downstream tasks. Manual review to identify and fix quality issues (such as missing hierarchical is-a relations) is challenging given the complexity of the ontology. Automated approaches are highly desirable to facilitate the quality assurance of VO. METHODS: We developed an automated lexical approach that identifies potentially missing is-a relations in VO. First, we construct two types of VO concept-pairs: (1) linked; and (2) unlinked. Each concept-pair further derives an Acquired Term Pair (ATP) based on their lexical features. If the same ATP is obtained by a linked concept-pair and an unlinked concept-pair, this is considered to indicate a potentially missing is-a relation between the unlinked pair of concepts. RESULTS: Applying this approach on the 1.1.192 version of VO, we were able to identify 232 potentially missing is-a relations. A manual review by a VO domain expert on a random sample of 70 potentially missing is-a relations revealed that 65 of the cases were valid missing is-a relations in VO (a precision of 92.86%). CONCLUSIONS: The results indicate that our approach is highly effective in identifying missing is-a relation in VO. BioMed Central 2022-08-13 /pmc/articles/PMC9375092/ /pubmed/35964149 http://dx.doi.org/10.1186/s13326-022-00276-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Manuel, Warren Abeysinghe, Rashmie He, Yongqun Tao, Cui Cui, Licong Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs |
title | Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs |
title_full | Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs |
title_fullStr | Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs |
title_full_unstemmed | Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs |
title_short | Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs |
title_sort | identification of missing hierarchical relations in the vaccine ontology using acquired term pairs |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9375092/ https://www.ncbi.nlm.nih.gov/pubmed/35964149 http://dx.doi.org/10.1186/s13326-022-00276-2 |
work_keys_str_mv | AT manuelwarren identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs AT abeysingherashmie identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs AT heyongqun identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs AT taocui identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs AT cuilicong identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs |