Cargando…

Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs

BACKGROUND: The Vaccine Ontology (VO) is a biomedical ontology that standardizes vaccine annotation. Errors in VO will affect a multitude of applications that it is being used in. Quality assurance of VO is imperative to ensure that it provides accurate domain knowledge to these downstream tasks. Ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Manuel, Warren, Abeysinghe, Rashmie, He, Yongqun, Tao, Cui, Cui, Licong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9375092/
https://www.ncbi.nlm.nih.gov/pubmed/35964149
http://dx.doi.org/10.1186/s13326-022-00276-2
_version_ 1784767886513930240
author Manuel, Warren
Abeysinghe, Rashmie
He, Yongqun
Tao, Cui
Cui, Licong
author_facet Manuel, Warren
Abeysinghe, Rashmie
He, Yongqun
Tao, Cui
Cui, Licong
author_sort Manuel, Warren
collection PubMed
description BACKGROUND: The Vaccine Ontology (VO) is a biomedical ontology that standardizes vaccine annotation. Errors in VO will affect a multitude of applications that it is being used in. Quality assurance of VO is imperative to ensure that it provides accurate domain knowledge to these downstream tasks. Manual review to identify and fix quality issues (such as missing hierarchical is-a relations) is challenging given the complexity of the ontology. Automated approaches are highly desirable to facilitate the quality assurance of VO. METHODS: We developed an automated lexical approach that identifies potentially missing is-a relations in VO. First, we construct two types of VO concept-pairs: (1) linked; and (2) unlinked. Each concept-pair further derives an Acquired Term Pair (ATP) based on their lexical features. If the same ATP is obtained by a linked concept-pair and an unlinked concept-pair, this is considered to indicate a potentially missing is-a relation between the unlinked pair of concepts. RESULTS: Applying this approach on the 1.1.192 version of VO, we were able to identify 232 potentially missing is-a relations. A manual review by a VO domain expert on a random sample of 70 potentially missing is-a relations revealed that 65 of the cases were valid missing is-a relations in VO (a precision of 92.86%). CONCLUSIONS: The results indicate that our approach is highly effective in identifying missing is-a relation in VO.
format Online
Article
Text
id pubmed-9375092
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93750922022-08-14 Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs Manuel, Warren Abeysinghe, Rashmie He, Yongqun Tao, Cui Cui, Licong J Biomed Semantics Research BACKGROUND: The Vaccine Ontology (VO) is a biomedical ontology that standardizes vaccine annotation. Errors in VO will affect a multitude of applications that it is being used in. Quality assurance of VO is imperative to ensure that it provides accurate domain knowledge to these downstream tasks. Manual review to identify and fix quality issues (such as missing hierarchical is-a relations) is challenging given the complexity of the ontology. Automated approaches are highly desirable to facilitate the quality assurance of VO. METHODS: We developed an automated lexical approach that identifies potentially missing is-a relations in VO. First, we construct two types of VO concept-pairs: (1) linked; and (2) unlinked. Each concept-pair further derives an Acquired Term Pair (ATP) based on their lexical features. If the same ATP is obtained by a linked concept-pair and an unlinked concept-pair, this is considered to indicate a potentially missing is-a relation between the unlinked pair of concepts. RESULTS: Applying this approach on the 1.1.192 version of VO, we were able to identify 232 potentially missing is-a relations. A manual review by a VO domain expert on a random sample of 70 potentially missing is-a relations revealed that 65 of the cases were valid missing is-a relations in VO (a precision of 92.86%). CONCLUSIONS: The results indicate that our approach is highly effective in identifying missing is-a relation in VO. BioMed Central 2022-08-13 /pmc/articles/PMC9375092/ /pubmed/35964149 http://dx.doi.org/10.1186/s13326-022-00276-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Manuel, Warren
Abeysinghe, Rashmie
He, Yongqun
Tao, Cui
Cui, Licong
Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs
title Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs
title_full Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs
title_fullStr Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs
title_full_unstemmed Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs
title_short Identification of missing hierarchical relations in the vaccine ontology using acquired term pairs
title_sort identification of missing hierarchical relations in the vaccine ontology using acquired term pairs
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9375092/
https://www.ncbi.nlm.nih.gov/pubmed/35964149
http://dx.doi.org/10.1186/s13326-022-00276-2
work_keys_str_mv AT manuelwarren identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs
AT abeysingherashmie identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs
AT heyongqun identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs
AT taocui identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs
AT cuilicong identificationofmissinghierarchicalrelationsinthevaccineontologyusingacquiredtermpairs