Detecting Role Errors in the Gene Hierarchy of the NCI Thesaurus

Gene terminologies are playing an increasingly important role in the ever-growing field of genomic research. While errors in large, complex terminologies are inevitable, gene terminologies are even more susceptible to them due to the rapid growth of genomic knowledge and the nature of its discovery....

Bibliographic Details
Main authors: Min, Hua; Cohen, Barry; Halper, Michael; Oren, Marc; Perl, Yehoshua
Format: Text
Language: English
Published: Libertas Academica 2008
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2623310/
https://www.ncbi.nlm.nih.gov/pubmed/19221606
_version_ 1782163425812545536
author Min, Hua
Cohen, Barry
Halper, Michael
Oren, Marc
Perl, Yehoshua
author_sort Min, Hua
collection PubMed
description Gene terminologies are playing an increasingly important role in the ever-growing field of genomic research. While errors in large, complex terminologies are inevitable, gene terminologies are even more susceptible to them due to the rapid growth of genomic knowledge and the nature of its discovery. It is therefore very important to establish quality-assurance protocols for such genomic-knowledge repositories. Different kinds of terminologies oftentimes require auditing methodologies adapted to their particular structures. In light of this, an auditing methodology tailored to the characteristics of the NCI Thesaurus’s (NCIT’s) Gene hierarchy is presented. The Gene hierarchy is of particular interest to the NCIT’s designers due to the primary role of genomics in current cancer research. This multiphase methodology focuses on detecting role-errors, such as missing roles or roles with incorrect or incomplete target structures, occurring within that hierarchy. The methodology is based on two kinds of abstraction networks, called taxonomies, that highlight the role distribution among concepts within the IS-A (subsumption) hierarchy. These abstract views tend to highlight portions of the hierarchy having a higher concentration of errors. The errors found during an application of the methodology are reported. Hypotheses pertaining to the efficacy of our methodology are investigated.
format Text
id pubmed-2623310
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-2623310 2009-02-24 Detecting Role Errors in the Gene Hierarchy of the NCI Thesaurus Min, Hua Cohen, Barry Halper, Michael Oren, Marc Perl, Yehoshua Cancer Inform Original Article Libertas Academica 2008-05-27 /pmc/articles/PMC2623310/ /pubmed/19221606 Text en © 2008 by the authors http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
title Detecting Role Errors in the Gene Hierarchy of the NCI Thesaurus
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2623310/
https://www.ncbi.nlm.nih.gov/pubmed/19221606