Cargando…

Quality control for terms and definitions in ontologies and taxonomies

BACKGROUND: Ontologies and taxonomies are among the most important computational resources for molecular biology and bioinformatics. A series of recent papers has shown that the Gene Ontology (GO), the most prominent taxonomic resource in these fields, is marked by flaws of certain characteristic ty...

Descripción completa

Detalles Bibliográficos
Autores principales: Köhler, Jacob, Munn, Katherine, Rüegg, Alexander, Skusa, Andre, Smith, Barry
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1482721/
https://www.ncbi.nlm.nih.gov/pubmed/16623942
http://dx.doi.org/10.1186/1471-2105-7-212
_version_ 1782128296505376768
author Köhler, Jacob
Munn, Katherine
Rüegg, Alexander
Skusa, Andre
Smith, Barry
author_facet Köhler, Jacob
Munn, Katherine
Rüegg, Alexander
Skusa, Andre
Smith, Barry
author_sort Köhler, Jacob
collection PubMed
description BACKGROUND: Ontologies and taxonomies are among the most important computational resources for molecular biology and bioinformatics. A series of recent papers has shown that the Gene Ontology (GO), the most prominent taxonomic resource in these fields, is marked by flaws of certain characteristic types, which flow from a failure to address basic ontological principles. As yet, no methods have been proposed which would allow ontology curators to pinpoint flawed terms or definitions in ontologies in a systematic way. RESULTS: We present computational methods that automatically identify terms and definitions which are defined in a circular or unintelligible way. We further demonstrate the potential of these methods by applying them to isolate a subset of 6001 problematic GO terms. By automatically aligning GO with other ontologies and taxonomies we were able to propose alternative synonyms and definitions for some of these problematic terms. This allows us to demonstrate that these other resources do not contain definitions superior to those supplied by GO. CONCLUSION: Our methods provide reliable indications of the quality of terms and definitions in ontologies and taxonomies. Further, they are well suited to assist ontology curators in drawing their attention to those terms that are ill-defined. We have further shown the limitations of ontology mapping and alignment in assisting ontology curators in rectifying problems, thus pointing to the need for manual curation.
format Text
id pubmed-1482721
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14827212006-06-28 Quality control for terms and definitions in ontologies and taxonomies Köhler, Jacob Munn, Katherine Rüegg, Alexander Skusa, Andre Smith, Barry BMC Bioinformatics Methodology Article BACKGROUND: Ontologies and taxonomies are among the most important computational resources for molecular biology and bioinformatics. A series of recent papers has shown that the Gene Ontology (GO), the most prominent taxonomic resource in these fields, is marked by flaws of certain characteristic types, which flow from a failure to address basic ontological principles. As yet, no methods have been proposed which would allow ontology curators to pinpoint flawed terms or definitions in ontologies in a systematic way. RESULTS: We present computational methods that automatically identify terms and definitions which are defined in a circular or unintelligible way. We further demonstrate the potential of these methods by applying them to isolate a subset of 6001 problematic GO terms. By automatically aligning GO with other ontologies and taxonomies we were able to propose alternative synonyms and definitions for some of these problematic terms. This allows us to demonstrate that these other resources do not contain definitions superior to those supplied by GO. CONCLUSION: Our methods provide reliable indications of the quality of terms and definitions in ontologies and taxonomies. Further, they are well suited to assist ontology curators in drawing their attention to those terms that are ill-defined. We have further shown the limitations of ontology mapping and alignment in assisting ontology curators in rectifying problems, thus pointing to the need for manual curation. BioMed Central 2006-04-19 /pmc/articles/PMC1482721/ /pubmed/16623942 http://dx.doi.org/10.1186/1471-2105-7-212 Text en Copyright © 2006 Köhler et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Köhler, Jacob
Munn, Katherine
Rüegg, Alexander
Skusa, Andre
Smith, Barry
Quality control for terms and definitions in ontologies and taxonomies
title Quality control for terms and definitions in ontologies and taxonomies
title_full Quality control for terms and definitions in ontologies and taxonomies
title_fullStr Quality control for terms and definitions in ontologies and taxonomies
title_full_unstemmed Quality control for terms and definitions in ontologies and taxonomies
title_short Quality control for terms and definitions in ontologies and taxonomies
title_sort quality control for terms and definitions in ontologies and taxonomies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1482721/
https://www.ncbi.nlm.nih.gov/pubmed/16623942
http://dx.doi.org/10.1186/1471-2105-7-212
work_keys_str_mv AT kohlerjacob qualitycontrolfortermsanddefinitionsinontologiesandtaxonomies
AT munnkatherine qualitycontrolfortermsanddefinitionsinontologiesandtaxonomies
AT rueggalexander qualitycontrolfortermsanddefinitionsinontologiesandtaxonomies
AT skusaandre qualitycontrolfortermsanddefinitionsinontologiesandtaxonomies
AT smithbarry qualitycontrolfortermsanddefinitionsinontologiesandtaxonomies