Cargando…

Obol: Integrating Language and Meaning in Bio-Ontologies

Ontologies are intended to capture and formalize a domain of knowledge. The ontologies comprising the Open Biological Ontologies (OBO) project, which includes the Gene Ontology (GO), are formalizations of various domains of biological knowledge. Ontologies within OBO typically lack computable defini...

Descripción completa

Detalles Bibliográficos
Autor principal: Mungall, Christopher J.
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447432/
https://www.ncbi.nlm.nih.gov/pubmed/18629143
http://dx.doi.org/10.1002/cfg.435
_version_ 1782156937608036352
author Mungall, Christopher J.
author_facet Mungall, Christopher J.
author_sort Mungall, Christopher J.
collection PubMed
description Ontologies are intended to capture and formalize a domain of knowledge. The ontologies comprising the Open Biological Ontologies (OBO) project, which includes the Gene Ontology (GO), are formalizations of various domains of biological knowledge. Ontologies within OBO typically lack computable definitions that serve to differentiate a term from other similar terms. The computer is unable to determine the meaning of a term, which presents problems for tools such as automated reasoners. Reasoners can be of enormous benefit in managing a complex ontology. OBO term names frequently implicitly encode the kind of definitions that can be used by computational tools, such as automated reasoners. The definitions encoded in the names are not easily amenable to computation, because the names are ostensibly natural language phrases designed for human users. These names are highly regular in their grammar, and can thus be treated as valid sentences in some formal or computable language.With a description of the rules underlying this formal language, term names can be parsed to derive computable definitions, which can then be reasoned over. This paper describes the effort to elucidate that language, called Obol, and the attempts to reason over the resulting definitions. The current implementation finds unique non-trivial definitions for around half of the terms in the GO, and has been used to find 223 missing relationships, which have since been added to the ontology. Obol has utility as an ontology maintenance tool, and as a means of generating computable definitions for a whole ontology. The software is available under an open-source license from: http://www.fruitfly. org/~cjm/obol. Supplementary material for this article can be found at: http://www. interscience.wiley.com/jpages/1531-6912/suppmat.
format Text
id pubmed-2447432
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-24474322008-07-14 Obol: Integrating Language and Meaning in Bio-Ontologies Mungall, Christopher J. Comp Funct Genomics Research Article Ontologies are intended to capture and formalize a domain of knowledge. The ontologies comprising the Open Biological Ontologies (OBO) project, which includes the Gene Ontology (GO), are formalizations of various domains of biological knowledge. Ontologies within OBO typically lack computable definitions that serve to differentiate a term from other similar terms. The computer is unable to determine the meaning of a term, which presents problems for tools such as automated reasoners. Reasoners can be of enormous benefit in managing a complex ontology. OBO term names frequently implicitly encode the kind of definitions that can be used by computational tools, such as automated reasoners. The definitions encoded in the names are not easily amenable to computation, because the names are ostensibly natural language phrases designed for human users. These names are highly regular in their grammar, and can thus be treated as valid sentences in some formal or computable language.With a description of the rules underlying this formal language, term names can be parsed to derive computable definitions, which can then be reasoned over. This paper describes the effort to elucidate that language, called Obol, and the attempts to reason over the resulting definitions. The current implementation finds unique non-trivial definitions for around half of the terms in the GO, and has been used to find 223 missing relationships, which have since been added to the ontology. Obol has utility as an ontology maintenance tool, and as a means of generating computable definitions for a whole ontology. The software is available under an open-source license from: http://www.fruitfly. org/~cjm/obol. Supplementary material for this article can be found at: http://www. interscience.wiley.com/jpages/1531-6912/suppmat. Hindawi Publishing Corporation 2004 /pmc/articles/PMC2447432/ /pubmed/18629143 http://dx.doi.org/10.1002/cfg.435 Text en Copyright © 2004 Hindawi Publishing Corporation. http://creativecommons.org/licenses/by/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mungall, Christopher J.
Obol: Integrating Language and Meaning in Bio-Ontologies
title Obol: Integrating Language and Meaning in Bio-Ontologies
title_full Obol: Integrating Language and Meaning in Bio-Ontologies
title_fullStr Obol: Integrating Language and Meaning in Bio-Ontologies
title_full_unstemmed Obol: Integrating Language and Meaning in Bio-Ontologies
title_short Obol: Integrating Language and Meaning in Bio-Ontologies
title_sort obol: integrating language and meaning in bio-ontologies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447432/
https://www.ncbi.nlm.nih.gov/pubmed/18629143
http://dx.doi.org/10.1002/cfg.435
work_keys_str_mv AT mungallchristopherj obolintegratinglanguageandmeaninginbioontologies