Cargando…
A rule-based ontological framework for the classification of molecules
BACKGROUND: A variety of key activities within life sciences research involves integrating and intelligently managing large amounts of biochemical information. Semantic technologies provide an intuitive way to organise and sift through these rapidly growing datasets via the design and maintenance of...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4040517/ https://www.ncbi.nlm.nih.gov/pubmed/24735706 http://dx.doi.org/10.1186/2041-1480-5-17 |
_version_ | 1782318584378163200 |
---|---|
author | Magka, Despoina Krötzsch, Markus Horrocks, Ian |
author_facet | Magka, Despoina Krötzsch, Markus Horrocks, Ian |
author_sort | Magka, Despoina |
collection | PubMed |
description | BACKGROUND: A variety of key activities within life sciences research involves integrating and intelligently managing large amounts of biochemical information. Semantic technologies provide an intuitive way to organise and sift through these rapidly growing datasets via the design and maintenance of ontology-supported knowledge bases. To this end, OWL—a W3C standard declarative language— has been extensively used in the deployment of biochemical ontologies that can be conveniently organised using the classification facilities of OWL-based tools. One of the most established ontologies for the chemical domain is ChEBI, an open-access dictionary of molecular entities that supplies high quality annotation and taxonomical information for biologically relevant compounds. However, ChEBI is being manually expanded which hinders its potential to grow due to the limited availability of human resources. RESULTS: In this work, we describe a prototype that performs automatic classification of chemical compounds. The software we present implements a sound and complete reasoning procedure of a formalism that extends datalog and builds upon an off-the-shelf deductive database system. We capture a wide range of chemical classes that are not expressible with OWL-based formalisms such as cyclic molecules, saturated molecules and alkanes. Furthermore, we describe a surface ‘less-logician-like’ syntax that allows application experts to create ontological descriptions of complex biochemical objects without prior knowledge of logic. In terms of performance, a noticeable improvement is observed in comparison with previous approaches. Our evaluation has discovered subsumptions that are missing from the manually curated ChEBI ontology as well as discrepancies with respect to existing subclass relations. We illustrate thus the potential of an ontology language suitable for the life sciences domain that exhibits a favourable balance between expressive power and practical feasibility. CONCLUSIONS: Our proposed methodology can form the basis of an ontology-mediated application to assist biocurators in the production of complete and error-free taxonomies. Moreover, such a tool could contribute to a more rapid development of the ChEBI ontology and to the efforts of the ChEBI team to make annotated chemical datasets available to the public. From a modelling point of view, our approach could stimulate the adoption of a different and expressive reasoning paradigm based on rules for which state-of-the-art and highly optimised reasoners are available; it could thus pave the way for the representation of a broader spectrum of life sciences and biomedical knowledge. |
format | Online Article Text |
id | pubmed-4040517 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40405172014-06-16 A rule-based ontological framework for the classification of molecules Magka, Despoina Krötzsch, Markus Horrocks, Ian J Biomed Semantics Research BACKGROUND: A variety of key activities within life sciences research involves integrating and intelligently managing large amounts of biochemical information. Semantic technologies provide an intuitive way to organise and sift through these rapidly growing datasets via the design and maintenance of ontology-supported knowledge bases. To this end, OWL—a W3C standard declarative language— has been extensively used in the deployment of biochemical ontologies that can be conveniently organised using the classification facilities of OWL-based tools. One of the most established ontologies for the chemical domain is ChEBI, an open-access dictionary of molecular entities that supplies high quality annotation and taxonomical information for biologically relevant compounds. However, ChEBI is being manually expanded which hinders its potential to grow due to the limited availability of human resources. RESULTS: In this work, we describe a prototype that performs automatic classification of chemical compounds. The software we present implements a sound and complete reasoning procedure of a formalism that extends datalog and builds upon an off-the-shelf deductive database system. We capture a wide range of chemical classes that are not expressible with OWL-based formalisms such as cyclic molecules, saturated molecules and alkanes. Furthermore, we describe a surface ‘less-logician-like’ syntax that allows application experts to create ontological descriptions of complex biochemical objects without prior knowledge of logic. In terms of performance, a noticeable improvement is observed in comparison with previous approaches. Our evaluation has discovered subsumptions that are missing from the manually curated ChEBI ontology as well as discrepancies with respect to existing subclass relations. We illustrate thus the potential of an ontology language suitable for the life sciences domain that exhibits a favourable balance between expressive power and practical feasibility. CONCLUSIONS: Our proposed methodology can form the basis of an ontology-mediated application to assist biocurators in the production of complete and error-free taxonomies. Moreover, such a tool could contribute to a more rapid development of the ChEBI ontology and to the efforts of the ChEBI team to make annotated chemical datasets available to the public. From a modelling point of view, our approach could stimulate the adoption of a different and expressive reasoning paradigm based on rules for which state-of-the-art and highly optimised reasoners are available; it could thus pave the way for the representation of a broader spectrum of life sciences and biomedical knowledge. BioMed Central 2014-04-15 /pmc/articles/PMC4040517/ /pubmed/24735706 http://dx.doi.org/10.1186/2041-1480-5-17 Text en Copyright © 2014 Magka et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Magka, Despoina Krötzsch, Markus Horrocks, Ian A rule-based ontological framework for the classification of molecules |
title | A rule-based ontological framework for the classification of molecules |
title_full | A rule-based ontological framework for the classification of molecules |
title_fullStr | A rule-based ontological framework for the classification of molecules |
title_full_unstemmed | A rule-based ontological framework for the classification of molecules |
title_short | A rule-based ontological framework for the classification of molecules |
title_sort | rule-based ontological framework for the classification of molecules |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4040517/ https://www.ncbi.nlm.nih.gov/pubmed/24735706 http://dx.doi.org/10.1186/2041-1480-5-17 |
work_keys_str_mv | AT magkadespoina arulebasedontologicalframeworkfortheclassificationofmolecules AT krotzschmarkus arulebasedontologicalframeworkfortheclassificationofmolecules AT horrocksian arulebasedontologicalframeworkfortheclassificationofmolecules AT magkadespoina rulebasedontologicalframeworkfortheclassificationofmolecules AT krotzschmarkus rulebasedontologicalframeworkfortheclassificationofmolecules AT horrocksian rulebasedontologicalframeworkfortheclassificationofmolecules |