Cargando…
Automated compound classification using a chemical ontology
BACKGROUND: Classification of chemical compounds into compound classes by using structure derived descriptors is a well-established method to aid the evaluation and abstraction of compound properties in chemical compound databases. MeSH and recently ChEBI are examples of chemical ontologies that pro...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3547776/ https://www.ncbi.nlm.nih.gov/pubmed/23273256 http://dx.doi.org/10.1186/1758-2946-4-40 |
_version_ | 1782256225408253952 |
---|---|
author | Bobach, Claudia Böhme, Timo Laube, Ulf Püschel, Anett Weber, Lutz |
author_facet | Bobach, Claudia Böhme, Timo Laube, Ulf Püschel, Anett Weber, Lutz |
author_sort | Bobach, Claudia |
collection | PubMed |
description | BACKGROUND: Classification of chemical compounds into compound classes by using structure derived descriptors is a well-established method to aid the evaluation and abstraction of compound properties in chemical compound databases. MeSH and recently ChEBI are examples of chemical ontologies that provide a hierarchical classification of compounds into general compound classes of biological interest based on their structural as well as property or use features. In these ontologies, compounds have been assigned manually to their respective classes. However, with the ever increasing possibilities to extract new compounds from text documents using name-to-structure tools and considering the large number of compounds deposited in databases, automated and comprehensive chemical classification methods are needed to avoid the error prone and time consuming manual classification of compounds. RESULTS: In the present work we implement principles and methods to construct a chemical ontology of classes that shall support the automated, high-quality compound classification in chemical databases or text documents. While SMARTS expressions have already been used to define chemical structure class concepts, in the present work we have extended the expressive power of such class definitions by expanding their structure-based reasoning logic. Thus, to achieve the required precision and granularity of chemical class definitions, sets of SMARTS class definitions are connected by OR and NOT logical operators. In addition, AND logic has been implemented to allow the concomitant use of flexible atom lists and stereochemistry definitions. The resulting chemical ontology is a multi-hierarchical taxonomy of concept nodes connected by directed, transitive relationships. CONCLUSIONS: A proposal for a rule based definition of chemical classes has been made that allows to define chemical compound classes more precisely than before. The proposed structure-based reasoning logic allows to translate chemistry expert knowledge into a computer interpretable form, preventing erroneous compound assignments and allowing automatic compound classification. The automated assignment of compounds in databases, compound structure files or text documents to their related ontology classes is possible through the integration with a chemical structure search engine. As an application example, the annotation of chemical structure files with a prototypic ontology is demonstrated. |
format | Online Article Text |
id | pubmed-3547776 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35477762013-01-23 Automated compound classification using a chemical ontology Bobach, Claudia Böhme, Timo Laube, Ulf Püschel, Anett Weber, Lutz J Cheminform Research Article BACKGROUND: Classification of chemical compounds into compound classes by using structure derived descriptors is a well-established method to aid the evaluation and abstraction of compound properties in chemical compound databases. MeSH and recently ChEBI are examples of chemical ontologies that provide a hierarchical classification of compounds into general compound classes of biological interest based on their structural as well as property or use features. In these ontologies, compounds have been assigned manually to their respective classes. However, with the ever increasing possibilities to extract new compounds from text documents using name-to-structure tools and considering the large number of compounds deposited in databases, automated and comprehensive chemical classification methods are needed to avoid the error prone and time consuming manual classification of compounds. RESULTS: In the present work we implement principles and methods to construct a chemical ontology of classes that shall support the automated, high-quality compound classification in chemical databases or text documents. While SMARTS expressions have already been used to define chemical structure class concepts, in the present work we have extended the expressive power of such class definitions by expanding their structure-based reasoning logic. Thus, to achieve the required precision and granularity of chemical class definitions, sets of SMARTS class definitions are connected by OR and NOT logical operators. In addition, AND logic has been implemented to allow the concomitant use of flexible atom lists and stereochemistry definitions. The resulting chemical ontology is a multi-hierarchical taxonomy of concept nodes connected by directed, transitive relationships. CONCLUSIONS: A proposal for a rule based definition of chemical classes has been made that allows to define chemical compound classes more precisely than before. The proposed structure-based reasoning logic allows to translate chemistry expert knowledge into a computer interpretable form, preventing erroneous compound assignments and allowing automatic compound classification. The automated assignment of compounds in databases, compound structure files or text documents to their related ontology classes is possible through the integration with a chemical structure search engine. As an application example, the annotation of chemical structure files with a prototypic ontology is demonstrated. BioMed Central 2012-12-29 /pmc/articles/PMC3547776/ /pubmed/23273256 http://dx.doi.org/10.1186/1758-2946-4-40 Text en Copyright ©2012 Bobach et al.; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Bobach, Claudia Böhme, Timo Laube, Ulf Püschel, Anett Weber, Lutz Automated compound classification using a chemical ontology |
title | Automated compound classification using a chemical ontology |
title_full | Automated compound classification using a chemical ontology |
title_fullStr | Automated compound classification using a chemical ontology |
title_full_unstemmed | Automated compound classification using a chemical ontology |
title_short | Automated compound classification using a chemical ontology |
title_sort | automated compound classification using a chemical ontology |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3547776/ https://www.ncbi.nlm.nih.gov/pubmed/23273256 http://dx.doi.org/10.1186/1758-2946-4-40 |
work_keys_str_mv | AT bobachclaudia automatedcompoundclassificationusingachemicalontology AT bohmetimo automatedcompoundclassificationusingachemicalontology AT laubeulf automatedcompoundclassificationusingachemicalontology AT puschelanett automatedcompoundclassificationusingachemicalontology AT weberlutz automatedcompoundclassificationusingachemicalontology |