Cargando…

Automated compound classification using a chemical ontology

BACKGROUND: Classification of chemical compounds into compound classes by using structure derived descriptors is a well-established method to aid the evaluation and abstraction of compound properties in chemical compound databases. MeSH and recently ChEBI are examples of chemical ontologies that pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Bobach, Claudia, Böhme, Timo, Laube, Ulf, Püschel, Anett, Weber, Lutz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3547776/
https://www.ncbi.nlm.nih.gov/pubmed/23273256
http://dx.doi.org/10.1186/1758-2946-4-40
_version_ 1782256225408253952
author Bobach, Claudia
Böhme, Timo
Laube, Ulf
Püschel, Anett
Weber, Lutz
author_facet Bobach, Claudia
Böhme, Timo
Laube, Ulf
Püschel, Anett
Weber, Lutz
author_sort Bobach, Claudia
collection PubMed
description BACKGROUND: Classification of chemical compounds into compound classes by using structure derived descriptors is a well-established method to aid the evaluation and abstraction of compound properties in chemical compound databases. MeSH and recently ChEBI are examples of chemical ontologies that provide a hierarchical classification of compounds into general compound classes of biological interest based on their structural as well as property or use features. In these ontologies, compounds have been assigned manually to their respective classes. However, with the ever increasing possibilities to extract new compounds from text documents using name-to-structure tools and considering the large number of compounds deposited in databases, automated and comprehensive chemical classification methods are needed to avoid the error prone and time consuming manual classification of compounds. RESULTS: In the present work we implement principles and methods to construct a chemical ontology of classes that shall support the automated, high-quality compound classification in chemical databases or text documents. While SMARTS expressions have already been used to define chemical structure class concepts, in the present work we have extended the expressive power of such class definitions by expanding their structure-based reasoning logic. Thus, to achieve the required precision and granularity of chemical class definitions, sets of SMARTS class definitions are connected by OR and NOT logical operators. In addition, AND logic has been implemented to allow the concomitant use of flexible atom lists and stereochemistry definitions. The resulting chemical ontology is a multi-hierarchical taxonomy of concept nodes connected by directed, transitive relationships. CONCLUSIONS: A proposal for a rule based definition of chemical classes has been made that allows to define chemical compound classes more precisely than before. The proposed structure-based reasoning logic allows to translate chemistry expert knowledge into a computer interpretable form, preventing erroneous compound assignments and allowing automatic compound classification. The automated assignment of compounds in databases, compound structure files or text documents to their related ontology classes is possible through the integration with a chemical structure search engine. As an application example, the annotation of chemical structure files with a prototypic ontology is demonstrated.
format Online
Article
Text
id pubmed-3547776
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35477762013-01-23 Automated compound classification using a chemical ontology Bobach, Claudia Böhme, Timo Laube, Ulf Püschel, Anett Weber, Lutz J Cheminform Research Article BACKGROUND: Classification of chemical compounds into compound classes by using structure derived descriptors is a well-established method to aid the evaluation and abstraction of compound properties in chemical compound databases. MeSH and recently ChEBI are examples of chemical ontologies that provide a hierarchical classification of compounds into general compound classes of biological interest based on their structural as well as property or use features. In these ontologies, compounds have been assigned manually to their respective classes. However, with the ever increasing possibilities to extract new compounds from text documents using name-to-structure tools and considering the large number of compounds deposited in databases, automated and comprehensive chemical classification methods are needed to avoid the error prone and time consuming manual classification of compounds. RESULTS: In the present work we implement principles and methods to construct a chemical ontology of classes that shall support the automated, high-quality compound classification in chemical databases or text documents. While SMARTS expressions have already been used to define chemical structure class concepts, in the present work we have extended the expressive power of such class definitions by expanding their structure-based reasoning logic. Thus, to achieve the required precision and granularity of chemical class definitions, sets of SMARTS class definitions are connected by OR and NOT logical operators. In addition, AND logic has been implemented to allow the concomitant use of flexible atom lists and stereochemistry definitions. The resulting chemical ontology is a multi-hierarchical taxonomy of concept nodes connected by directed, transitive relationships. CONCLUSIONS: A proposal for a rule based definition of chemical classes has been made that allows to define chemical compound classes more precisely than before. The proposed structure-based reasoning logic allows to translate chemistry expert knowledge into a computer interpretable form, preventing erroneous compound assignments and allowing automatic compound classification. The automated assignment of compounds in databases, compound structure files or text documents to their related ontology classes is possible through the integration with a chemical structure search engine. As an application example, the annotation of chemical structure files with a prototypic ontology is demonstrated. BioMed Central 2012-12-29 /pmc/articles/PMC3547776/ /pubmed/23273256 http://dx.doi.org/10.1186/1758-2946-4-40 Text en Copyright ©2012 Bobach et al.; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Bobach, Claudia
Böhme, Timo
Laube, Ulf
Püschel, Anett
Weber, Lutz
Automated compound classification using a chemical ontology
title Automated compound classification using a chemical ontology
title_full Automated compound classification using a chemical ontology
title_fullStr Automated compound classification using a chemical ontology
title_full_unstemmed Automated compound classification using a chemical ontology
title_short Automated compound classification using a chemical ontology
title_sort automated compound classification using a chemical ontology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3547776/
https://www.ncbi.nlm.nih.gov/pubmed/23273256
http://dx.doi.org/10.1186/1758-2946-4-40
work_keys_str_mv AT bobachclaudia automatedcompoundclassificationusingachemicalontology
AT bohmetimo automatedcompoundclassificationusingachemicalontology
AT laubeulf automatedcompoundclassificationusingachemicalontology
AT puschelanett automatedcompoundclassificationusingachemicalontology
AT weberlutz automatedcompoundclassificationusingachemicalontology