Cargando…

COMe: the ontology of bioinorganic proteins

BACKGROUND: Many characterised proteins contain metal ions, small organic molecules or modified residues. In contrast, the huge amount of data generated by genome projects consists exclusively of sequences with almost no annotation. One of the goals of the structural genomics initiative is to provid...

Descripción completa

Detalles Bibliográficos
Autores principales: Degtyarenko, Kirill, Contrino, Sergio
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC395836/
https://www.ncbi.nlm.nih.gov/pubmed/15113423
http://dx.doi.org/10.1186/1472-6807-4-3
_version_ 1782121338210615296
author Degtyarenko, Kirill
Contrino, Sergio
author_facet Degtyarenko, Kirill
Contrino, Sergio
author_sort Degtyarenko, Kirill
collection PubMed
description BACKGROUND: Many characterised proteins contain metal ions, small organic molecules or modified residues. In contrast, the huge amount of data generated by genome projects consists exclusively of sequences with almost no annotation. One of the goals of the structural genomics initiative is to provide representative three-dimensional (3-D) structures for as many protein/domain folds as possible to allow successful homology modelling. However, important functional features such as metal co-ordination or a type of prosthetic group are not always conserved in homologous proteins. So far, the problem of correct annotation of bioinorganic proteins has been largely ignored by the bioinformatics community and information on bioinorganic centres obtained by methods other than crystallography or NMR is only available in literature databases. RESULTS: COMe (Co-Ordination of Metals) represents the ontology for bioinorganic and other small molecule centres in complex proteins. COMe consists of three types of entities: 'bioinorganic motif' (BIM), 'molecule' (MOL), and 'complex proteins' (PRX), with each entity being assigned a unique identifier. A BIM consists of at least one centre (metal atom, inorganic cluster, organic molecule) and two or more endogenous and/or exogenous ligands. BIMs are represented as one-dimensional (1-D) strings and 2-D diagrams. A MOL entity represents a 'small molecule' which, when in complex with one or more polypeptides, forms a functional protein. The PRX entities refer to the functional proteins as well as to separate protein domains and subunits. The complex proteins in COMe are subdivided into three categories: (i) metalloproteins, (ii) organic prosthetic group proteins and (iii) modified amino acid proteins. The data are currently stored in both XML format and a relational database and are available at . CONCLUSION: COMe provides the classification of proteins according to their 'bioinorganic' features and thus is orthogonal to other classification schemes, such as those based on sequence similarity, 3-D fold, enzyme activity, or biological process. The hierarchical organisation of the controlled vocabulary allows both for annotation and querying at different levels of granularity.
format Text
id pubmed-395836
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3958362004-04-25 COMe: the ontology of bioinorganic proteins Degtyarenko, Kirill Contrino, Sergio BMC Struct Biol Research Article BACKGROUND: Many characterised proteins contain metal ions, small organic molecules or modified residues. In contrast, the huge amount of data generated by genome projects consists exclusively of sequences with almost no annotation. One of the goals of the structural genomics initiative is to provide representative three-dimensional (3-D) structures for as many protein/domain folds as possible to allow successful homology modelling. However, important functional features such as metal co-ordination or a type of prosthetic group are not always conserved in homologous proteins. So far, the problem of correct annotation of bioinorganic proteins has been largely ignored by the bioinformatics community and information on bioinorganic centres obtained by methods other than crystallography or NMR is only available in literature databases. RESULTS: COMe (Co-Ordination of Metals) represents the ontology for bioinorganic and other small molecule centres in complex proteins. COMe consists of three types of entities: 'bioinorganic motif' (BIM), 'molecule' (MOL), and 'complex proteins' (PRX), with each entity being assigned a unique identifier. A BIM consists of at least one centre (metal atom, inorganic cluster, organic molecule) and two or more endogenous and/or exogenous ligands. BIMs are represented as one-dimensional (1-D) strings and 2-D diagrams. A MOL entity represents a 'small molecule' which, when in complex with one or more polypeptides, forms a functional protein. The PRX entities refer to the functional proteins as well as to separate protein domains and subunits. The complex proteins in COMe are subdivided into three categories: (i) metalloproteins, (ii) organic prosthetic group proteins and (iii) modified amino acid proteins. The data are currently stored in both XML format and a relational database and are available at . CONCLUSION: COMe provides the classification of proteins according to their 'bioinorganic' features and thus is orthogonal to other classification schemes, such as those based on sequence similarity, 3-D fold, enzyme activity, or biological process. The hierarchical organisation of the controlled vocabulary allows both for annotation and querying at different levels of granularity. BioMed Central 2004-02-27 /pmc/articles/PMC395836/ /pubmed/15113423 http://dx.doi.org/10.1186/1472-6807-4-3 Text en Copyright © 2004 Degtyarenko and Contrino; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Research Article
Degtyarenko, Kirill
Contrino, Sergio
COMe: the ontology of bioinorganic proteins
title COMe: the ontology of bioinorganic proteins
title_full COMe: the ontology of bioinorganic proteins
title_fullStr COMe: the ontology of bioinorganic proteins
title_full_unstemmed COMe: the ontology of bioinorganic proteins
title_short COMe: the ontology of bioinorganic proteins
title_sort come: the ontology of bioinorganic proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC395836/
https://www.ncbi.nlm.nih.gov/pubmed/15113423
http://dx.doi.org/10.1186/1472-6807-4-3
work_keys_str_mv AT degtyarenkokirill cometheontologyofbioinorganicproteins
AT contrinosergio cometheontologyofbioinorganicproteins