Cargando…

Automatic generation of 3D motifs for classification of protein binding sites

BACKGROUND: Since many of the new protein structures delivered by high-throughput processes do not have any known function, there is a need for structure-based prediction of protein function. Protein 3D structures can be clustered according to their fold or secondary structures to produce classes of...

Descripción completa

Detalles Bibliográficos
Autores principales: Nebel, Jean-Christophe, Herzyk, Pawel, Gilbert, David R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1995225/
https://www.ncbi.nlm.nih.gov/pubmed/17760982
http://dx.doi.org/10.1186/1471-2105-8-321
_version_ 1782135517785096192
author Nebel, Jean-Christophe
Herzyk, Pawel
Gilbert, David R
author_facet Nebel, Jean-Christophe
Herzyk, Pawel
Gilbert, David R
author_sort Nebel, Jean-Christophe
collection PubMed
description BACKGROUND: Since many of the new protein structures delivered by high-throughput processes do not have any known function, there is a need for structure-based prediction of protein function. Protein 3D structures can be clustered according to their fold or secondary structures to produce classes of some functional significance. A recent alternative has been to detect specific 3D motifs which are often associated to active sites. Unfortunately, there are very few known 3D motifs, which are usually the result of a manual process, compared to the number of sequential motifs already known. In this paper, we report a method to automatically generate 3D motifs of protein structure binding sites based on consensus atom positions and evaluate it on a set of adenine based ligands. RESULTS: Our new approach was validated by generating automatically 3D patterns for the main adenine based ligands, i.e. AMP, ADP and ATP. Out of the 18 detected patterns, only one, the ADP4 pattern, is not associated with well defined structural patterns. Moreover, most of the patterns could be classified as binding site 3D motifs. Literature research revealed that the ADP4 pattern actually corresponds to structural features which show complex evolutionary links between ligases and transferases. Therefore, all of the generated patterns prove to be meaningful. Each pattern was used to query all PDB proteins which bind either purine based or guanine based ligands, in order to evaluate the classification and annotation properties of the pattern. Overall, our 3D patterns matched 31% of proteins with adenine based ligands and 95.5% of them were classified correctly. CONCLUSION: A new metric has been introduced allowing the classification of proteins according to the similarity of atomic environment of binding sites, and a methodology has been developed to automatically produce 3D patterns from that classification. A study of proteins binding adenine based ligands showed that these 3D patterns are not only biochemically meaningful, but can be used for protein classification and annotation.
format Text
id pubmed-1995225
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19952252007-09-29 Automatic generation of 3D motifs for classification of protein binding sites Nebel, Jean-Christophe Herzyk, Pawel Gilbert, David R BMC Bioinformatics Research Article BACKGROUND: Since many of the new protein structures delivered by high-throughput processes do not have any known function, there is a need for structure-based prediction of protein function. Protein 3D structures can be clustered according to their fold or secondary structures to produce classes of some functional significance. A recent alternative has been to detect specific 3D motifs which are often associated to active sites. Unfortunately, there are very few known 3D motifs, which are usually the result of a manual process, compared to the number of sequential motifs already known. In this paper, we report a method to automatically generate 3D motifs of protein structure binding sites based on consensus atom positions and evaluate it on a set of adenine based ligands. RESULTS: Our new approach was validated by generating automatically 3D patterns for the main adenine based ligands, i.e. AMP, ADP and ATP. Out of the 18 detected patterns, only one, the ADP4 pattern, is not associated with well defined structural patterns. Moreover, most of the patterns could be classified as binding site 3D motifs. Literature research revealed that the ADP4 pattern actually corresponds to structural features which show complex evolutionary links between ligases and transferases. Therefore, all of the generated patterns prove to be meaningful. Each pattern was used to query all PDB proteins which bind either purine based or guanine based ligands, in order to evaluate the classification and annotation properties of the pattern. Overall, our 3D patterns matched 31% of proteins with adenine based ligands and 95.5% of them were classified correctly. CONCLUSION: A new metric has been introduced allowing the classification of proteins according to the similarity of atomic environment of binding sites, and a methodology has been developed to automatically produce 3D patterns from that classification. A study of proteins binding adenine based ligands showed that these 3D patterns are not only biochemically meaningful, but can be used for protein classification and annotation. BioMed Central 2007-08-30 /pmc/articles/PMC1995225/ /pubmed/17760982 http://dx.doi.org/10.1186/1471-2105-8-321 Text en Copyright © 2007 Nebel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Nebel, Jean-Christophe
Herzyk, Pawel
Gilbert, David R
Automatic generation of 3D motifs for classification of protein binding sites
title Automatic generation of 3D motifs for classification of protein binding sites
title_full Automatic generation of 3D motifs for classification of protein binding sites
title_fullStr Automatic generation of 3D motifs for classification of protein binding sites
title_full_unstemmed Automatic generation of 3D motifs for classification of protein binding sites
title_short Automatic generation of 3D motifs for classification of protein binding sites
title_sort automatic generation of 3d motifs for classification of protein binding sites
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1995225/
https://www.ncbi.nlm.nih.gov/pubmed/17760982
http://dx.doi.org/10.1186/1471-2105-8-321
work_keys_str_mv AT nebeljeanchristophe automaticgenerationof3dmotifsforclassificationofproteinbindingsites
AT herzykpawel automaticgenerationof3dmotifsforclassificationofproteinbindingsites
AT gilbertdavidr automaticgenerationof3dmotifsforclassificationofproteinbindingsites