Cargando…

Identification of recurring protein structure microenvironments and discovery of novel functional sites around CYS residues

BACKGROUND: The emergence of structural genomics presents significant challenges in the annotation of biologically uncharacterized proteins. Unfortunately, our ability to analyze these proteins is restricted by the limited catalog of known molecular functions and their associated 3D motifs. RESULTS:...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Shirley, Liu, Tianyun, Altman, Russ B
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2833161/
https://www.ncbi.nlm.nih.gov/pubmed/20122268
http://dx.doi.org/10.1186/1472-6807-10-4
_version_ 1782178362402275328
author Wu, Shirley
Liu, Tianyun
Altman, Russ B
author_facet Wu, Shirley
Liu, Tianyun
Altman, Russ B
author_sort Wu, Shirley
collection PubMed
description BACKGROUND: The emergence of structural genomics presents significant challenges in the annotation of biologically uncharacterized proteins. Unfortunately, our ability to analyze these proteins is restricted by the limited catalog of known molecular functions and their associated 3D motifs. RESULTS: In order to identify novel 3D motifs that may be associated with molecular functions, we employ an unsupervised, two-phase clustering approach that combines k-means and hierarchical clustering with knowledge-informed cluster selection and annotation methods. We applied the approach to approximately 20,000 cysteine-based protein microenvironments (3D regions 7.5 Å in radius) and identified 70 interesting clusters, some of which represent known motifs (e.g. metal binding and phosphatase activity), and some of which are novel, including several zinc binding sites. Detailed annotation results are available online for all 70 clusters at http://feature.stanford.edu/clustering/cys. CONCLUSIONS: The use of microenvironments instead of backbone geometric criteria enables flexible exploration of protein function space, and detection of recurring motifs that are discontinuous in sequence and diverse in structure. Clustering microenvironments may thus help to functionally characterize novel proteins and better understand the protein structure-function relationship.
format Text
id pubmed-2833161
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28331612010-03-06 Identification of recurring protein structure microenvironments and discovery of novel functional sites around CYS residues Wu, Shirley Liu, Tianyun Altman, Russ B BMC Struct Biol Research article BACKGROUND: The emergence of structural genomics presents significant challenges in the annotation of biologically uncharacterized proteins. Unfortunately, our ability to analyze these proteins is restricted by the limited catalog of known molecular functions and their associated 3D motifs. RESULTS: In order to identify novel 3D motifs that may be associated with molecular functions, we employ an unsupervised, two-phase clustering approach that combines k-means and hierarchical clustering with knowledge-informed cluster selection and annotation methods. We applied the approach to approximately 20,000 cysteine-based protein microenvironments (3D regions 7.5 Å in radius) and identified 70 interesting clusters, some of which represent known motifs (e.g. metal binding and phosphatase activity), and some of which are novel, including several zinc binding sites. Detailed annotation results are available online for all 70 clusters at http://feature.stanford.edu/clustering/cys. CONCLUSIONS: The use of microenvironments instead of backbone geometric criteria enables flexible exploration of protein function space, and detection of recurring motifs that are discontinuous in sequence and diverse in structure. Clustering microenvironments may thus help to functionally characterize novel proteins and better understand the protein structure-function relationship. BioMed Central 2010-02-02 /pmc/articles/PMC2833161/ /pubmed/20122268 http://dx.doi.org/10.1186/1472-6807-10-4 Text en Copyright ©2010 Wu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Wu, Shirley
Liu, Tianyun
Altman, Russ B
Identification of recurring protein structure microenvironments and discovery of novel functional sites around CYS residues
title Identification of recurring protein structure microenvironments and discovery of novel functional sites around CYS residues
title_full Identification of recurring protein structure microenvironments and discovery of novel functional sites around CYS residues
title_fullStr Identification of recurring protein structure microenvironments and discovery of novel functional sites around CYS residues
title_full_unstemmed Identification of recurring protein structure microenvironments and discovery of novel functional sites around CYS residues
title_short Identification of recurring protein structure microenvironments and discovery of novel functional sites around CYS residues
title_sort identification of recurring protein structure microenvironments and discovery of novel functional sites around cys residues
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2833161/
https://www.ncbi.nlm.nih.gov/pubmed/20122268
http://dx.doi.org/10.1186/1472-6807-10-4
work_keys_str_mv AT wushirley identificationofrecurringproteinstructuremicroenvironmentsanddiscoveryofnovelfunctionalsitesaroundcysresidues
AT liutianyun identificationofrecurringproteinstructuremicroenvironmentsanddiscoveryofnovelfunctionalsitesaroundcysresidues
AT altmanrussb identificationofrecurringproteinstructuremicroenvironmentsanddiscoveryofnovelfunctionalsitesaroundcysresidues