Cargando…
Characterization and prediction of residues determining protein functional specificity
Motivation: Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each protein's particular function-al specificity. Knowl...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718669/ https://www.ncbi.nlm.nih.gov/pubmed/18450811 http://dx.doi.org/10.1093/bioinformatics/btn214 |
_version_ | 1782170012424863744 |
---|---|
author | Capra, John A. Singh, Mona |
author_facet | Capra, John A. Singh, Mona |
author_sort | Capra, John A. |
collection | PubMed |
description | Motivation: Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each protein's particular function-al specificity. Knowledge of these specificity determining positions (SDPs) aids in protein function prediction, drug design and experimental analysis. A number of sequence-based computational methods have been introduced for identifying SDPs; however, their further development and evaluation have been hindered by the limited number of known experimentally determined SDPs. Results: We combine several bioinformatics resources to automate a process, typically undertaken manually, to build a dataset of SDPs. The resulting large dataset, which consists of SDPs in enzymes, enables us to characterize SDPs in terms of their physicochemical and evolution-ary properties. It also facilitates the large-scale evaluation of sequence-based SDP prediction methods. We present a simple sequence-based SDP prediction method, GroupSim, and show that, surprisingly, it is competitive with a representative set of current methods. We also describe ConsWin, a heuristic that considers sequence conservation of neighboring amino acids, and demonstrate that it improves the performance of all methods tested on our large dataset of enzyme SDPs. Availability: Datasets and GroupSim code are available online at http://compbio.cs.princeton.edu/specificity/ Contact: msingh@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-2718669 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-27186692009-07-31 Characterization and prediction of residues determining protein functional specificity Capra, John A. Singh, Mona Bioinformatics Original Papers Motivation: Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each protein's particular function-al specificity. Knowledge of these specificity determining positions (SDPs) aids in protein function prediction, drug design and experimental analysis. A number of sequence-based computational methods have been introduced for identifying SDPs; however, their further development and evaluation have been hindered by the limited number of known experimentally determined SDPs. Results: We combine several bioinformatics resources to automate a process, typically undertaken manually, to build a dataset of SDPs. The resulting large dataset, which consists of SDPs in enzymes, enables us to characterize SDPs in terms of their physicochemical and evolution-ary properties. It also facilitates the large-scale evaluation of sequence-based SDP prediction methods. We present a simple sequence-based SDP prediction method, GroupSim, and show that, surprisingly, it is competitive with a representative set of current methods. We also describe ConsWin, a heuristic that considers sequence conservation of neighboring amino acids, and demonstrate that it improves the performance of all methods tested on our large dataset of enzyme SDPs. Availability: Datasets and GroupSim code are available online at http://compbio.cs.princeton.edu/specificity/ Contact: msingh@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2008-07-01 2008-05-01 /pmc/articles/PMC2718669/ /pubmed/18450811 http://dx.doi.org/10.1093/bioinformatics/btn214 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Capra, John A. Singh, Mona Characterization and prediction of residues determining protein functional specificity |
title | Characterization and prediction of residues determining protein functional specificity |
title_full | Characterization and prediction of residues determining protein functional specificity |
title_fullStr | Characterization and prediction of residues determining protein functional specificity |
title_full_unstemmed | Characterization and prediction of residues determining protein functional specificity |
title_short | Characterization and prediction of residues determining protein functional specificity |
title_sort | characterization and prediction of residues determining protein functional specificity |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718669/ https://www.ncbi.nlm.nih.gov/pubmed/18450811 http://dx.doi.org/10.1093/bioinformatics/btn214 |
work_keys_str_mv | AT caprajohna characterizationandpredictionofresiduesdeterminingproteinfunctionalspecificity AT singhmona characterizationandpredictionofresiduesdeterminingproteinfunctionalspecificity |