Cargando…

Characterization and prediction of residues determining protein functional specificity

Motivation: Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each protein's particular function-al specificity. Knowl...

Descripción completa

Detalles Bibliográficos
Autores principales: Capra, John A., Singh, Mona
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718669/
https://www.ncbi.nlm.nih.gov/pubmed/18450811
http://dx.doi.org/10.1093/bioinformatics/btn214
_version_ 1782170012424863744
author Capra, John A.
Singh, Mona
author_facet Capra, John A.
Singh, Mona
author_sort Capra, John A.
collection PubMed
description Motivation: Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each protein's particular function-al specificity. Knowledge of these specificity determining positions (SDPs) aids in protein function prediction, drug design and experimental analysis. A number of sequence-based computational methods have been introduced for identifying SDPs; however, their further development and evaluation have been hindered by the limited number of known experimentally determined SDPs. Results: We combine several bioinformatics resources to automate a process, typically undertaken manually, to build a dataset of SDPs. The resulting large dataset, which consists of SDPs in enzymes, enables us to characterize SDPs in terms of their physicochemical and evolution-ary properties. It also facilitates the large-scale evaluation of sequence-based SDP prediction methods. We present a simple sequence-based SDP prediction method, GroupSim, and show that, surprisingly, it is competitive with a representative set of current methods. We also describe ConsWin, a heuristic that considers sequence conservation of neighboring amino acids, and demonstrate that it improves the performance of all methods tested on our large dataset of enzyme SDPs. Availability: Datasets and GroupSim code are available online at http://compbio.cs.princeton.edu/specificity/ Contact: msingh@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2718669
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27186692009-07-31 Characterization and prediction of residues determining protein functional specificity Capra, John A. Singh, Mona Bioinformatics Original Papers Motivation: Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each protein's particular function-al specificity. Knowledge of these specificity determining positions (SDPs) aids in protein function prediction, drug design and experimental analysis. A number of sequence-based computational methods have been introduced for identifying SDPs; however, their further development and evaluation have been hindered by the limited number of known experimentally determined SDPs. Results: We combine several bioinformatics resources to automate a process, typically undertaken manually, to build a dataset of SDPs. The resulting large dataset, which consists of SDPs in enzymes, enables us to characterize SDPs in terms of their physicochemical and evolution-ary properties. It also facilitates the large-scale evaluation of sequence-based SDP prediction methods. We present a simple sequence-based SDP prediction method, GroupSim, and show that, surprisingly, it is competitive with a representative set of current methods. We also describe ConsWin, a heuristic that considers sequence conservation of neighboring amino acids, and demonstrate that it improves the performance of all methods tested on our large dataset of enzyme SDPs. Availability: Datasets and GroupSim code are available online at http://compbio.cs.princeton.edu/specificity/ Contact: msingh@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2008-07-01 2008-05-01 /pmc/articles/PMC2718669/ /pubmed/18450811 http://dx.doi.org/10.1093/bioinformatics/btn214 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Capra, John A.
Singh, Mona
Characterization and prediction of residues determining protein functional specificity
title Characterization and prediction of residues determining protein functional specificity
title_full Characterization and prediction of residues determining protein functional specificity
title_fullStr Characterization and prediction of residues determining protein functional specificity
title_full_unstemmed Characterization and prediction of residues determining protein functional specificity
title_short Characterization and prediction of residues determining protein functional specificity
title_sort characterization and prediction of residues determining protein functional specificity
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718669/
https://www.ncbi.nlm.nih.gov/pubmed/18450811
http://dx.doi.org/10.1093/bioinformatics/btn214
work_keys_str_mv AT caprajohna characterizationandpredictionofresiduesdeterminingproteinfunctionalspecificity
AT singhmona characterizationandpredictionofresiduesdeterminingproteinfunctionalspecificity