Cargando…

Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences

BACKGROUND: While there are many methods for predicting protein-protein interaction, very few can determine the specific site of interaction on each protein. Characterization of the specific sequence regions mediating interaction (binding sites) is crucial for an understanding of cellular pathways....

Descripción completa

Detalles Bibliográficos
Autores principales: Amos-Binks, Adam, Patulea, Catalin, Pitre, Sylvain, Schoenrock, Andrew, Gui, Yuan, Green, James R, Golshani, Ashkan, Dehne, Frank
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3120708/
https://www.ncbi.nlm.nih.gov/pubmed/21635751
http://dx.doi.org/10.1186/1471-2105-12-225
_version_ 1782206739330891776
author Amos-Binks, Adam
Patulea, Catalin
Pitre, Sylvain
Schoenrock, Andrew
Gui, Yuan
Green, James R
Golshani, Ashkan
Dehne, Frank
author_facet Amos-Binks, Adam
Patulea, Catalin
Pitre, Sylvain
Schoenrock, Andrew
Gui, Yuan
Green, James R
Golshani, Ashkan
Dehne, Frank
author_sort Amos-Binks, Adam
collection PubMed
description BACKGROUND: While there are many methods for predicting protein-protein interaction, very few can determine the specific site of interaction on each protein. Characterization of the specific sequence regions mediating interaction (binding sites) is crucial for an understanding of cellular pathways. Experimental methods often report false binding sites due to experimental limitations, while computational methods tend to require data which is not available at the proteome-scale. Here we present PIPE-Sites, a novel method of protein specific binding site prediction based on pairs of re-occurring polypeptide sequences, which have been previously shown to accurately predict protein-protein interactions. PIPE-Sites operates at high specificity and requires only the sequences of query proteins and a database of known binary interactions with no binding site data, making it applicable to binding site prediction at the proteome-scale. RESULTS: PIPE-Sites was evaluated using a dataset of 265 yeast and 423 human interacting proteins pairs with experimentally-determined binding sites. We found that PIPE-Sites predictions were closer to the confirmed binding site than those of two existing binding site prediction methods based on domain-domain interactions, when applied to the same dataset. Finally, we applied PIPE-Sites to two datasets of 2347 yeast and 14,438 human novel interacting protein pairs predicted to interact with high confidence. An analysis of the predicted interaction sites revealed a number of protein subsequences which are highly re-occurring in binding sites and which may represent novel binding motifs. CONCLUSIONS: PIPE-Sites is an accurate method for predicting protein binding sites and is applicable to the proteome-scale. Thus, PIPE-Sites could be useful for exhaustive analysis of protein binding patterns in whole proteomes as well as discovery of novel binding motifs. PIPE-Sites is available online at http://pipe-sites.cgmlab.org/.
format Online
Article
Text
id pubmed-3120708
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31207082011-06-23 Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences Amos-Binks, Adam Patulea, Catalin Pitre, Sylvain Schoenrock, Andrew Gui, Yuan Green, James R Golshani, Ashkan Dehne, Frank BMC Bioinformatics Methodology Article BACKGROUND: While there are many methods for predicting protein-protein interaction, very few can determine the specific site of interaction on each protein. Characterization of the specific sequence regions mediating interaction (binding sites) is crucial for an understanding of cellular pathways. Experimental methods often report false binding sites due to experimental limitations, while computational methods tend to require data which is not available at the proteome-scale. Here we present PIPE-Sites, a novel method of protein specific binding site prediction based on pairs of re-occurring polypeptide sequences, which have been previously shown to accurately predict protein-protein interactions. PIPE-Sites operates at high specificity and requires only the sequences of query proteins and a database of known binary interactions with no binding site data, making it applicable to binding site prediction at the proteome-scale. RESULTS: PIPE-Sites was evaluated using a dataset of 265 yeast and 423 human interacting proteins pairs with experimentally-determined binding sites. We found that PIPE-Sites predictions were closer to the confirmed binding site than those of two existing binding site prediction methods based on domain-domain interactions, when applied to the same dataset. Finally, we applied PIPE-Sites to two datasets of 2347 yeast and 14,438 human novel interacting protein pairs predicted to interact with high confidence. An analysis of the predicted interaction sites revealed a number of protein subsequences which are highly re-occurring in binding sites and which may represent novel binding motifs. CONCLUSIONS: PIPE-Sites is an accurate method for predicting protein binding sites and is applicable to the proteome-scale. Thus, PIPE-Sites could be useful for exhaustive analysis of protein binding patterns in whole proteomes as well as discovery of novel binding motifs. PIPE-Sites is available online at http://pipe-sites.cgmlab.org/. BioMed Central 2011-06-02 /pmc/articles/PMC3120708/ /pubmed/21635751 http://dx.doi.org/10.1186/1471-2105-12-225 Text en Copyright ©2011 Amos-Binks et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Amos-Binks, Adam
Patulea, Catalin
Pitre, Sylvain
Schoenrock, Andrew
Gui, Yuan
Green, James R
Golshani, Ashkan
Dehne, Frank
Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences
title Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences
title_full Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences
title_fullStr Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences
title_full_unstemmed Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences
title_short Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences
title_sort binding site prediction for protein-protein interactions and novel motif discovery using re-occurring polypeptide sequences
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3120708/
https://www.ncbi.nlm.nih.gov/pubmed/21635751
http://dx.doi.org/10.1186/1471-2105-12-225
work_keys_str_mv AT amosbinksadam bindingsitepredictionforproteinproteininteractionsandnovelmotifdiscoveryusingreoccurringpolypeptidesequences
AT patuleacatalin bindingsitepredictionforproteinproteininteractionsandnovelmotifdiscoveryusingreoccurringpolypeptidesequences
AT pitresylvain bindingsitepredictionforproteinproteininteractionsandnovelmotifdiscoveryusingreoccurringpolypeptidesequences
AT schoenrockandrew bindingsitepredictionforproteinproteininteractionsandnovelmotifdiscoveryusingreoccurringpolypeptidesequences
AT guiyuan bindingsitepredictionforproteinproteininteractionsandnovelmotifdiscoveryusingreoccurringpolypeptidesequences
AT greenjamesr bindingsitepredictionforproteinproteininteractionsandnovelmotifdiscoveryusingreoccurringpolypeptidesequences
AT golshaniashkan bindingsitepredictionforproteinproteininteractionsandnovelmotifdiscoveryusingreoccurringpolypeptidesequences
AT dehnefrank bindingsitepredictionforproteinproteininteractionsandnovelmotifdiscoveryusingreoccurringpolypeptidesequences