Cargando…
Proteome scanning to predict PDZ domain interactions using support vector machines
BACKGROUND: PDZ domains mediate protein-protein interactions involved in important biological processes through the recognition of short linear motifs in their target proteins. Two recent independent studies have used protein microarray or phage display technology to detect PDZ domain interactions w...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2967561/ https://www.ncbi.nlm.nih.gov/pubmed/20939902 http://dx.doi.org/10.1186/1471-2105-11-507 |
_version_ | 1782189687995105280 |
---|---|
author | Hui, Shirley Bader, Gary D |
author_facet | Hui, Shirley Bader, Gary D |
author_sort | Hui, Shirley |
collection | PubMed |
description | BACKGROUND: PDZ domains mediate protein-protein interactions involved in important biological processes through the recognition of short linear motifs in their target proteins. Two recent independent studies have used protein microarray or phage display technology to detect PDZ domain interactions with peptide ligands on a large scale. Several computational predictors of PDZ domain interactions have been developed, however they are trained using only protein microarray data and focus on limited subsets of PDZ domains. An accurate predictor of genomic PDZ domain interactions would allow the proteomes of organisms to be scanned for potential binders. Such an application would require an accurate and precise predictor to avoid generating too many false positive hits given the large amount of possible interactors in a given proteome. Once validated these predictions will help to increase the coverage of current PDZ domain interaction networks and further our understanding of the roles that PDZ domains play in a variety of biological processes. RESULTS: We developed a PDZ domain interaction predictor using a support vector machine (SVM) trained with both protein microarray and phage display data. In order to use the phage display data for training, which only contains positive interactions, we developed a method to generate artificial negative interactions. Using cross-validation and a series of independent tests, we showed that our SVM successfully predicts interactions in different organisms. We then used the SVM to scan the proteomes of human, worm and fly to predict binders for several PDZ domains. Predictions were validated using known genomic interactions and published protein microarray experiments. Based on our results, new protein interactions potentially associated with Usher and Bardet-Biedl syndromes were predicted. A comparison of performance measures (F1 measure and FPR) for the SVM and published predictors demonstrated our SVM's improved accuracy and precision at proteome scanning. CONCLUSIONS: We built an SVM using mouse and human experimental training data to predict PDZ domain interactions. We showed that it correctly predicts known interactions from proteomes of different organisms and is more accurate and precise at proteome scanning compared with published state-of-the-art predictors. |
format | Text |
id | pubmed-2967561 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-29675612010-11-03 Proteome scanning to predict PDZ domain interactions using support vector machines Hui, Shirley Bader, Gary D BMC Bioinformatics Research Article BACKGROUND: PDZ domains mediate protein-protein interactions involved in important biological processes through the recognition of short linear motifs in their target proteins. Two recent independent studies have used protein microarray or phage display technology to detect PDZ domain interactions with peptide ligands on a large scale. Several computational predictors of PDZ domain interactions have been developed, however they are trained using only protein microarray data and focus on limited subsets of PDZ domains. An accurate predictor of genomic PDZ domain interactions would allow the proteomes of organisms to be scanned for potential binders. Such an application would require an accurate and precise predictor to avoid generating too many false positive hits given the large amount of possible interactors in a given proteome. Once validated these predictions will help to increase the coverage of current PDZ domain interaction networks and further our understanding of the roles that PDZ domains play in a variety of biological processes. RESULTS: We developed a PDZ domain interaction predictor using a support vector machine (SVM) trained with both protein microarray and phage display data. In order to use the phage display data for training, which only contains positive interactions, we developed a method to generate artificial negative interactions. Using cross-validation and a series of independent tests, we showed that our SVM successfully predicts interactions in different organisms. We then used the SVM to scan the proteomes of human, worm and fly to predict binders for several PDZ domains. Predictions were validated using known genomic interactions and published protein microarray experiments. Based on our results, new protein interactions potentially associated with Usher and Bardet-Biedl syndromes were predicted. A comparison of performance measures (F1 measure and FPR) for the SVM and published predictors demonstrated our SVM's improved accuracy and precision at proteome scanning. CONCLUSIONS: We built an SVM using mouse and human experimental training data to predict PDZ domain interactions. We showed that it correctly predicts known interactions from proteomes of different organisms and is more accurate and precise at proteome scanning compared with published state-of-the-art predictors. BioMed Central 2010-10-12 /pmc/articles/PMC2967561/ /pubmed/20939902 http://dx.doi.org/10.1186/1471-2105-11-507 Text en Copyright ©2010 Hui and Bader; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Hui, Shirley Bader, Gary D Proteome scanning to predict PDZ domain interactions using support vector machines |
title | Proteome scanning to predict PDZ domain interactions using support vector machines |
title_full | Proteome scanning to predict PDZ domain interactions using support vector machines |
title_fullStr | Proteome scanning to predict PDZ domain interactions using support vector machines |
title_full_unstemmed | Proteome scanning to predict PDZ domain interactions using support vector machines |
title_short | Proteome scanning to predict PDZ domain interactions using support vector machines |
title_sort | proteome scanning to predict pdz domain interactions using support vector machines |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2967561/ https://www.ncbi.nlm.nih.gov/pubmed/20939902 http://dx.doi.org/10.1186/1471-2105-11-507 |
work_keys_str_mv | AT huishirley proteomescanningtopredictpdzdomaininteractionsusingsupportvectormachines AT badergaryd proteomescanningtopredictpdzdomaininteractionsusingsupportvectormachines |