
Proteome scanning to predict PDZ domain interactions using support vector machines

BACKGROUND: PDZ domains mediate protein-protein interactions involved in important biological processes through the recognition of short linear motifs in their target proteins. Two recent independent studies have used protein microarray or phage display technology to detect PDZ domain interactions w...


Bibliographic Details
Main authors: Hui, Shirley; Bader, Gary D
Format: Text
Language: English
Published: BioMed Central 2010
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2967561/
https://www.ncbi.nlm.nih.gov/pubmed/20939902
http://dx.doi.org/10.1186/1471-2105-11-507
author Hui, Shirley
Bader, Gary D
collection PubMed
description BACKGROUND: PDZ domains mediate protein-protein interactions involved in important biological processes through the recognition of short linear motifs in their target proteins. Two recent independent studies have used protein microarray or phage display technology to detect PDZ domain interactions with peptide ligands on a large scale. Several computational predictors of PDZ domain interactions have been developed; however, they are trained using only protein microarray data and focus on limited subsets of PDZ domains. An accurate predictor of genomic PDZ domain interactions would allow the proteomes of organisms to be scanned for potential binders. Such an application would require an accurate and precise predictor to avoid generating too many false positive hits, given the large number of possible interactors in a given proteome. Once validated, these predictions will help to increase the coverage of current PDZ domain interaction networks and further our understanding of the roles that PDZ domains play in a variety of biological processes. RESULTS: We developed a PDZ domain interaction predictor using a support vector machine (SVM) trained with both protein microarray and phage display data. To use the phage display data, which contain only positive interactions, for training, we developed a method to generate artificial negative interactions. Using cross-validation and a series of independent tests, we showed that our SVM successfully predicts interactions in different organisms. We then used the SVM to scan the proteomes of human, worm and fly to predict binders for several PDZ domains. Predictions were validated using known genomic interactions and published protein microarray experiments. Based on our results, new protein interactions potentially associated with Usher and Bardet-Biedl syndromes were predicted. A comparison of performance measures (F1 measure and FPR) for the SVM and published predictors demonstrated our SVM's improved accuracy and precision at proteome scanning. CONCLUSIONS: We built an SVM using mouse and human experimental training data to predict PDZ domain interactions. We showed that it correctly predicts known interactions from proteomes of different organisms and is more accurate and precise at proteome scanning than published state-of-the-art predictors.
format Text
id pubmed-2967561
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2967561 2010-11-03 Proteome scanning to predict PDZ domain interactions using support vector machines Hui, Shirley Bader, Gary D BMC Bioinformatics Research Article BioMed Central 2010-10-12 /pmc/articles/PMC2967561/ /pubmed/20939902 http://dx.doi.org/10.1186/1471-2105-11-507 Text en Copyright ©2010 Hui and Bader; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Proteome scanning to predict PDZ domain interactions using support vector machines
topic Research Article