Cargando…
A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence
Motivation: Predicting protein interactions involving peptide recognition domains is essential for understanding the many important biological processes they mediate. It is important to consider the binding strength of these interactions to help us construct more biologically relevant protein intera...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3031032/ https://www.ncbi.nlm.nih.gov/pubmed/21127034 http://dx.doi.org/10.1093/bioinformatics/btq657 |
_version_ | 1782197312253067264 |
---|---|
author | Shao, Xiaojian Tan, Chris S. H. Voss, Courtney Li, Shawn S. C. Deng, Naiyang Bader, Gary D. |
author_facet | Shao, Xiaojian Tan, Chris S. H. Voss, Courtney Li, Shawn S. C. Deng, Naiyang Bader, Gary D. |
author_sort | Shao, Xiaojian |
collection | PubMed |
description | Motivation: Predicting protein interactions involving peptide recognition domains is essential for understanding the many important biological processes they mediate. It is important to consider the binding strength of these interactions to help us construct more biologically relevant protein interaction networks that consider cellular context and competition between potential binders. Results: We developed a novel regression framework that considers both positive (quantitative) and negative (qualitative) interaction data available for mouse PDZ domains to quantitatively predict interactions between PDZ domains, a large peptide recognition domain family, and their peptide ligands using primary sequence information. First, we show that it is possible to learn from existing quantitative and negative interaction data to infer the relative binding strength of interactions involving previously unseen PDZ domains and/or peptides given their primary sequence. Performance was measured using cross-validated hold out testing and testing with previously unseen PDZ domain–peptide interactions. Second, we find that incorporating negative data improves quantitative interaction prediction. Third, we show that sequence similarity is an important prediction performance determinant, which suggests that experimentally collecting additional quantitative interaction data for underrepresented PDZ domain subfamilies will improve prediction. Availability and Implementation: The Matlab code for our SemiSVR predictor and all data used here are available at http://baderlab.org/Data/PDZAffinity. Contact: gary.bader@utoronto.ca; dengnaiyang@cau.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-3031032 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-30310322011-02-02 A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence Shao, Xiaojian Tan, Chris S. H. Voss, Courtney Li, Shawn S. C. Deng, Naiyang Bader, Gary D. Bioinformatics Original Papers Motivation: Predicting protein interactions involving peptide recognition domains is essential for understanding the many important biological processes they mediate. It is important to consider the binding strength of these interactions to help us construct more biologically relevant protein interaction networks that consider cellular context and competition between potential binders. Results: We developed a novel regression framework that considers both positive (quantitative) and negative (qualitative) interaction data available for mouse PDZ domains to quantitatively predict interactions between PDZ domains, a large peptide recognition domain family, and their peptide ligands using primary sequence information. First, we show that it is possible to learn from existing quantitative and negative interaction data to infer the relative binding strength of interactions involving previously unseen PDZ domains and/or peptides given their primary sequence. Performance was measured using cross-validated hold out testing and testing with previously unseen PDZ domain–peptide interactions. Second, we find that incorporating negative data improves quantitative interaction prediction. Third, we show that sequence similarity is an important prediction performance determinant, which suggests that experimentally collecting additional quantitative interaction data for underrepresented PDZ domain subfamilies will improve prediction. Availability and Implementation: The Matlab code for our SemiSVR predictor and all data used here are available at http://baderlab.org/Data/PDZAffinity. Contact: gary.bader@utoronto.ca; dengnaiyang@cau.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-02-01 2010-12-02 /pmc/articles/PMC3031032/ /pubmed/21127034 http://dx.doi.org/10.1093/bioinformatics/btq657 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Shao, Xiaojian Tan, Chris S. H. Voss, Courtney Li, Shawn S. C. Deng, Naiyang Bader, Gary D. A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence |
title | A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence |
title_full | A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence |
title_fullStr | A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence |
title_full_unstemmed | A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence |
title_short | A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence |
title_sort | regression framework incorporating quantitative and negative interaction data improves quantitative prediction of pdz domain–peptide interaction from primary sequence |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3031032/ https://www.ncbi.nlm.nih.gov/pubmed/21127034 http://dx.doi.org/10.1093/bioinformatics/btq657 |
work_keys_str_mv | AT shaoxiaojian aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT tanchrissh aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT vosscourtney aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT lishawnsc aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT dengnaiyang aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT badergaryd aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT shaoxiaojian regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT tanchrissh regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT vosscourtney regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT lishawnsc regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT dengnaiyang regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence AT badergaryd regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence |