Cargando…

A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence

Motivation: Predicting protein interactions involving peptide recognition domains is essential for understanding the many important biological processes they mediate. It is important to consider the binding strength of these interactions to help us construct more biologically relevant protein intera...

Descripción completa

Detalles Bibliográficos
Autores principales: Shao, Xiaojian, Tan, Chris S. H., Voss, Courtney, Li, Shawn S. C., Deng, Naiyang, Bader, Gary D.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3031032/
https://www.ncbi.nlm.nih.gov/pubmed/21127034
http://dx.doi.org/10.1093/bioinformatics/btq657
_version_ 1782197312253067264
author Shao, Xiaojian
Tan, Chris S. H.
Voss, Courtney
Li, Shawn S. C.
Deng, Naiyang
Bader, Gary D.
author_facet Shao, Xiaojian
Tan, Chris S. H.
Voss, Courtney
Li, Shawn S. C.
Deng, Naiyang
Bader, Gary D.
author_sort Shao, Xiaojian
collection PubMed
description Motivation: Predicting protein interactions involving peptide recognition domains is essential for understanding the many important biological processes they mediate. It is important to consider the binding strength of these interactions to help us construct more biologically relevant protein interaction networks that consider cellular context and competition between potential binders. Results: We developed a novel regression framework that considers both positive (quantitative) and negative (qualitative) interaction data available for mouse PDZ domains to quantitatively predict interactions between PDZ domains, a large peptide recognition domain family, and their peptide ligands using primary sequence information. First, we show that it is possible to learn from existing quantitative and negative interaction data to infer the relative binding strength of interactions involving previously unseen PDZ domains and/or peptides given their primary sequence. Performance was measured using cross-validated hold out testing and testing with previously unseen PDZ domain–peptide interactions. Second, we find that incorporating negative data improves quantitative interaction prediction. Third, we show that sequence similarity is an important prediction performance determinant, which suggests that experimentally collecting additional quantitative interaction data for underrepresented PDZ domain subfamilies will improve prediction. Availability and Implementation: The Matlab code for our SemiSVR predictor and all data used here are available at http://baderlab.org/Data/PDZAffinity. Contact: gary.bader@utoronto.ca; dengnaiyang@cau.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-3031032
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30310322011-02-02 A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence Shao, Xiaojian Tan, Chris S. H. Voss, Courtney Li, Shawn S. C. Deng, Naiyang Bader, Gary D. Bioinformatics Original Papers Motivation: Predicting protein interactions involving peptide recognition domains is essential for understanding the many important biological processes they mediate. It is important to consider the binding strength of these interactions to help us construct more biologically relevant protein interaction networks that consider cellular context and competition between potential binders. Results: We developed a novel regression framework that considers both positive (quantitative) and negative (qualitative) interaction data available for mouse PDZ domains to quantitatively predict interactions between PDZ domains, a large peptide recognition domain family, and their peptide ligands using primary sequence information. First, we show that it is possible to learn from existing quantitative and negative interaction data to infer the relative binding strength of interactions involving previously unseen PDZ domains and/or peptides given their primary sequence. Performance was measured using cross-validated hold out testing and testing with previously unseen PDZ domain–peptide interactions. Second, we find that incorporating negative data improves quantitative interaction prediction. Third, we show that sequence similarity is an important prediction performance determinant, which suggests that experimentally collecting additional quantitative interaction data for underrepresented PDZ domain subfamilies will improve prediction. Availability and Implementation: The Matlab code for our SemiSVR predictor and all data used here are available at http://baderlab.org/Data/PDZAffinity. Contact: gary.bader@utoronto.ca; dengnaiyang@cau.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-02-01 2010-12-02 /pmc/articles/PMC3031032/ /pubmed/21127034 http://dx.doi.org/10.1093/bioinformatics/btq657 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Shao, Xiaojian
Tan, Chris S. H.
Voss, Courtney
Li, Shawn S. C.
Deng, Naiyang
Bader, Gary D.
A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence
title A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence
title_full A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence
title_fullStr A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence
title_full_unstemmed A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence
title_short A regression framework incorporating quantitative and negative interaction data improves quantitative prediction of PDZ domain–peptide interaction from primary sequence
title_sort regression framework incorporating quantitative and negative interaction data improves quantitative prediction of pdz domain–peptide interaction from primary sequence
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3031032/
https://www.ncbi.nlm.nih.gov/pubmed/21127034
http://dx.doi.org/10.1093/bioinformatics/btq657
work_keys_str_mv AT shaoxiaojian aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT tanchrissh aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT vosscourtney aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT lishawnsc aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT dengnaiyang aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT badergaryd aregressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT shaoxiaojian regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT tanchrissh regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT vosscourtney regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT lishawnsc regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT dengnaiyang regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence
AT badergaryd regressionframeworkincorporatingquantitativeandnegativeinteractiondataimprovesquantitativepredictionofpdzdomainpeptideinteractionfromprimarysequence