Cargando…

Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology?

The protein–protein association in cellular signaling networks (CSNs) often acts as weak, transient, and reversible domain–peptide interaction (DPI), in which a flexible peptide segment on the surface of one protein is recognized and bound by a rigid peptide-recognition domain from another. Reliable...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Qian, Lin, Jing, Wen, Li, Wang, Shaozhou, Zhou, Peng, Mei, Li, Shang, Shuyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8795790/
https://www.ncbi.nlm.nih.gov/pubmed/35096016
http://dx.doi.org/10.3389/fgene.2021.800857
_version_ 1784641150182752256
author Liu, Qian
Lin, Jing
Wen, Li
Wang, Shaozhou
Zhou, Peng
Mei, Li
Shang, Shuyong
author_facet Liu, Qian
Lin, Jing
Wen, Li
Wang, Shaozhou
Zhou, Peng
Mei, Li
Shang, Shuyong
author_sort Liu, Qian
collection PubMed
description The protein–protein association in cellular signaling networks (CSNs) often acts as weak, transient, and reversible domain–peptide interaction (DPI), in which a flexible peptide segment on the surface of one protein is recognized and bound by a rigid peptide-recognition domain from another. Reliable modeling and accurate prediction of DPI binding affinities would help to ascertain the diverse biological events involved in CSNs and benefit our understanding of various biological implications underlying DPIs. Traditionally, peptide quantitative structure-activity relationship (pQSAR) has been widely used to model and predict the biological activity of oligopeptides, which employs amino acid descriptors (AADs) to characterize peptide structures at sequence level and then statistically correlate the resulting descriptor vector with observed activity data via regression. However, the QSAR has not yet been widely applied to treat the direct binding behavior of large-scale peptide ligands to their protein receptors. In this work, we attempted to clarify whether the pQSAR methodology can work effectively for modeling and predicting DPI affinities in a high-throughput manner? Over twenty thousand short linear motif (SLiM)-containing peptide segments involved in SH3, PDZ and 14-3-3 domain-medicated CSNs were compiled to define a comprehensive sequence-based data set of DPI affinities, which were represented by the Boehringer light units (BLUs) derived from previous arbitrary light intensity assays following SPOT peptide synthesis. Four sophisticated MLMs (MLMs) were then utilized to perform pQSAR modeling on the set described with different AADs to systematically create a variety of linear and nonlinear predictors, and then verified by rigorous statistical test. It is revealed that the genome-wide DPI events can only be modeled qualitatively or semiquantitatively with traditional pQSAR strategy due to the intrinsic disorder of peptide conformation and the potential interplay between different peptide residues. In addition, the arbitrary BLUs used to characterize DPI affinity values were measured via an indirect approach, which may not very reliable and may involve strong noise, thus leading to a considerable bias in the modeling. The R ( prd ) (2) = 0.7 can be considered as the upper limit of external generalization ability of the pQSAR methodology working on large-scale DPI affinity data.
format Online
Article
Text
id pubmed-8795790
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-87957902022-01-29 Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology? Liu, Qian Lin, Jing Wen, Li Wang, Shaozhou Zhou, Peng Mei, Li Shang, Shuyong Front Genet Genetics The protein–protein association in cellular signaling networks (CSNs) often acts as weak, transient, and reversible domain–peptide interaction (DPI), in which a flexible peptide segment on the surface of one protein is recognized and bound by a rigid peptide-recognition domain from another. Reliable modeling and accurate prediction of DPI binding affinities would help to ascertain the diverse biological events involved in CSNs and benefit our understanding of various biological implications underlying DPIs. Traditionally, peptide quantitative structure-activity relationship (pQSAR) has been widely used to model and predict the biological activity of oligopeptides, which employs amino acid descriptors (AADs) to characterize peptide structures at sequence level and then statistically correlate the resulting descriptor vector with observed activity data via regression. However, the QSAR has not yet been widely applied to treat the direct binding behavior of large-scale peptide ligands to their protein receptors. In this work, we attempted to clarify whether the pQSAR methodology can work effectively for modeling and predicting DPI affinities in a high-throughput manner? Over twenty thousand short linear motif (SLiM)-containing peptide segments involved in SH3, PDZ and 14-3-3 domain-medicated CSNs were compiled to define a comprehensive sequence-based data set of DPI affinities, which were represented by the Boehringer light units (BLUs) derived from previous arbitrary light intensity assays following SPOT peptide synthesis. Four sophisticated MLMs (MLMs) were then utilized to perform pQSAR modeling on the set described with different AADs to systematically create a variety of linear and nonlinear predictors, and then verified by rigorous statistical test. It is revealed that the genome-wide DPI events can only be modeled qualitatively or semiquantitatively with traditional pQSAR strategy due to the intrinsic disorder of peptide conformation and the potential interplay between different peptide residues. In addition, the arbitrary BLUs used to characterize DPI affinity values were measured via an indirect approach, which may not very reliable and may involve strong noise, thus leading to a considerable bias in the modeling. The R ( prd ) (2) = 0.7 can be considered as the upper limit of external generalization ability of the pQSAR methodology working on large-scale DPI affinity data. Frontiers Media S.A. 2022-01-14 /pmc/articles/PMC8795790/ /pubmed/35096016 http://dx.doi.org/10.3389/fgene.2021.800857 Text en Copyright © 2022 Liu, Lin, Wen, Wang, Zhou, Mei and Shang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Qian
Lin, Jing
Wen, Li
Wang, Shaozhou
Zhou, Peng
Mei, Li
Shang, Shuyong
Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology?
title Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology?
title_full Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology?
title_fullStr Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology?
title_full_unstemmed Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology?
title_short Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology?
title_sort systematic modeling, prediction, and comparison of domain–peptide affinities: does it work effectively with the peptide qsar methodology?
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8795790/
https://www.ncbi.nlm.nih.gov/pubmed/35096016
http://dx.doi.org/10.3389/fgene.2021.800857
work_keys_str_mv AT liuqian systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology
AT linjing systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology
AT wenli systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology
AT wangshaozhou systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology
AT zhoupeng systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology
AT meili systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology
AT shangshuyong systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology