Cargando…
Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology?
The protein–protein association in cellular signaling networks (CSNs) often acts as weak, transient, and reversible domain–peptide interaction (DPI), in which a flexible peptide segment on the surface of one protein is recognized and bound by a rigid peptide-recognition domain from another. Reliable...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8795790/ https://www.ncbi.nlm.nih.gov/pubmed/35096016 http://dx.doi.org/10.3389/fgene.2021.800857 |
_version_ | 1784641150182752256 |
---|---|
author | Liu, Qian Lin, Jing Wen, Li Wang, Shaozhou Zhou, Peng Mei, Li Shang, Shuyong |
author_facet | Liu, Qian Lin, Jing Wen, Li Wang, Shaozhou Zhou, Peng Mei, Li Shang, Shuyong |
author_sort | Liu, Qian |
collection | PubMed |
description | The protein–protein association in cellular signaling networks (CSNs) often acts as weak, transient, and reversible domain–peptide interaction (DPI), in which a flexible peptide segment on the surface of one protein is recognized and bound by a rigid peptide-recognition domain from another. Reliable modeling and accurate prediction of DPI binding affinities would help to ascertain the diverse biological events involved in CSNs and benefit our understanding of various biological implications underlying DPIs. Traditionally, peptide quantitative structure-activity relationship (pQSAR) has been widely used to model and predict the biological activity of oligopeptides, which employs amino acid descriptors (AADs) to characterize peptide structures at sequence level and then statistically correlate the resulting descriptor vector with observed activity data via regression. However, the QSAR has not yet been widely applied to treat the direct binding behavior of large-scale peptide ligands to their protein receptors. In this work, we attempted to clarify whether the pQSAR methodology can work effectively for modeling and predicting DPI affinities in a high-throughput manner? Over twenty thousand short linear motif (SLiM)-containing peptide segments involved in SH3, PDZ and 14-3-3 domain-medicated CSNs were compiled to define a comprehensive sequence-based data set of DPI affinities, which were represented by the Boehringer light units (BLUs) derived from previous arbitrary light intensity assays following SPOT peptide synthesis. Four sophisticated MLMs (MLMs) were then utilized to perform pQSAR modeling on the set described with different AADs to systematically create a variety of linear and nonlinear predictors, and then verified by rigorous statistical test. It is revealed that the genome-wide DPI events can only be modeled qualitatively or semiquantitatively with traditional pQSAR strategy due to the intrinsic disorder of peptide conformation and the potential interplay between different peptide residues. In addition, the arbitrary BLUs used to characterize DPI affinity values were measured via an indirect approach, which may not very reliable and may involve strong noise, thus leading to a considerable bias in the modeling. The R ( prd ) (2) = 0.7 can be considered as the upper limit of external generalization ability of the pQSAR methodology working on large-scale DPI affinity data. |
format | Online Article Text |
id | pubmed-8795790 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-87957902022-01-29 Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology? Liu, Qian Lin, Jing Wen, Li Wang, Shaozhou Zhou, Peng Mei, Li Shang, Shuyong Front Genet Genetics The protein–protein association in cellular signaling networks (CSNs) often acts as weak, transient, and reversible domain–peptide interaction (DPI), in which a flexible peptide segment on the surface of one protein is recognized and bound by a rigid peptide-recognition domain from another. Reliable modeling and accurate prediction of DPI binding affinities would help to ascertain the diverse biological events involved in CSNs and benefit our understanding of various biological implications underlying DPIs. Traditionally, peptide quantitative structure-activity relationship (pQSAR) has been widely used to model and predict the biological activity of oligopeptides, which employs amino acid descriptors (AADs) to characterize peptide structures at sequence level and then statistically correlate the resulting descriptor vector with observed activity data via regression. However, the QSAR has not yet been widely applied to treat the direct binding behavior of large-scale peptide ligands to their protein receptors. In this work, we attempted to clarify whether the pQSAR methodology can work effectively for modeling and predicting DPI affinities in a high-throughput manner? Over twenty thousand short linear motif (SLiM)-containing peptide segments involved in SH3, PDZ and 14-3-3 domain-medicated CSNs were compiled to define a comprehensive sequence-based data set of DPI affinities, which were represented by the Boehringer light units (BLUs) derived from previous arbitrary light intensity assays following SPOT peptide synthesis. Four sophisticated MLMs (MLMs) were then utilized to perform pQSAR modeling on the set described with different AADs to systematically create a variety of linear and nonlinear predictors, and then verified by rigorous statistical test. It is revealed that the genome-wide DPI events can only be modeled qualitatively or semiquantitatively with traditional pQSAR strategy due to the intrinsic disorder of peptide conformation and the potential interplay between different peptide residues. In addition, the arbitrary BLUs used to characterize DPI affinity values were measured via an indirect approach, which may not very reliable and may involve strong noise, thus leading to a considerable bias in the modeling. The R ( prd ) (2) = 0.7 can be considered as the upper limit of external generalization ability of the pQSAR methodology working on large-scale DPI affinity data. Frontiers Media S.A. 2022-01-14 /pmc/articles/PMC8795790/ /pubmed/35096016 http://dx.doi.org/10.3389/fgene.2021.800857 Text en Copyright © 2022 Liu, Lin, Wen, Wang, Zhou, Mei and Shang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Liu, Qian Lin, Jing Wen, Li Wang, Shaozhou Zhou, Peng Mei, Li Shang, Shuyong Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology? |
title | Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology? |
title_full | Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology? |
title_fullStr | Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology? |
title_full_unstemmed | Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology? |
title_short | Systematic Modeling, Prediction, and Comparison of Domain–Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology? |
title_sort | systematic modeling, prediction, and comparison of domain–peptide affinities: does it work effectively with the peptide qsar methodology? |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8795790/ https://www.ncbi.nlm.nih.gov/pubmed/35096016 http://dx.doi.org/10.3389/fgene.2021.800857 |
work_keys_str_mv | AT liuqian systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology AT linjing systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology AT wenli systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology AT wangshaozhou systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology AT zhoupeng systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology AT meili systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology AT shangshuyong systematicmodelingpredictionandcomparisonofdomainpeptideaffinitiesdoesitworkeffectivelywiththepeptideqsarmethodology |