Cargando…
Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning
Protein–protein interactions dominate all major biological processes in living cells. We have developed a new Monte Carlo-based simulation algorithm to study the kinetic process of protein association. We tested our method on a previously used large benchmark set of 49 protein complexes. The predict...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5394550/ https://www.ncbi.nlm.nih.gov/pubmed/28418043 http://dx.doi.org/10.1038/srep46622 |
_version_ | 1783229750968320000 |
---|---|
author | Xie, Zhong-Ru Chen, Jiawen Wu, Yinghao |
author_facet | Xie, Zhong-Ru Chen, Jiawen Wu, Yinghao |
author_sort | Xie, Zhong-Ru |
collection | PubMed |
description | Protein–protein interactions dominate all major biological processes in living cells. We have developed a new Monte Carlo-based simulation algorithm to study the kinetic process of protein association. We tested our method on a previously used large benchmark set of 49 protein complexes. The predicted rate was overestimated in the benchmark test compared to the experimental results for a group of protein complexes. We hypothesized that this resulted from molecular flexibility at the interface regions of the interacting proteins. After applying a machine learning algorithm with input variables that accounted for both the conformational flexibility and the energetic factor of binding, we successfully identified most of the protein complexes with overestimated association rates and improved our final prediction by using a cross-validation test. This method was then applied to a new independent test set and resulted in a similar prediction accuracy to that obtained using the training set. It has been thought that diffusion-limited protein association is dominated by long-range interactions. Our results provide strong evidence that the conformational flexibility also plays an important role in regulating protein association. Our studies provide new insights into the mechanism of protein association and offer a computationally efficient tool for predicting its rate. |
format | Online Article Text |
id | pubmed-5394550 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53945502017-04-20 Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning Xie, Zhong-Ru Chen, Jiawen Wu, Yinghao Sci Rep Article Protein–protein interactions dominate all major biological processes in living cells. We have developed a new Monte Carlo-based simulation algorithm to study the kinetic process of protein association. We tested our method on a previously used large benchmark set of 49 protein complexes. The predicted rate was overestimated in the benchmark test compared to the experimental results for a group of protein complexes. We hypothesized that this resulted from molecular flexibility at the interface regions of the interacting proteins. After applying a machine learning algorithm with input variables that accounted for both the conformational flexibility and the energetic factor of binding, we successfully identified most of the protein complexes with overestimated association rates and improved our final prediction by using a cross-validation test. This method was then applied to a new independent test set and resulted in a similar prediction accuracy to that obtained using the training set. It has been thought that diffusion-limited protein association is dominated by long-range interactions. Our results provide strong evidence that the conformational flexibility also plays an important role in regulating protein association. Our studies provide new insights into the mechanism of protein association and offer a computationally efficient tool for predicting its rate. Nature Publishing Group 2017-04-18 /pmc/articles/PMC5394550/ /pubmed/28418043 http://dx.doi.org/10.1038/srep46622 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Xie, Zhong-Ru Chen, Jiawen Wu, Yinghao Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning |
title | Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning |
title_full | Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning |
title_fullStr | Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning |
title_full_unstemmed | Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning |
title_short | Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning |
title_sort | predicting protein–protein association rates using coarse-grained simulation and machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5394550/ https://www.ncbi.nlm.nih.gov/pubmed/28418043 http://dx.doi.org/10.1038/srep46622 |
work_keys_str_mv | AT xiezhongru predictingproteinproteinassociationratesusingcoarsegrainedsimulationandmachinelearning AT chenjiawen predictingproteinproteinassociationratesusingcoarsegrainedsimulationandmachinelearning AT wuyinghao predictingproteinproteinassociationratesusingcoarsegrainedsimulationandmachinelearning |