Cargando…

Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning

Protein–protein interactions dominate all major biological processes in living cells. We have developed a new Monte Carlo-based simulation algorithm to study the kinetic process of protein association. We tested our method on a previously used large benchmark set of 49 protein complexes. The predict...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Zhong-Ru, Chen, Jiawen, Wu, Yinghao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5394550/
https://www.ncbi.nlm.nih.gov/pubmed/28418043
http://dx.doi.org/10.1038/srep46622
_version_ 1783229750968320000
author Xie, Zhong-Ru
Chen, Jiawen
Wu, Yinghao
author_facet Xie, Zhong-Ru
Chen, Jiawen
Wu, Yinghao
author_sort Xie, Zhong-Ru
collection PubMed
description Protein–protein interactions dominate all major biological processes in living cells. We have developed a new Monte Carlo-based simulation algorithm to study the kinetic process of protein association. We tested our method on a previously used large benchmark set of 49 protein complexes. The predicted rate was overestimated in the benchmark test compared to the experimental results for a group of protein complexes. We hypothesized that this resulted from molecular flexibility at the interface regions of the interacting proteins. After applying a machine learning algorithm with input variables that accounted for both the conformational flexibility and the energetic factor of binding, we successfully identified most of the protein complexes with overestimated association rates and improved our final prediction by using a cross-validation test. This method was then applied to a new independent test set and resulted in a similar prediction accuracy to that obtained using the training set. It has been thought that diffusion-limited protein association is dominated by long-range interactions. Our results provide strong evidence that the conformational flexibility also plays an important role in regulating protein association. Our studies provide new insights into the mechanism of protein association and offer a computationally efficient tool for predicting its rate.
format Online
Article
Text
id pubmed-5394550
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53945502017-04-20 Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning Xie, Zhong-Ru Chen, Jiawen Wu, Yinghao Sci Rep Article Protein–protein interactions dominate all major biological processes in living cells. We have developed a new Monte Carlo-based simulation algorithm to study the kinetic process of protein association. We tested our method on a previously used large benchmark set of 49 protein complexes. The predicted rate was overestimated in the benchmark test compared to the experimental results for a group of protein complexes. We hypothesized that this resulted from molecular flexibility at the interface regions of the interacting proteins. After applying a machine learning algorithm with input variables that accounted for both the conformational flexibility and the energetic factor of binding, we successfully identified most of the protein complexes with overestimated association rates and improved our final prediction by using a cross-validation test. This method was then applied to a new independent test set and resulted in a similar prediction accuracy to that obtained using the training set. It has been thought that diffusion-limited protein association is dominated by long-range interactions. Our results provide strong evidence that the conformational flexibility also plays an important role in regulating protein association. Our studies provide new insights into the mechanism of protein association and offer a computationally efficient tool for predicting its rate. Nature Publishing Group 2017-04-18 /pmc/articles/PMC5394550/ /pubmed/28418043 http://dx.doi.org/10.1038/srep46622 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Xie, Zhong-Ru
Chen, Jiawen
Wu, Yinghao
Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning
title Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning
title_full Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning
title_fullStr Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning
title_full_unstemmed Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning
title_short Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning
title_sort predicting protein–protein association rates using coarse-grained simulation and machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5394550/
https://www.ncbi.nlm.nih.gov/pubmed/28418043
http://dx.doi.org/10.1038/srep46622
work_keys_str_mv AT xiezhongru predictingproteinproteinassociationratesusingcoarsegrainedsimulationandmachinelearning
AT chenjiawen predictingproteinproteinassociationratesusingcoarsegrainedsimulationandmachinelearning
AT wuyinghao predictingproteinproteinassociationratesusingcoarsegrainedsimulationandmachinelearning