Cargando…
iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations
Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning‐based methods are gaining increasing momentum in this field. Due to the limited number of experimental data, using a small number of sensitive predictive features i...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons, Inc.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6587874/ https://www.ncbi.nlm.nih.gov/pubmed/30417935 http://dx.doi.org/10.1002/prot.25630 |
_version_ | 1783429159707475968 |
---|---|
author | Geng, Cunliang Vangone, Anna Folkers, Gert E. Xue, Li C. Bonvin, Alexandre M. J. J. |
author_facet | Geng, Cunliang Vangone, Anna Folkers, Gert E. Xue, Li C. Bonvin, Alexandre M. J. J. |
author_sort | Geng, Cunliang |
collection | PubMed |
description | Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning‐based methods are gaining increasing momentum in this field. Due to the limited number of experimental data, using a small number of sensitive predictive features is vital to the generalization and robustness of such machine learning methods. Here we introduce a fast and reliable predictor of binding affinity changes upon single point mutation, based on a random forest approach. Our method, iSEE, uses a limited number of interface Structure, Evolution, and Energy‐based features for the prediction. iSEE achieves, using only 31 features, a high prediction performance with a Pearson correlation coefficient (PCC) of 0.80 and a root mean square error of 1.41 kcal/mol on a diverse training dataset consisting of 1102 mutations in 57 protein‐protein complexes. It competes with existing state‐of‐the‐art methods on two blind test datasets. Predictions for a new dataset of 487 mutations in 56 protein complexes from the recently published SKEMPI 2.0 database reveals that none of the current methods perform well (PCC < 0.42), although their combination does improve the predictions. Feature analysis for iSEE underlines the significance of evolutionary conservations for quantitative prediction of mutation effects. As an application example, we perform a full mutation scanning of the interface residues in the MDM2–p53 complex. |
format | Online Article Text |
id | pubmed-6587874 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-65878742019-07-02 iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations Geng, Cunliang Vangone, Anna Folkers, Gert E. Xue, Li C. Bonvin, Alexandre M. J. J. Proteins Research Articles Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning‐based methods are gaining increasing momentum in this field. Due to the limited number of experimental data, using a small number of sensitive predictive features is vital to the generalization and robustness of such machine learning methods. Here we introduce a fast and reliable predictor of binding affinity changes upon single point mutation, based on a random forest approach. Our method, iSEE, uses a limited number of interface Structure, Evolution, and Energy‐based features for the prediction. iSEE achieves, using only 31 features, a high prediction performance with a Pearson correlation coefficient (PCC) of 0.80 and a root mean square error of 1.41 kcal/mol on a diverse training dataset consisting of 1102 mutations in 57 protein‐protein complexes. It competes with existing state‐of‐the‐art methods on two blind test datasets. Predictions for a new dataset of 487 mutations in 56 protein complexes from the recently published SKEMPI 2.0 database reveals that none of the current methods perform well (PCC < 0.42), although their combination does improve the predictions. Feature analysis for iSEE underlines the significance of evolutionary conservations for quantitative prediction of mutation effects. As an application example, we perform a full mutation scanning of the interface residues in the MDM2–p53 complex. John Wiley & Sons, Inc. 2018-12-03 2019-02 /pmc/articles/PMC6587874/ /pubmed/30417935 http://dx.doi.org/10.1002/prot.25630 Text en © 2018 The Authors. Proteins: Structure, Function, and Bioinformatics published by Wiley Periodicals, Inc. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. |
spellingShingle | Research Articles Geng, Cunliang Vangone, Anna Folkers, Gert E. Xue, Li C. Bonvin, Alexandre M. J. J. iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations |
title | iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations |
title_full | iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations |
title_fullStr | iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations |
title_full_unstemmed | iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations |
title_short | iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations |
title_sort | isee: interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6587874/ https://www.ncbi.nlm.nih.gov/pubmed/30417935 http://dx.doi.org/10.1002/prot.25630 |
work_keys_str_mv | AT gengcunliang iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations AT vangoneanna iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations AT folkersgerte iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations AT xuelic iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations AT bonvinalexandremjj iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations |