Cargando…

iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations

Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning‐based methods are gaining increasing momentum in this field. Due to the limited number of experimental data, using a small number of sensitive predictive features i...

Descripción completa

Detalles Bibliográficos
Autores principales: Geng, Cunliang, Vangone, Anna, Folkers, Gert E., Xue, Li C., Bonvin, Alexandre M. J. J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6587874/
https://www.ncbi.nlm.nih.gov/pubmed/30417935
http://dx.doi.org/10.1002/prot.25630
_version_ 1783429159707475968
author Geng, Cunliang
Vangone, Anna
Folkers, Gert E.
Xue, Li C.
Bonvin, Alexandre M. J. J.
author_facet Geng, Cunliang
Vangone, Anna
Folkers, Gert E.
Xue, Li C.
Bonvin, Alexandre M. J. J.
author_sort Geng, Cunliang
collection PubMed
description Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning‐based methods are gaining increasing momentum in this field. Due to the limited number of experimental data, using a small number of sensitive predictive features is vital to the generalization and robustness of such machine learning methods. Here we introduce a fast and reliable predictor of binding affinity changes upon single point mutation, based on a random forest approach. Our method, iSEE, uses a limited number of interface Structure, Evolution, and Energy‐based features for the prediction. iSEE achieves, using only 31 features, a high prediction performance with a Pearson correlation coefficient (PCC) of 0.80 and a root mean square error of 1.41 kcal/mol on a diverse training dataset consisting of 1102 mutations in 57 protein‐protein complexes. It competes with existing state‐of‐the‐art methods on two blind test datasets. Predictions for a new dataset of 487 mutations in 56 protein complexes from the recently published SKEMPI 2.0 database reveals that none of the current methods perform well (PCC < 0.42), although their combination does improve the predictions. Feature analysis for iSEE underlines the significance of evolutionary conservations for quantitative prediction of mutation effects. As an application example, we perform a full mutation scanning of the interface residues in the MDM2–p53 complex.
format Online
Article
Text
id pubmed-6587874
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-65878742019-07-02 iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations Geng, Cunliang Vangone, Anna Folkers, Gert E. Xue, Li C. Bonvin, Alexandre M. J. J. Proteins Research Articles Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning‐based methods are gaining increasing momentum in this field. Due to the limited number of experimental data, using a small number of sensitive predictive features is vital to the generalization and robustness of such machine learning methods. Here we introduce a fast and reliable predictor of binding affinity changes upon single point mutation, based on a random forest approach. Our method, iSEE, uses a limited number of interface Structure, Evolution, and Energy‐based features for the prediction. iSEE achieves, using only 31 features, a high prediction performance with a Pearson correlation coefficient (PCC) of 0.80 and a root mean square error of 1.41 kcal/mol on a diverse training dataset consisting of 1102 mutations in 57 protein‐protein complexes. It competes with existing state‐of‐the‐art methods on two blind test datasets. Predictions for a new dataset of 487 mutations in 56 protein complexes from the recently published SKEMPI 2.0 database reveals that none of the current methods perform well (PCC < 0.42), although their combination does improve the predictions. Feature analysis for iSEE underlines the significance of evolutionary conservations for quantitative prediction of mutation effects. As an application example, we perform a full mutation scanning of the interface residues in the MDM2–p53 complex. John Wiley & Sons, Inc. 2018-12-03 2019-02 /pmc/articles/PMC6587874/ /pubmed/30417935 http://dx.doi.org/10.1002/prot.25630 Text en © 2018 The Authors. Proteins: Structure, Function, and Bioinformatics published by Wiley Periodicals, Inc. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Research Articles
Geng, Cunliang
Vangone, Anna
Folkers, Gert E.
Xue, Li C.
Bonvin, Alexandre M. J. J.
iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations
title iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations
title_full iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations
title_fullStr iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations
title_full_unstemmed iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations
title_short iSEE: Interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations
title_sort isee: interface structure, evolution, and energy‐based machine learning predictor of binding affinity changes upon mutations
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6587874/
https://www.ncbi.nlm.nih.gov/pubmed/30417935
http://dx.doi.org/10.1002/prot.25630
work_keys_str_mv AT gengcunliang iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations
AT vangoneanna iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations
AT folkersgerte iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations
AT xuelic iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations
AT bonvinalexandremjj iseeinterfacestructureevolutionandenergybasedmachinelearningpredictorofbindingaffinitychangesuponmutations