Cargando…

Predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses

Protein mutations occur frequently in biological systems, which may impact, for example, the binding of drugs to their targets through impairing the critical H-bonds, changing the hydrophobic interactions, etc. Thus, accurately predicting the effects of mutations on biological systems is of great in...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Yang, Wang, Zhe, Wang, Lingling, Tian, Sheng, Hou, Tingjun, Sun, Huiyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9392442/
https://www.ncbi.nlm.nih.gov/pubmed/35987841
http://dx.doi.org/10.1186/s13321-022-00639-y
_version_ 1784771063650975744
author Yu, Yang
Wang, Zhe
Wang, Lingling
Tian, Sheng
Hou, Tingjun
Sun, Huiyong
author_facet Yu, Yang
Wang, Zhe
Wang, Lingling
Tian, Sheng
Hou, Tingjun
Sun, Huiyong
author_sort Yu, Yang
collection PubMed
description Protein mutations occur frequently in biological systems, which may impact, for example, the binding of drugs to their targets through impairing the critical H-bonds, changing the hydrophobic interactions, etc. Thus, accurately predicting the effects of mutations on biological systems is of great interests to various fields. Unfortunately, it is still unavailable to conduct large-scale wet-lab mutation experiments because of the unaffordable experimental time and financial costs. Alternatively, in silico computation can serve as a pioneer to guide the experiments. In fact, numerous pioneering works have been conducted from computationally cheaper machine-learning (ML) methods to the more expensive alchemical methods with the purpose to accurately predict the mutation effects. However, these methods usually either cannot result in a physically understandable model (ML-based methods) or work with huge computational resources (alchemical methods). Thus, compromised methods with good physical characteristics and high computational efficiency are expected. Therefore, here, we conducted a comprehensive investigation on the mutation issues of biological systems with the famous end-point binding free energy calculation methods represented by MM/GBSA and MM/PBSA. Different computational strategies considering different length of MD simulations, different value of dielectric constants and whether to incorporate entropy effects to the predicted total binding affinities were investigated to provide a more accurate way for predicting the energetic change upon protein mutations. Overall, our result shows that a relatively long MD simulation (e.g. 100 ns) benefits the prediction accuracy for both MM/GBSA and MM/PBSA (with the best Pearson correlation coefficient between the predicted ∆∆G and the experimental data of ~ 0.44 for a challenging dataset). Further analyses shows that systems involving large perturbations (e.g. multiple mutations and large number of atoms change in the mutation site) are much easier to be accurately predicted since the algorithm works more sensitively to the large change of the systems. Besides, system-specific investigation reveals that conformational adjustment is needed to refine the micro-environment of the manually mutated systems and thus lead one to understand why longer MD simulation is necessary to improve the predicting result. The proposed strategy is expected to be applied in large-scale mutation effects investigation with interpretation. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00639-y.
format Online
Article
Text
id pubmed-9392442
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-93924422022-08-22 Predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses Yu, Yang Wang, Zhe Wang, Lingling Tian, Sheng Hou, Tingjun Sun, Huiyong J Cheminform Research Protein mutations occur frequently in biological systems, which may impact, for example, the binding of drugs to their targets through impairing the critical H-bonds, changing the hydrophobic interactions, etc. Thus, accurately predicting the effects of mutations on biological systems is of great interests to various fields. Unfortunately, it is still unavailable to conduct large-scale wet-lab mutation experiments because of the unaffordable experimental time and financial costs. Alternatively, in silico computation can serve as a pioneer to guide the experiments. In fact, numerous pioneering works have been conducted from computationally cheaper machine-learning (ML) methods to the more expensive alchemical methods with the purpose to accurately predict the mutation effects. However, these methods usually either cannot result in a physically understandable model (ML-based methods) or work with huge computational resources (alchemical methods). Thus, compromised methods with good physical characteristics and high computational efficiency are expected. Therefore, here, we conducted a comprehensive investigation on the mutation issues of biological systems with the famous end-point binding free energy calculation methods represented by MM/GBSA and MM/PBSA. Different computational strategies considering different length of MD simulations, different value of dielectric constants and whether to incorporate entropy effects to the predicted total binding affinities were investigated to provide a more accurate way for predicting the energetic change upon protein mutations. Overall, our result shows that a relatively long MD simulation (e.g. 100 ns) benefits the prediction accuracy for both MM/GBSA and MM/PBSA (with the best Pearson correlation coefficient between the predicted ∆∆G and the experimental data of ~ 0.44 for a challenging dataset). Further analyses shows that systems involving large perturbations (e.g. multiple mutations and large number of atoms change in the mutation site) are much easier to be accurately predicted since the algorithm works more sensitively to the large change of the systems. Besides, system-specific investigation reveals that conformational adjustment is needed to refine the micro-environment of the manually mutated systems and thus lead one to understand why longer MD simulation is necessary to improve the predicting result. The proposed strategy is expected to be applied in large-scale mutation effects investigation with interpretation. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00639-y. Springer International Publishing 2022-08-20 /pmc/articles/PMC9392442/ /pubmed/35987841 http://dx.doi.org/10.1186/s13321-022-00639-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Yu, Yang
Wang, Zhe
Wang, Lingling
Tian, Sheng
Hou, Tingjun
Sun, Huiyong
Predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses
title Predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses
title_full Predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses
title_fullStr Predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses
title_full_unstemmed Predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses
title_short Predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses
title_sort predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9392442/
https://www.ncbi.nlm.nih.gov/pubmed/35987841
http://dx.doi.org/10.1186/s13321-022-00639-y
work_keys_str_mv AT yuyang predictingthemutationeffectsofproteinligandinteractionsviaendpointbindingfreeenergycalculationsstrategiesandanalyses
AT wangzhe predictingthemutationeffectsofproteinligandinteractionsviaendpointbindingfreeenergycalculationsstrategiesandanalyses
AT wanglingling predictingthemutationeffectsofproteinligandinteractionsviaendpointbindingfreeenergycalculationsstrategiesandanalyses
AT tiansheng predictingthemutationeffectsofproteinligandinteractionsviaendpointbindingfreeenergycalculationsstrategiesandanalyses
AT houtingjun predictingthemutationeffectsofproteinligandinteractionsviaendpointbindingfreeenergycalculationsstrategiesandanalyses
AT sunhuiyong predictingthemutationeffectsofproteinligandinteractionsviaendpointbindingfreeenergycalculationsstrategiesandanalyses