Cargando…

Distribution of Bound Conformations in Conformational Ensembles for X-ray Ligands Predicted by the ANI-2X Machine Learning Potential

[Image: see text] In this study, we systematically studied the energy distribution of bioactive conformations of small molecular ligands in their conformational ensembles using ANI-2X, a machine learning potential, in conjunction with one of our recently developed geometry optimization algorithms, k...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Fengyang, Hao, Dongxiao, He, Xibing, Wang, Luxuan, Niu, Taoyu, Wang, Junmei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10647024/
https://www.ncbi.nlm.nih.gov/pubmed/37899502
http://dx.doi.org/10.1021/acs.jcim.3c01350
_version_ 1785147483237646336
author Han, Fengyang
Hao, Dongxiao
He, Xibing
Wang, Luxuan
Niu, Taoyu
Wang, Junmei
author_facet Han, Fengyang
Hao, Dongxiao
He, Xibing
Wang, Luxuan
Niu, Taoyu
Wang, Junmei
author_sort Han, Fengyang
collection PubMed
description [Image: see text] In this study, we systematically studied the energy distribution of bioactive conformations of small molecular ligands in their conformational ensembles using ANI-2X, a machine learning potential, in conjunction with one of our recently developed geometry optimization algorithms, known as a conjugate gradient with backtracking line search (CG-BS). We first evaluated the combination of these methods (ANI-2X/CG-BS) using two molecule sets. For the 231-molecule set, ab initio calculations were performed at both the ωB97X/6-31G(d) and B3LYP-D3BJ/DZVP levels for accuracy comparison, while for the 8,992-molecule set, ab initio calculations were carried out at the B3LYP-D3BJ/DZVP level. For each molecule in the two molecular sets, up to 10 conformations were generated, which diminish the influence of individual outliers on the performance evaluation. Encouraged by the performance of ANI-2x/CG-BS in these evaluations, we calculated the energy distributions using ANI-2x/CG-BS for more than 27,000 ligands in the protein data bank (PDB). Each ligand has at least one conformation bound to a biological molecule, and this ligand conformation is labeled as a bound conformation. Besides the bound conformations, up to 200 conformations were generated using OpenEye’s Omega2 software (https://docs.eyesopen.com/applications/ omega/) for each conformation. We performed a statistical analysis of how the bound conformation energies are distributed in the ensembles for 17,197 PDB ligands that have their bound conformation energies within the energy ranges of the Omega2-generated conformation ensembles. We found that half of the ligands have their relative conformation energy lower than 2.91 kcal/mol for the bound conformations in comparison with the global conformations, and about 90% of the bound conformations are within 10 kcal/mol above the global conformation energies. This information is useful to guide the construction of libraries for shape-based virtual screening and to improve the docking algorithm to efficiently sample bound conformations.
format Online
Article
Text
id pubmed-10647024
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-106470242023-11-15 Distribution of Bound Conformations in Conformational Ensembles for X-ray Ligands Predicted by the ANI-2X Machine Learning Potential Han, Fengyang Hao, Dongxiao He, Xibing Wang, Luxuan Niu, Taoyu Wang, Junmei J Chem Inf Model [Image: see text] In this study, we systematically studied the energy distribution of bioactive conformations of small molecular ligands in their conformational ensembles using ANI-2X, a machine learning potential, in conjunction with one of our recently developed geometry optimization algorithms, known as a conjugate gradient with backtracking line search (CG-BS). We first evaluated the combination of these methods (ANI-2X/CG-BS) using two molecule sets. For the 231-molecule set, ab initio calculations were performed at both the ωB97X/6-31G(d) and B3LYP-D3BJ/DZVP levels for accuracy comparison, while for the 8,992-molecule set, ab initio calculations were carried out at the B3LYP-D3BJ/DZVP level. For each molecule in the two molecular sets, up to 10 conformations were generated, which diminish the influence of individual outliers on the performance evaluation. Encouraged by the performance of ANI-2x/CG-BS in these evaluations, we calculated the energy distributions using ANI-2x/CG-BS for more than 27,000 ligands in the protein data bank (PDB). Each ligand has at least one conformation bound to a biological molecule, and this ligand conformation is labeled as a bound conformation. Besides the bound conformations, up to 200 conformations were generated using OpenEye’s Omega2 software (https://docs.eyesopen.com/applications/ omega/) for each conformation. We performed a statistical analysis of how the bound conformation energies are distributed in the ensembles for 17,197 PDB ligands that have their bound conformation energies within the energy ranges of the Omega2-generated conformation ensembles. We found that half of the ligands have their relative conformation energy lower than 2.91 kcal/mol for the bound conformations in comparison with the global conformations, and about 90% of the bound conformations are within 10 kcal/mol above the global conformation energies. This information is useful to guide the construction of libraries for shape-based virtual screening and to improve the docking algorithm to efficiently sample bound conformations. American Chemical Society 2023-10-30 /pmc/articles/PMC10647024/ /pubmed/37899502 http://dx.doi.org/10.1021/acs.jcim.3c01350 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Han, Fengyang
Hao, Dongxiao
He, Xibing
Wang, Luxuan
Niu, Taoyu
Wang, Junmei
Distribution of Bound Conformations in Conformational Ensembles for X-ray Ligands Predicted by the ANI-2X Machine Learning Potential
title Distribution of Bound Conformations in Conformational Ensembles for X-ray Ligands Predicted by the ANI-2X Machine Learning Potential
title_full Distribution of Bound Conformations in Conformational Ensembles for X-ray Ligands Predicted by the ANI-2X Machine Learning Potential
title_fullStr Distribution of Bound Conformations in Conformational Ensembles for X-ray Ligands Predicted by the ANI-2X Machine Learning Potential
title_full_unstemmed Distribution of Bound Conformations in Conformational Ensembles for X-ray Ligands Predicted by the ANI-2X Machine Learning Potential
title_short Distribution of Bound Conformations in Conformational Ensembles for X-ray Ligands Predicted by the ANI-2X Machine Learning Potential
title_sort distribution of bound conformations in conformational ensembles for x-ray ligands predicted by the ani-2x machine learning potential
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10647024/
https://www.ncbi.nlm.nih.gov/pubmed/37899502
http://dx.doi.org/10.1021/acs.jcim.3c01350
work_keys_str_mv AT hanfengyang distributionofboundconformationsinconformationalensemblesforxrayligandspredictedbytheani2xmachinelearningpotential
AT haodongxiao distributionofboundconformationsinconformationalensemblesforxrayligandspredictedbytheani2xmachinelearningpotential
AT hexibing distributionofboundconformationsinconformationalensemblesforxrayligandspredictedbytheani2xmachinelearningpotential
AT wangluxuan distributionofboundconformationsinconformationalensemblesforxrayligandspredictedbytheani2xmachinelearningpotential
AT niutaoyu distributionofboundconformationsinconformationalensemblesforxrayligandspredictedbytheani2xmachinelearningpotential
AT wangjunmei distributionofboundconformationsinconformationalensemblesforxrayligandspredictedbytheani2xmachinelearningpotential