Cargando…
Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble
Representing the 3D structures of ligands in virtual screenings via multi-conformer ensembles can be computationally intensive, especially for compounds with a large number of rotatable bonds. Thus, reducing the size of multi-conformer databases and the number of query conformers, while simultaneous...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2901495/ https://www.ncbi.nlm.nih.gov/pubmed/20499135 http://dx.doi.org/10.1007/s10822-010-9365-1 |
_version_ | 1782183690335420416 |
---|---|
author | Yongye, Austin B. Bender, Andreas Martínez-Mayorga, Karina |
author_facet | Yongye, Austin B. Bender, Andreas Martínez-Mayorga, Karina |
author_sort | Yongye, Austin B. |
collection | PubMed |
description | Representing the 3D structures of ligands in virtual screenings via multi-conformer ensembles can be computationally intensive, especially for compounds with a large number of rotatable bonds. Thus, reducing the size of multi-conformer databases and the number of query conformers, while simultaneously reproducing the bioactive conformer with good accuracy, is of crucial interest. While clustering and RMSD filtering methods are employed in existing conformer generators, the novelty of this work is the inclusion of a clustering scheme (NMRCLUST) that does not require a user-defined cut-off value. This algorithm simultaneously optimizes the number and the average spread of the clusters. Here we describe and test four inter-dependent approaches for selecting computer-generated conformers, namely: OMEGA, NMRCLUST, RMS filtering and averaged-RMS filtering. The bioactive conformations of 65 selected ligands were extracted from the corresponding protein:ligand complexes from the Protein Data Bank, including eight ligands that adopted dissimilar bound conformations within different receptors. We show that NMRCLUST can be employed to further filter OMEGA-generated conformers while maintaining biological relevance of the ensemble. It was observed that NMRCLUST (containing on average 10 times fewer conformers per compound) performed nearly as well as OMEGA, and both outperformed RMS filtering and averaged-RMS filtering in terms of identifying the bioactive conformations with excellent and good matches (0.5 < RMSD < 1.0 Å). Furthermore, we propose thresholds for OMEGA root-mean square filtering depending on the number of rotors in a compound: 0.8, 1.0 and 1.4 for structures with low (1–4), medium (5–9) and high (10–15) numbers of rotatable bonds, respectively. The protocol employed is general and can be applied to reduce the number of conformers in multi-conformer compound collections and alleviate the complexity of downstream data processing in virtual screening experiments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10822-010-9365-1) contains supplementary material, which is available to authorized users. |
format | Text |
id | pubmed-2901495 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-29014952010-07-30 Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble Yongye, Austin B. Bender, Andreas Martínez-Mayorga, Karina J Comput Aided Mol Des Article Representing the 3D structures of ligands in virtual screenings via multi-conformer ensembles can be computationally intensive, especially for compounds with a large number of rotatable bonds. Thus, reducing the size of multi-conformer databases and the number of query conformers, while simultaneously reproducing the bioactive conformer with good accuracy, is of crucial interest. While clustering and RMSD filtering methods are employed in existing conformer generators, the novelty of this work is the inclusion of a clustering scheme (NMRCLUST) that does not require a user-defined cut-off value. This algorithm simultaneously optimizes the number and the average spread of the clusters. Here we describe and test four inter-dependent approaches for selecting computer-generated conformers, namely: OMEGA, NMRCLUST, RMS filtering and averaged-RMS filtering. The bioactive conformations of 65 selected ligands were extracted from the corresponding protein:ligand complexes from the Protein Data Bank, including eight ligands that adopted dissimilar bound conformations within different receptors. We show that NMRCLUST can be employed to further filter OMEGA-generated conformers while maintaining biological relevance of the ensemble. It was observed that NMRCLUST (containing on average 10 times fewer conformers per compound) performed nearly as well as OMEGA, and both outperformed RMS filtering and averaged-RMS filtering in terms of identifying the bioactive conformations with excellent and good matches (0.5 < RMSD < 1.0 Å). Furthermore, we propose thresholds for OMEGA root-mean square filtering depending on the number of rotors in a compound: 0.8, 1.0 and 1.4 for structures with low (1–4), medium (5–9) and high (10–15) numbers of rotatable bonds, respectively. The protocol employed is general and can be applied to reduce the number of conformers in multi-conformer compound collections and alleviate the complexity of downstream data processing in virtual screening experiments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10822-010-9365-1) contains supplementary material, which is available to authorized users. Springer Netherlands 2010-05-25 2010 /pmc/articles/PMC2901495/ /pubmed/20499135 http://dx.doi.org/10.1007/s10822-010-9365-1 Text en © The Author(s) 2010 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited. |
spellingShingle | Article Yongye, Austin B. Bender, Andreas Martínez-Mayorga, Karina Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble |
title | Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble |
title_full | Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble |
title_fullStr | Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble |
title_full_unstemmed | Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble |
title_short | Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble |
title_sort | dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2901495/ https://www.ncbi.nlm.nih.gov/pubmed/20499135 http://dx.doi.org/10.1007/s10822-010-9365-1 |
work_keys_str_mv | AT yongyeaustinb dynamicclusteringthresholdreducesconformerensemblesizewhilemaintainingabiologicallyrelevantensemble AT benderandreas dynamicclusteringthresholdreducesconformerensemblesizewhilemaintainingabiologicallyrelevantensemble AT martinezmayorgakarina dynamicclusteringthresholdreducesconformerensemblesizewhilemaintainingabiologicallyrelevantensemble |