Cargando…
Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays
The quality of diverse compound selection mainly depends on cluster algorithms, descriptors, the combinations of the descriptors, and similarity metrics. The Jarvis-Patrick algorithm, MDL search keys, and Daylight fingerprints are a well accepted algorithm and structure descriptors for compound libr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6257665/ https://www.ncbi.nlm.nih.gov/pubmed/20657406 http://dx.doi.org/10.3390/molecules15075031 |
_version_ | 1783374366064508928 |
---|---|
author | Gu, Qiong Xu, Jun Gu, Lianquan |
author_facet | Gu, Qiong Xu, Jun Gu, Lianquan |
author_sort | Gu, Qiong |
collection | PubMed |
description | The quality of diverse compound selection mainly depends on cluster algorithms, descriptors, the combinations of the descriptors, and similarity metrics. The Jarvis-Patrick algorithm, MDL search keys, and Daylight fingerprints are a well accepted algorithm and structure descriptors for compound library diversity analysis. Based upon our 288 experiments on selecting compounds from various descriptor combinations, we have found (1) hybrid Daylight and MDL structural descriptors for diversity analyses can produce worse results; (2) selections based purely on 2,048-bit Daylight fingerprints yield better results than the ones based purely on MDL 166-bit search keys; (3) when Daylight fingerprints and MDL search keys are combined, it is better to compute the similarities independently, then to take the smaller value for the outcome. This will yield better average separation of clusters; (4) regarding the consistency of different clustering approaches, the Daylight fingerprints based clustering is more consistent with the SCA approach than it does with the MDL search keys based approach; (5) The MDL search keys based selection approach tends to select a greater number of compounds from larger clusters. As the Daylight fingerprint is folded two and three times, respectively, information is lost, and this approach tends to select a greater number of compounds from larger clusters as well. These results have not been reported before to our knowledge. |
format | Online Article Text |
id | pubmed-6257665 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-62576652018-12-06 Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays Gu, Qiong Xu, Jun Gu, Lianquan Molecules Article The quality of diverse compound selection mainly depends on cluster algorithms, descriptors, the combinations of the descriptors, and similarity metrics. The Jarvis-Patrick algorithm, MDL search keys, and Daylight fingerprints are a well accepted algorithm and structure descriptors for compound library diversity analysis. Based upon our 288 experiments on selecting compounds from various descriptor combinations, we have found (1) hybrid Daylight and MDL structural descriptors for diversity analyses can produce worse results; (2) selections based purely on 2,048-bit Daylight fingerprints yield better results than the ones based purely on MDL 166-bit search keys; (3) when Daylight fingerprints and MDL search keys are combined, it is better to compute the similarities independently, then to take the smaller value for the outcome. This will yield better average separation of clusters; (4) regarding the consistency of different clustering approaches, the Daylight fingerprints based clustering is more consistent with the SCA approach than it does with the MDL search keys based approach; (5) The MDL search keys based selection approach tends to select a greater number of compounds from larger clusters. As the Daylight fingerprint is folded two and three times, respectively, information is lost, and this approach tends to select a greater number of compounds from larger clusters as well. These results have not been reported before to our knowledge. MDPI 2010-07-23 /pmc/articles/PMC6257665/ /pubmed/20657406 http://dx.doi.org/10.3390/molecules15075031 Text en © 2010 by the authors; http://creativecommons.org/licenses/by/3.0/ licensee MDPI, Basel, Switzerland. This article is an Open Access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/). |
spellingShingle | Article Gu, Qiong Xu, Jun Gu, Lianquan Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays |
title | Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays |
title_full | Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays |
title_fullStr | Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays |
title_full_unstemmed | Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays |
title_short | Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays |
title_sort | selecting diversified compounds to build a tangible library for biological and biochemical assays |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6257665/ https://www.ncbi.nlm.nih.gov/pubmed/20657406 http://dx.doi.org/10.3390/molecules15075031 |
work_keys_str_mv | AT guqiong selectingdiversifiedcompoundstobuildatangiblelibraryforbiologicalandbiochemicalassays AT xujun selectingdiversifiedcompoundstobuildatangiblelibraryforbiologicalandbiochemicalassays AT gulianquan selectingdiversifiedcompoundstobuildatangiblelibraryforbiologicalandbiochemicalassays |