Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays

The quality of diverse compound selection depends mainly on the clustering algorithm, the descriptors, the way the descriptors are combined, and the similarity metric. The Jarvis-Patrick algorithm is a well-accepted clustering algorithm, and MDL search keys and Daylight fingerprints are well-accepted structure descriptors, for compound libr...

Bibliographic Details
Main authors: Gu, Qiong; Xu, Jun; Gu, Lianquan
Format: Online Article Text
Language: English
Published: MDPI 2010
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6257665/
https://www.ncbi.nlm.nih.gov/pubmed/20657406
http://dx.doi.org/10.3390/molecules15075031
_version_ 1783374366064508928
author Gu, Qiong
Xu, Jun
Gu, Lianquan
author_facet Gu, Qiong
Xu, Jun
Gu, Lianquan
author_sort Gu, Qiong
collection PubMed
description The quality of diverse compound selection depends mainly on the clustering algorithm, the descriptors, the way the descriptors are combined, and the similarity metric. The Jarvis-Patrick algorithm is a well-accepted clustering algorithm, and MDL search keys and Daylight fingerprints are well-accepted structure descriptors, for compound library diversity analysis. Based on our 288 experiments on selecting compounds with various descriptor combinations, we have found that: (1) hybrid Daylight and MDL structural descriptors can produce worse results in diversity analyses; (2) selections based purely on 2,048-bit Daylight fingerprints yield better results than those based purely on 166-bit MDL search keys; (3) when Daylight fingerprints and MDL search keys are combined, it is better to compute the two similarities independently and then take the smaller value as the outcome, which yields better average separation of clusters; (4) regarding the consistency of different clustering approaches, Daylight fingerprint-based clustering is more consistent with the SCA approach than with the MDL search key-based approach; (5) the MDL search key-based selection approach tends to select a greater number of compounds from larger clusters. When the Daylight fingerprint is folded two and three times, respectively, information is lost, and selection likewise tends to draw a greater number of compounds from larger clusters. To our knowledge, these results have not been reported before.
format Online
Article
Text
id pubmed-6257665
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-6257665 2018-12-06 Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays Gu, Qiong Xu, Jun Gu, Lianquan Molecules Article The quality of diverse compound selection depends mainly on the clustering algorithm, the descriptors, the way the descriptors are combined, and the similarity metric. The Jarvis-Patrick algorithm is a well-accepted clustering algorithm, and MDL search keys and Daylight fingerprints are well-accepted structure descriptors, for compound library diversity analysis. Based on our 288 experiments on selecting compounds with various descriptor combinations, we have found that: (1) hybrid Daylight and MDL structural descriptors can produce worse results in diversity analyses; (2) selections based purely on 2,048-bit Daylight fingerprints yield better results than those based purely on 166-bit MDL search keys; (3) when Daylight fingerprints and MDL search keys are combined, it is better to compute the two similarities independently and then take the smaller value as the outcome, which yields better average separation of clusters; (4) regarding the consistency of different clustering approaches, Daylight fingerprint-based clustering is more consistent with the SCA approach than with the MDL search key-based approach; (5) the MDL search key-based selection approach tends to select a greater number of compounds from larger clusters. When the Daylight fingerprint is folded two and three times, respectively, information is lost, and selection likewise tends to draw a greater number of compounds from larger clusters. To our knowledge, these results have not been reported before. MDPI 2010-07-23 /pmc/articles/PMC6257665/ /pubmed/20657406 http://dx.doi.org/10.3390/molecules15075031 Text en © 2010 by the authors; http://creativecommons.org/licenses/by/3.0/ licensee MDPI, Basel, Switzerland. 
This article is an Open Access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Gu, Qiong
Xu, Jun
Gu, Lianquan
Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays
title Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays
title_sort selecting diversified compounds to build a tangible library for biological and biochemical assays
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6257665/
https://www.ncbi.nlm.nih.gov/pubmed/20657406
http://dx.doi.org/10.3390/molecules15075031
work_keys_str_mv AT guqiong selectingdiversifiedcompoundstobuildatangiblelibraryforbiologicalandbiochemicalassays
AT xujun selectingdiversifiedcompoundstobuildatangiblelibraryforbiologicalandbiochemicalassays
AT gulianquan selectingdiversifiedcompoundstobuildatangiblelibraryforbiologicalandbiochemicalassays
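
Illustrative note on finding (3) and the folding remark in the description above. The following is a minimal sketch, not the authors' implementation. It assumes the Tanimoto coefficient as the similarity metric (the record does not name one), models each fingerprint as the set of its "on" bit positions, and assumes folding means OR-ing the upper half of a fingerprint onto its lower half. The function names and the toy bit sets are hypothetical and were invented for this example.

```python
from typing import FrozenSet

# A fingerprint is modeled as the set of indices of its "on" bits.
Bits = FrozenSet[int]


def tanimoto(a: Bits, b: Bits) -> float:
    """Tanimoto coefficient (assumed metric; the record does not name one)."""
    union = len(a | b)
    # Two empty fingerprints are treated as identical by convention here.
    return len(a & b) / union if union else 1.0


def combined_similarity(fp_a: Bits, keys_a: Bits, fp_b: Bits, keys_b: Bits) -> float:
    """Score each descriptor independently, then keep the smaller value."""
    return min(tanimoto(fp_a, fp_b), tanimoto(keys_a, keys_b))


def fold(bits: Bits, length: int) -> Bits:
    """Fold a fingerprint of `length` bits once by OR-ing its upper half
    onto its lower half (assumed folding scheme)."""
    half = length // 2
    return frozenset(i % half for i in bits)


# Toy data, invented for illustration only.
fp_a: Bits = frozenset({1, 5, 1030})    # stands in for a 2,048-bit fingerprint
fp_b: Bits = frozenset({1, 6, 1029})
keys_a: Bits = frozenset({10, 20, 30})  # stands in for 166-bit search keys
keys_b: Bits = frozenset({10, 20, 40})

fp_sim = tanimoto(fp_a, fp_b)           # 1 shared bit / 5 distinct bits = 0.2
key_sim = tanimoto(keys_a, keys_b)      # 2 shared bits / 4 distinct bits = 0.5
combined = combined_similarity(fp_a, keys_a, fp_b, keys_b)  # min(0.2, 0.5)

# After one fold to 1,024 bits, bits 1030 and 1029 collide with 6 and 5,
# so the two folded fingerprints can no longer be told apart.
folded_sim = tanimoto(fold(fp_a, 2048), fold(fp_b, 2048))   # 1.0
```

Taking the minimum means a pair of compounds counts as similar only when both descriptors agree that it is. The folded example shows one way information can be lost: distinct bits collide after folding, so two different fingerprints score as identical.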