Cargando…

Structural diversity of biologically interesting datasets: a scaffold analysis approach

BACKGROUND: The recent public availability of the human metabolome and natural product datasets has revitalized "metabolite-likeness" and "natural product-likeness" as a drug design concept to design lead libraries targeting specific pathways. Many reports have analyzed the physi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Khanna, Varun, Ranganathan, Shoba
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179739/ https://www.ncbi.nlm.nih.gov/pubmed/21824432 http://dx.doi.org/10.1186/1758-2946-3-30

_version_	1782212551381090304
author	Khanna, Varun Ranganathan, Shoba
author_facet	Khanna, Varun Ranganathan, Shoba
author_sort	Khanna, Varun
collection	PubMed
description	BACKGROUND: The recent public availability of the human metabolome and natural product datasets has revitalized "metabolite-likeness" and "natural product-likeness" as a drug design concept to design lead libraries targeting specific pathways. Many reports have analyzed the physicochemical property space of biologically important datasets, with only a few comprehensively characterizing the scaffold diversity in public datasets of biological interest. With large collections of high quality public data currently available, we carried out a comparative analysis of current day leads with other biologically relevant datasets. RESULTS: In this study, we note a two-fold enrichment of metabolite scaffolds in drug dataset (42%) as compared to currently used lead libraries (23%). We also note that only a small percentage (5%) of natural product scaffolds space is shared by the lead dataset. We have identified specific scaffolds that are present in metabolites and natural products, with close counterparts in the drugs, but are missing in the lead dataset. To determine the distribution of compounds in physicochemical property space we analyzed the molecular polar surface area, the molecular solubility, the number of rings and the number of rotatable bonds in addition to four well-known Lipinski properties. Here, we note that, with only few exceptions, most of the drugs follow Lipinski's rule. The average values of the molecular polar surface area and the molecular solubility in metabolites is the highest while the number of rings is the lowest. In addition, we note that natural products contain the maximum number of rings and the rotatable bonds than any other dataset under consideration. CONCLUSIONS: Currently used lead libraries make little use of the metabolites and natural products scaffold space. We believe that metabolites and natural products are recognized by at least one protein in the biosphere therefore, sampling the fragment and scaffold space of these compounds, along with the knowledge of distribution in physicochemical property space, can result in better lead libraries. Hence, we recommend the greater use of metabolites and natural products while designing lead libraries. Nevertheless, metabolites have a limited distribution in chemical space that limits the usage of metabolites in library design.
format	Online Article Text
id	pubmed-3179739
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31797392011-09-25 Structural diversity of biologically interesting datasets: a scaffold analysis approach Khanna, Varun Ranganathan, Shoba J Cheminform Research Article BACKGROUND: The recent public availability of the human metabolome and natural product datasets has revitalized "metabolite-likeness" and "natural product-likeness" as a drug design concept to design lead libraries targeting specific pathways. Many reports have analyzed the physicochemical property space of biologically important datasets, with only a few comprehensively characterizing the scaffold diversity in public datasets of biological interest. With large collections of high quality public data currently available, we carried out a comparative analysis of current day leads with other biologically relevant datasets. RESULTS: In this study, we note a two-fold enrichment of metabolite scaffolds in drug dataset (42%) as compared to currently used lead libraries (23%). We also note that only a small percentage (5%) of natural product scaffolds space is shared by the lead dataset. We have identified specific scaffolds that are present in metabolites and natural products, with close counterparts in the drugs, but are missing in the lead dataset. To determine the distribution of compounds in physicochemical property space we analyzed the molecular polar surface area, the molecular solubility, the number of rings and the number of rotatable bonds in addition to four well-known Lipinski properties. Here, we note that, with only few exceptions, most of the drugs follow Lipinski's rule. The average values of the molecular polar surface area and the molecular solubility in metabolites is the highest while the number of rings is the lowest. In addition, we note that natural products contain the maximum number of rings and the rotatable bonds than any other dataset under consideration. CONCLUSIONS: Currently used lead libraries make little use of the metabolites and natural products scaffold space. We believe that metabolites and natural products are recognized by at least one protein in the biosphere therefore, sampling the fragment and scaffold space of these compounds, along with the knowledge of distribution in physicochemical property space, can result in better lead libraries. Hence, we recommend the greater use of metabolites and natural products while designing lead libraries. Nevertheless, metabolites have a limited distribution in chemical space that limits the usage of metabolites in library design. BioMed Central 2011-08-08 /pmc/articles/PMC3179739/ /pubmed/21824432 http://dx.doi.org/10.1186/1758-2946-3-30 Text en Copyright ©2011 Khanna and Ranganathan; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Khanna, Varun Ranganathan, Shoba Structural diversity of biologically interesting datasets: a scaffold analysis approach
title	Structural diversity of biologically interesting datasets: a scaffold analysis approach
title_full	Structural diversity of biologically interesting datasets: a scaffold analysis approach
title_fullStr	Structural diversity of biologically interesting datasets: a scaffold analysis approach
title_full_unstemmed	Structural diversity of biologically interesting datasets: a scaffold analysis approach
title_short	Structural diversity of biologically interesting datasets: a scaffold analysis approach
title_sort	structural diversity of biologically interesting datasets: a scaffold analysis approach
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179739/ https://www.ncbi.nlm.nih.gov/pubmed/21824432 http://dx.doi.org/10.1186/1758-2946-3-30
work_keys_str_mv	AT khannavarun structuraldiversityofbiologicallyinterestingdatasetsascaffoldanalysisapproach AT ranganathanshoba structuraldiversityofbiologicallyinterestingdatasetsascaffoldanalysisapproach

Structural diversity of biologically interesting datasets: a scaffold analysis approach

Ejemplares similares