Cargando…

Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization

BACKGROUND: Visualization of large molecular datasets is a challenging yet important topic utilised in diverse fields of chemistry ranging from material engineering to drug design. Especially in drug design, modern methods of high-throughput screening generate large amounts of molecular data that ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Velkoborsky, Jakub, Hoksza, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5199768/
https://www.ncbi.nlm.nih.gov/pubmed/28090217
http://dx.doi.org/10.1186/s13321-016-0186-7
_version_ 1782489053562667008
author Velkoborsky, Jakub
Hoksza, David
author_facet Velkoborsky, Jakub
Hoksza, David
author_sort Velkoborsky, Jakub
collection PubMed
description BACKGROUND: Visualization of large molecular datasets is a challenging yet important topic utilised in diverse fields of chemistry ranging from material engineering to drug design. Especially in drug design, modern methods of high-throughput screening generate large amounts of molecular data that call for methods enabling their analysis. One such method is classification of compounds based on their molecular scaffolds, a concept widely used by medicinal chemists to group molecules of similar properties. This classification can then be utilized for intuitive visualization of compounds. RESULTS: In this paper, we propose a scaffold hierarchy as a result of large-scale analysis of the PubChem Compound database. The analysis not only provided insights into scaffold diversity of the PubChem Compound database, but also enables scaffold-based hierarchical visualization of user compound data sets on the background of empirical chemical space, as defined by the PubChem data, or on the background of any other user-defined data set. The visualization is performed by a web based client-server application called Scaffvis. It provides an interactive zoomable tree map visualization of data sets up to hundreds of thousands molecules. Scaffvis is free to use and its source codes have been published under an open source license. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-016-0186-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5199768
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-51997682017-01-13 Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization Velkoborsky, Jakub Hoksza, David J Cheminform Methodology BACKGROUND: Visualization of large molecular datasets is a challenging yet important topic utilised in diverse fields of chemistry ranging from material engineering to drug design. Especially in drug design, modern methods of high-throughput screening generate large amounts of molecular data that call for methods enabling their analysis. One such method is classification of compounds based on their molecular scaffolds, a concept widely used by medicinal chemists to group molecules of similar properties. This classification can then be utilized for intuitive visualization of compounds. RESULTS: In this paper, we propose a scaffold hierarchy as a result of large-scale analysis of the PubChem Compound database. The analysis not only provided insights into scaffold diversity of the PubChem Compound database, but also enables scaffold-based hierarchical visualization of user compound data sets on the background of empirical chemical space, as defined by the PubChem data, or on the background of any other user-defined data set. The visualization is performed by a web based client-server application called Scaffvis. It provides an interactive zoomable tree map visualization of data sets up to hundreds of thousands molecules. Scaffvis is free to use and its source codes have been published under an open source license. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-016-0186-7) contains supplementary material, which is available to authorized users. Springer International Publishing 2016-12-29 /pmc/articles/PMC5199768/ /pubmed/28090217 http://dx.doi.org/10.1186/s13321-016-0186-7 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Velkoborsky, Jakub
Hoksza, David
Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization
title Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization
title_full Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization
title_fullStr Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization
title_full_unstemmed Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization
title_short Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization
title_sort scaffold analysis of pubchem database as background for hierarchical scaffold-based visualization
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5199768/
https://www.ncbi.nlm.nih.gov/pubmed/28090217
http://dx.doi.org/10.1186/s13321-016-0186-7
work_keys_str_mv AT velkoborskyjakub scaffoldanalysisofpubchemdatabaseasbackgroundforhierarchicalscaffoldbasedvisualization
AT hokszadavid scaffoldanalysisofpubchemdatabaseasbackgroundforhierarchicalscaffoldbasedvisualization