Cargando…
Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization
BACKGROUND: Visualization of large molecular datasets is a challenging yet important topic utilised in diverse fields of chemistry ranging from material engineering to drug design. Especially in drug design, modern methods of high-throughput screening generate large amounts of molecular data that ca...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5199768/ https://www.ncbi.nlm.nih.gov/pubmed/28090217 http://dx.doi.org/10.1186/s13321-016-0186-7 |
_version_ | 1782489053562667008 |
---|---|
author | Velkoborsky, Jakub Hoksza, David |
author_facet | Velkoborsky, Jakub Hoksza, David |
author_sort | Velkoborsky, Jakub |
collection | PubMed |
description | BACKGROUND: Visualization of large molecular datasets is a challenging yet important topic utilised in diverse fields of chemistry ranging from material engineering to drug design. Especially in drug design, modern methods of high-throughput screening generate large amounts of molecular data that call for methods enabling their analysis. One such method is classification of compounds based on their molecular scaffolds, a concept widely used by medicinal chemists to group molecules of similar properties. This classification can then be utilized for intuitive visualization of compounds. RESULTS: In this paper, we propose a scaffold hierarchy as a result of large-scale analysis of the PubChem Compound database. The analysis not only provided insights into scaffold diversity of the PubChem Compound database, but also enables scaffold-based hierarchical visualization of user compound data sets on the background of empirical chemical space, as defined by the PubChem data, or on the background of any other user-defined data set. The visualization is performed by a web based client-server application called Scaffvis. It provides an interactive zoomable tree map visualization of data sets up to hundreds of thousands molecules. Scaffvis is free to use and its source codes have been published under an open source license. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-016-0186-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5199768 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-51997682017-01-13 Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization Velkoborsky, Jakub Hoksza, David J Cheminform Methodology BACKGROUND: Visualization of large molecular datasets is a challenging yet important topic utilised in diverse fields of chemistry ranging from material engineering to drug design. Especially in drug design, modern methods of high-throughput screening generate large amounts of molecular data that call for methods enabling their analysis. One such method is classification of compounds based on their molecular scaffolds, a concept widely used by medicinal chemists to group molecules of similar properties. This classification can then be utilized for intuitive visualization of compounds. RESULTS: In this paper, we propose a scaffold hierarchy as a result of large-scale analysis of the PubChem Compound database. The analysis not only provided insights into scaffold diversity of the PubChem Compound database, but also enables scaffold-based hierarchical visualization of user compound data sets on the background of empirical chemical space, as defined by the PubChem data, or on the background of any other user-defined data set. The visualization is performed by a web based client-server application called Scaffvis. It provides an interactive zoomable tree map visualization of data sets up to hundreds of thousands molecules. Scaffvis is free to use and its source codes have been published under an open source license. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-016-0186-7) contains supplementary material, which is available to authorized users. Springer International Publishing 2016-12-29 /pmc/articles/PMC5199768/ /pubmed/28090217 http://dx.doi.org/10.1186/s13321-016-0186-7 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Velkoborsky, Jakub Hoksza, David Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization |
title | Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization |
title_full | Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization |
title_fullStr | Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization |
title_full_unstemmed | Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization |
title_short | Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization |
title_sort | scaffold analysis of pubchem database as background for hierarchical scaffold-based visualization |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5199768/ https://www.ncbi.nlm.nih.gov/pubmed/28090217 http://dx.doi.org/10.1186/s13321-016-0186-7 |
work_keys_str_mv | AT velkoborskyjakub scaffoldanalysisofpubchemdatabaseasbackgroundforhierarchicalscaffoldbasedvisualization AT hokszadavid scaffoldanalysisofpubchemdatabaseasbackgroundforhierarchicalscaffoldbasedvisualization |