Cargando…
ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds
We present a novel approach called ChemMaps for visualizing chemical space based on the similarity matrix of compound datasets generated with molecular fingerprints’ similarity. The method uses a ‘satellites’ approach, where satellites are, in principle, molecules whose similarity to the rest of the...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000Research
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5538041/ https://www.ncbi.nlm.nih.gov/pubmed/28794856 http://dx.doi.org/10.12688/f1000research.12095.2 |
_version_ | 1783254290938200064 |
---|---|
author | Naveja, J. Jesús Medina-Franco, José L. |
author_facet | Naveja, J. Jesús Medina-Franco, José L. |
author_sort | Naveja, J. Jesús |
collection | PubMed |
description | We present a novel approach called ChemMaps for visualizing chemical space based on the similarity matrix of compound datasets generated with molecular fingerprints’ similarity. The method uses a ‘satellites’ approach, where satellites are, in principle, molecules whose similarity to the rest of the molecules in the database provides sufficient information for generating a visualization of the chemical space. Such an approach could help make chemical space visualizations more efficient. We hereby describe a proof-of-principle application of the method to various databases that have different diversity measures. Unsurprisingly, we found the method works better with databases that have low 2D diversity. 3D diversity played a secondary role, although it seems to be more relevant as 2D diversity increases. For less diverse datasets, taking as few as 25% satellites seems to be sufficient for a fair depiction of the chemical space. We propose to iteratively increase the satellites number by a factor of 5% relative to the whole database, and stop when the new and the prior chemical space correlate highly. This Research Note represents a first exploratory step, prior to the full application of this method for several datasets. |
format | Online Article Text |
id | pubmed-5538041 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | F1000Research |
record_format | MEDLINE/PubMed |
spelling | pubmed-55380412017-08-08 ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds Naveja, J. Jesús Medina-Franco, José L. F1000Res Research Note We present a novel approach called ChemMaps for visualizing chemical space based on the similarity matrix of compound datasets generated with molecular fingerprints’ similarity. The method uses a ‘satellites’ approach, where satellites are, in principle, molecules whose similarity to the rest of the molecules in the database provides sufficient information for generating a visualization of the chemical space. Such an approach could help make chemical space visualizations more efficient. We hereby describe a proof-of-principle application of the method to various databases that have different diversity measures. Unsurprisingly, we found the method works better with databases that have low 2D diversity. 3D diversity played a secondary role, although it seems to be more relevant as 2D diversity increases. For less diverse datasets, taking as few as 25% satellites seems to be sufficient for a fair depiction of the chemical space. We propose to iteratively increase the satellites number by a factor of 5% relative to the whole database, and stop when the new and the prior chemical space correlate highly. This Research Note represents a first exploratory step, prior to the full application of this method for several datasets. F1000Research 2017-08-04 /pmc/articles/PMC5538041/ /pubmed/28794856 http://dx.doi.org/10.12688/f1000research.12095.2 Text en Copyright: © 2017 Naveja JJ and Medina-Franco JL http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Note Naveja, J. Jesús Medina-Franco, José L. ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds |
title | ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds |
title_full | ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds |
title_fullStr | ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds |
title_full_unstemmed | ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds |
title_short | ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds |
title_sort | chemmaps: towards an approach for visualizing the chemical space based on adaptive satellite compounds |
topic | Research Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5538041/ https://www.ncbi.nlm.nih.gov/pubmed/28794856 http://dx.doi.org/10.12688/f1000research.12095.2 |
work_keys_str_mv | AT navejajjesus chemmapstowardsanapproachforvisualizingthechemicalspacebasedonadaptivesatellitecompounds AT medinafrancojosel chemmapstowardsanapproachforvisualizingthechemicalspacebasedonadaptivesatellitecompounds |