Cargando…

ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds

We present a novel approach called ChemMaps for visualizing chemical space based on the similarity matrix of compound datasets generated with molecular fingerprints’ similarity. The method uses a ‘satellites’ approach, where satellites are, in principle, molecules whose similarity to the rest of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Naveja, J. Jesús, Medina-Franco, José L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000Research 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5538041/
https://www.ncbi.nlm.nih.gov/pubmed/28794856
http://dx.doi.org/10.12688/f1000research.12095.2
_version_ 1783254290938200064
author Naveja, J. Jesús
Medina-Franco, José L.
author_facet Naveja, J. Jesús
Medina-Franco, José L.
author_sort Naveja, J. Jesús
collection PubMed
description We present a novel approach called ChemMaps for visualizing chemical space based on the similarity matrix of compound datasets generated with molecular fingerprints’ similarity. The method uses a ‘satellites’ approach, where satellites are, in principle, molecules whose similarity to the rest of the molecules in the database provides sufficient information for generating a visualization of the chemical space. Such an approach could help make chemical space visualizations more efficient. We hereby describe a proof-of-principle application of the method to various databases that have different diversity measures. Unsurprisingly, we found the method works better with databases that have low 2D diversity. 3D diversity played a secondary role, although it seems to be more relevant as 2D diversity increases. For less diverse datasets, taking as few as 25% satellites seems to be sufficient for a fair depiction of the chemical space. We propose to iteratively increase the satellites number by a factor of 5% relative to the whole database, and stop when the new and the prior chemical space correlate highly. This Research Note represents a first exploratory step, prior to the full application of this method for several datasets.
format Online
Article
Text
id pubmed-5538041
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher F1000Research
record_format MEDLINE/PubMed
spelling pubmed-55380412017-08-08 ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds Naveja, J. Jesús Medina-Franco, José L. F1000Res Research Note We present a novel approach called ChemMaps for visualizing chemical space based on the similarity matrix of compound datasets generated with molecular fingerprints’ similarity. The method uses a ‘satellites’ approach, where satellites are, in principle, molecules whose similarity to the rest of the molecules in the database provides sufficient information for generating a visualization of the chemical space. Such an approach could help make chemical space visualizations more efficient. We hereby describe a proof-of-principle application of the method to various databases that have different diversity measures. Unsurprisingly, we found the method works better with databases that have low 2D diversity. 3D diversity played a secondary role, although it seems to be more relevant as 2D diversity increases. For less diverse datasets, taking as few as 25% satellites seems to be sufficient for a fair depiction of the chemical space. We propose to iteratively increase the satellites number by a factor of 5% relative to the whole database, and stop when the new and the prior chemical space correlate highly. This Research Note represents a first exploratory step, prior to the full application of this method for several datasets. F1000Research 2017-08-04 /pmc/articles/PMC5538041/ /pubmed/28794856 http://dx.doi.org/10.12688/f1000research.12095.2 Text en Copyright: © 2017 Naveja JJ and Medina-Franco JL http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Note
Naveja, J. Jesús
Medina-Franco, José L.
ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds
title ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds
title_full ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds
title_fullStr ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds
title_full_unstemmed ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds
title_short ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds
title_sort chemmaps: towards an approach for visualizing the chemical space based on adaptive satellite compounds
topic Research Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5538041/
https://www.ncbi.nlm.nih.gov/pubmed/28794856
http://dx.doi.org/10.12688/f1000research.12095.2
work_keys_str_mv AT navejajjesus chemmapstowardsanapproachforvisualizingthechemicalspacebasedonadaptivesatellitecompounds
AT medinafrancojosel chemmapstowardsanapproachforvisualizingthechemicalspacebasedonadaptivesatellitecompounds