Cargando…

Activity-relevant similarity values for fingerprints and implications for similarity searching

A largely unsolved problem in chemoinformatics is the issue of how calculated compound similarity relates to activity similarity, which is central to many applications. In general, activity relationships are predicted from calculated similarity values. However, there is no solid scientific foundatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Jasial, Swarit, Hu, Ye, Vogt, Martin, Bajorath, Jürgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000Research 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4830209/
https://www.ncbi.nlm.nih.gov/pubmed/27127620
http://dx.doi.org/10.12688/f1000research.8357.2
_version_ 1782426874434027520
author Jasial, Swarit
Hu, Ye
Vogt, Martin
Bajorath, Jürgen
author_facet Jasial, Swarit
Hu, Ye
Vogt, Martin
Bajorath, Jürgen
author_sort Jasial, Swarit
collection PubMed
description A largely unsolved problem in chemoinformatics is the issue of how calculated compound similarity relates to activity similarity, which is central to many applications. In general, activity relationships are predicted from calculated similarity values. However, there is no solid scientific foundation to bridge between calculated molecular and observed activity similarity. Accordingly, the success rate of identifying new active compounds by similarity searching is limited. Although various attempts have been made to establish relationships between calculated fingerprint similarity values and biological activities, none of these has yielded generally applicable rules for similarity searching. In this study, we have addressed the question of molecular versus activity similarity in a more fundamental way. First, we have evaluated if activity-relevant similarity value ranges could in principle be identified for standard fingerprints and distinguished from similarity resulting from random compound comparisons. Then, we have analyzed if activity-relevant similarity values could be used to guide typical similarity search calculations aiming to identify active compounds in databases. It was found that activity-relevant similarity values can be identified as a characteristic feature of fingerprints. However, it was also shown that such values cannot be reliably used as thresholds for practical similarity search calculations. In addition, the analysis presented herein helped to rationalize differences in fingerprint search performance.
format Online
Article
Text
id pubmed-4830209
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher F1000Research
record_format MEDLINE/PubMed
spelling pubmed-48302092016-04-27 Activity-relevant similarity values for fingerprints and implications for similarity searching Jasial, Swarit Hu, Ye Vogt, Martin Bajorath, Jürgen F1000Res Research Article A largely unsolved problem in chemoinformatics is the issue of how calculated compound similarity relates to activity similarity, which is central to many applications. In general, activity relationships are predicted from calculated similarity values. However, there is no solid scientific foundation to bridge between calculated molecular and observed activity similarity. Accordingly, the success rate of identifying new active compounds by similarity searching is limited. Although various attempts have been made to establish relationships between calculated fingerprint similarity values and biological activities, none of these has yielded generally applicable rules for similarity searching. In this study, we have addressed the question of molecular versus activity similarity in a more fundamental way. First, we have evaluated if activity-relevant similarity value ranges could in principle be identified for standard fingerprints and distinguished from similarity resulting from random compound comparisons. Then, we have analyzed if activity-relevant similarity values could be used to guide typical similarity search calculations aiming to identify active compounds in databases. It was found that activity-relevant similarity values can be identified as a characteristic feature of fingerprints. However, it was also shown that such values cannot be reliably used as thresholds for practical similarity search calculations. In addition, the analysis presented herein helped to rationalize differences in fingerprint search performance. F1000Research 2016-04-28 /pmc/articles/PMC4830209/ /pubmed/27127620 http://dx.doi.org/10.12688/f1000research.8357.2 Text en Copyright: © 2016 Jasial S et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jasial, Swarit
Hu, Ye
Vogt, Martin
Bajorath, Jürgen
Activity-relevant similarity values for fingerprints and implications for similarity searching
title Activity-relevant similarity values for fingerprints and implications for similarity searching
title_full Activity-relevant similarity values for fingerprints and implications for similarity searching
title_fullStr Activity-relevant similarity values for fingerprints and implications for similarity searching
title_full_unstemmed Activity-relevant similarity values for fingerprints and implications for similarity searching
title_short Activity-relevant similarity values for fingerprints and implications for similarity searching
title_sort activity-relevant similarity values for fingerprints and implications for similarity searching
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4830209/
https://www.ncbi.nlm.nih.gov/pubmed/27127620
http://dx.doi.org/10.12688/f1000research.8357.2
work_keys_str_mv AT jasialswarit activityrelevantsimilarityvaluesforfingerprintsandimplicationsforsimilaritysearching
AT huye activityrelevantsimilarityvaluesforfingerprintsandimplicationsforsimilaritysearching
AT vogtmartin activityrelevantsimilarityvaluesforfingerprintsandimplicationsforsimilaritysearching
AT bajorathjurgen activityrelevantsimilarityvaluesforfingerprintsandimplicationsforsimilaritysearching