Cargando…
Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach
Two-dimensional (2D) chemical fingerprints are widely used as binary features for the quantification of structural similarity of chemical compounds, which is an important step in similarity-based virtual screening (VS). Here, using an eigenvalue-based entropy approach, we identified 2D fingerprints...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7989080/ https://www.ncbi.nlm.nih.gov/pubmed/33757582 http://dx.doi.org/10.1186/s13321-021-00506-2 |
_version_ | 1783668890184712192 |
---|---|
author | Kuwahara, Hiroyuki Gao, Xin |
author_facet | Kuwahara, Hiroyuki Gao, Xin |
author_sort | Kuwahara, Hiroyuki |
collection | PubMed |
description | Two-dimensional (2D) chemical fingerprints are widely used as binary features for the quantification of structural similarity of chemical compounds, which is an important step in similarity-based virtual screening (VS). Here, using an eigenvalue-based entropy approach, we identified 2D fingerprints with little to no contribution to shaping the eigenvalue distribution of the feature matrix as related ones and examined the degree to which these related 2D fingerprints influenced molecular similarity scores calculated with the Tanimoto coefficient. Our analysis identified many related fingerprints in publicly available fingerprint schemes and showed that their presence in the feature set could have substantial effects on the similarity scores and bias the outcome of molecular similarity analysis. Our results have implication in the optimal selection of 2D fingerprints for compound similarity analysis and the identification of potential hits for compounds with target biological activity in VS. |
format | Online Article Text |
id | pubmed-7989080 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-79890802021-03-25 Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach Kuwahara, Hiroyuki Gao, Xin J Cheminform Research Article Two-dimensional (2D) chemical fingerprints are widely used as binary features for the quantification of structural similarity of chemical compounds, which is an important step in similarity-based virtual screening (VS). Here, using an eigenvalue-based entropy approach, we identified 2D fingerprints with little to no contribution to shaping the eigenvalue distribution of the feature matrix as related ones and examined the degree to which these related 2D fingerprints influenced molecular similarity scores calculated with the Tanimoto coefficient. Our analysis identified many related fingerprints in publicly available fingerprint schemes and showed that their presence in the feature set could have substantial effects on the similarity scores and bias the outcome of molecular similarity analysis. Our results have implication in the optimal selection of 2D fingerprints for compound similarity analysis and the identification of potential hits for compounds with target biological activity in VS. Springer International Publishing 2021-03-23 /pmc/articles/PMC7989080/ /pubmed/33757582 http://dx.doi.org/10.1186/s13321-021-00506-2 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Kuwahara, Hiroyuki Gao, Xin Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach |
title | Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach |
title_full | Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach |
title_fullStr | Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach |
title_full_unstemmed | Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach |
title_short | Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach |
title_sort | analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7989080/ https://www.ncbi.nlm.nih.gov/pubmed/33757582 http://dx.doi.org/10.1186/s13321-021-00506-2 |
work_keys_str_mv | AT kuwaharahiroyuki analysisoftheeffectsofrelatedfingerprintsonmolecularsimilarityusinganeigenvalueentropyapproach AT gaoxin analysisoftheeffectsofrelatedfingerprintsonmolecularsimilarityusinganeigenvalueentropyapproach |