Cargando…

Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints

BACKGROUND: Interaction fingerprints (IFP) have been repeatedly shown to be valuable tools in virtual screening to identify novel hit compounds that can subsequently be optimized to drug candidates. As a complementary method to ligand docking, IFPs can be applied to quantify the similarity of predic...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rácz, Anita, Bajusz, Dávid, Héberger, Károly
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6755604/ https://www.ncbi.nlm.nih.gov/pubmed/30288626 http://dx.doi.org/10.1186/s13321-018-0302-y

_version_	1783453268627685376
author	Rácz, Anita Bajusz, Dávid Héberger, Károly
author_facet	Rácz, Anita Bajusz, Dávid Héberger, Károly
author_sort	Rácz, Anita
collection	PubMed
description	BACKGROUND: Interaction fingerprints (IFP) have been repeatedly shown to be valuable tools in virtual screening to identify novel hit compounds that can subsequently be optimized to drug candidates. As a complementary method to ligand docking, IFPs can be applied to quantify the similarity of predicted binding poses to a reference binding pose. For this purpose, a large number of similarity metrics can be applied, and various parameters of the IFPs themselves can be customized. In a large-scale comparison, we have assessed the effect of similarity metrics and IFP configurations to a number of virtual screening scenarios with ten different protein targets and thousands of molecules. Particularly, the effect of considering general interaction definitions (such as Any Contact, Backbone Interaction and Sidechain Interaction), the effect of filtering methods and the different groups of similarity metrics were studied. RESULTS: The performances were primarily compared based on AUC values, but we have also used the original similarity data for the comparison of similarity metrics with several statistical tests and the novel, robust sum of ranking differences (SRD) algorithm. With SRD, we can evaluate the consistency (or concordance) of the various similarity metrics to an ideal reference metric, which is provided by data fusion from the existing metrics. Different aspects of IFP configurations and similarity metrics were examined based on SRD values with analysis of variance (ANOVA) tests. CONCLUSION: A general approach is provided that can be applied for the reliable interpretation and usage of similarity measures with interaction fingerprints. Metrics that are viable alternatives to the commonly used Tanimoto coefficient were identified based on a comparison with an ideal reference metric (consensus). A careful selection of the applied bits (interaction definitions) and IFP filtering rules can improve the results of virtual screening (in terms of their agreement with the consensus metric). The open-source Python package FPKit was introduced for the similarity calculations and IFP filtering; it is available at: https://github.com/davidbajusz/fpkit. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0302-y) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-6755604
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-67556042019-09-26 Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints Rácz, Anita Bajusz, Dávid Héberger, Károly J Cheminform Research Article BACKGROUND: Interaction fingerprints (IFP) have been repeatedly shown to be valuable tools in virtual screening to identify novel hit compounds that can subsequently be optimized to drug candidates. As a complementary method to ligand docking, IFPs can be applied to quantify the similarity of predicted binding poses to a reference binding pose. For this purpose, a large number of similarity metrics can be applied, and various parameters of the IFPs themselves can be customized. In a large-scale comparison, we have assessed the effect of similarity metrics and IFP configurations to a number of virtual screening scenarios with ten different protein targets and thousands of molecules. Particularly, the effect of considering general interaction definitions (such as Any Contact, Backbone Interaction and Sidechain Interaction), the effect of filtering methods and the different groups of similarity metrics were studied. RESULTS: The performances were primarily compared based on AUC values, but we have also used the original similarity data for the comparison of similarity metrics with several statistical tests and the novel, robust sum of ranking differences (SRD) algorithm. With SRD, we can evaluate the consistency (or concordance) of the various similarity metrics to an ideal reference metric, which is provided by data fusion from the existing metrics. Different aspects of IFP configurations and similarity metrics were examined based on SRD values with analysis of variance (ANOVA) tests. CONCLUSION: A general approach is provided that can be applied for the reliable interpretation and usage of similarity measures with interaction fingerprints. Metrics that are viable alternatives to the commonly used Tanimoto coefficient were identified based on a comparison with an ideal reference metric (consensus). A careful selection of the applied bits (interaction definitions) and IFP filtering rules can improve the results of virtual screening (in terms of their agreement with the consensus metric). The open-source Python package FPKit was introduced for the similarity calculations and IFP filtering; it is available at: https://github.com/davidbajusz/fpkit. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0302-y) contains supplementary material, which is available to authorized users. Springer International Publishing 2018-10-04 /pmc/articles/PMC6755604/ /pubmed/30288626 http://dx.doi.org/10.1186/s13321-018-0302-y Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Rácz, Anita Bajusz, Dávid Héberger, Károly Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints
title	Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints
title_full	Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints
title_fullStr	Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints
title_full_unstemmed	Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints
title_short	Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints
title_sort	life beyond the tanimoto coefficient: similarity measures for interaction fingerprints
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6755604/ https://www.ncbi.nlm.nih.gov/pubmed/30288626 http://dx.doi.org/10.1186/s13321-018-0302-y
work_keys_str_mv	AT raczanita lifebeyondthetanimotocoefficientsimilaritymeasuresforinteractionfingerprints AT bajuszdavid lifebeyondthetanimotocoefficientsimilaritymeasuresforinteractionfingerprints AT hebergerkaroly lifebeyondthetanimotocoefficientsimilaritymeasuresforinteractionfingerprints

Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints

Ejemplares similares