Cargando…

Adapting Document Similarity Measures for Ligand-Based Virtual Screening

Quantifying the similarity of molecules is considered one of the major tasks in virtual screening. There are many similarity measures that have been proposed for this purpose, some of which have been derived from document and text retrieving areas as most often these similarity methods give good res...

Descripción completa

Detalles Bibliográficos
Autores principales: Himmat, Mubarak, Salim, Naomie, Al-Dabbagh, Mohammed Mumtaz, Saeed, Faisal, Ahmed, Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6274479/
https://www.ncbi.nlm.nih.gov/pubmed/27089312
http://dx.doi.org/10.3390/molecules21040476
_version_ 1783377627834220544
author Himmat, Mubarak
Salim, Naomie
Al-Dabbagh, Mohammed Mumtaz
Saeed, Faisal
Ahmed, Ali
author_facet Himmat, Mubarak
Salim, Naomie
Al-Dabbagh, Mohammed Mumtaz
Saeed, Faisal
Ahmed, Ali
author_sort Himmat, Mubarak
collection PubMed
description Quantifying the similarity of molecules is considered one of the major tasks in virtual screening. There are many similarity measures that have been proposed for this purpose, some of which have been derived from document and text retrieving areas as most often these similarity methods give good results in document retrieval and can achieve good results in virtual screening. In this work, we propose a similarity measure for ligand-based virtual screening, which has been derived from a text processing similarity measure. It has been adopted to be suitable for virtual screening; we called this proposed measure the Adapted Similarity Measure of Text Processing (ASMTP). For evaluating and testing the proposed ASMTP we conducted several experiments on two different benchmark datasets: the Maximum Unbiased Validation (MUV) and the MDL Drug Data Report (MDDR). The experiments have been conducted by choosing 10 reference structures from each class randomly as queries and evaluate them in the recall of cut-offs at 1% and 5%. The overall obtained results are compared with some similarity methods including the Tanimoto coefficient, which are considered to be the conventional and standard similarity coefficients for fingerprint-based similarity calculations. The achieved results show that the performance of ligand-based virtual screening is better and outperforms the Tanimoto coefficients and other methods.
format Online
Article
Text
id pubmed-6274479
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-62744792018-12-28 Adapting Document Similarity Measures for Ligand-Based Virtual Screening Himmat, Mubarak Salim, Naomie Al-Dabbagh, Mohammed Mumtaz Saeed, Faisal Ahmed, Ali Molecules Article Quantifying the similarity of molecules is considered one of the major tasks in virtual screening. There are many similarity measures that have been proposed for this purpose, some of which have been derived from document and text retrieving areas as most often these similarity methods give good results in document retrieval and can achieve good results in virtual screening. In this work, we propose a similarity measure for ligand-based virtual screening, which has been derived from a text processing similarity measure. It has been adopted to be suitable for virtual screening; we called this proposed measure the Adapted Similarity Measure of Text Processing (ASMTP). For evaluating and testing the proposed ASMTP we conducted several experiments on two different benchmark datasets: the Maximum Unbiased Validation (MUV) and the MDL Drug Data Report (MDDR). The experiments have been conducted by choosing 10 reference structures from each class randomly as queries and evaluate them in the recall of cut-offs at 1% and 5%. The overall obtained results are compared with some similarity methods including the Tanimoto coefficient, which are considered to be the conventional and standard similarity coefficients for fingerprint-based similarity calculations. The achieved results show that the performance of ligand-based virtual screening is better and outperforms the Tanimoto coefficients and other methods. MDPI 2016-04-13 /pmc/articles/PMC6274479/ /pubmed/27089312 http://dx.doi.org/10.3390/molecules21040476 Text en © 2016 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons by Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Himmat, Mubarak
Salim, Naomie
Al-Dabbagh, Mohammed Mumtaz
Saeed, Faisal
Ahmed, Ali
Adapting Document Similarity Measures for Ligand-Based Virtual Screening
title Adapting Document Similarity Measures for Ligand-Based Virtual Screening
title_full Adapting Document Similarity Measures for Ligand-Based Virtual Screening
title_fullStr Adapting Document Similarity Measures for Ligand-Based Virtual Screening
title_full_unstemmed Adapting Document Similarity Measures for Ligand-Based Virtual Screening
title_short Adapting Document Similarity Measures for Ligand-Based Virtual Screening
title_sort adapting document similarity measures for ligand-based virtual screening
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6274479/
https://www.ncbi.nlm.nih.gov/pubmed/27089312
http://dx.doi.org/10.3390/molecules21040476
work_keys_str_mv AT himmatmubarak adaptingdocumentsimilaritymeasuresforligandbasedvirtualscreening
AT salimnaomie adaptingdocumentsimilaritymeasuresforligandbasedvirtualscreening
AT aldabbaghmohammedmumtaz adaptingdocumentsimilaritymeasuresforligandbasedvirtualscreening
AT saeedfaisal adaptingdocumentsimilaritymeasuresforligandbasedvirtualscreening
AT ahmedali adaptingdocumentsimilaritymeasuresforligandbasedvirtualscreening