Cargando…

An alphabetic code based atomic level molecular similarity search in databases

Atomic level molecular similarity and diversity studies have gained considerable importance through their wide application in Bioinformatics and Chemo-informatics for drug design. The availability of large volumes of data on chemical compounds requires new methodologies for efficient and effective s...

Descripción completa

Detalles Bibliográficos
Autores principales: Saranya, Nallusamy, Selvaraj, Samuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3398777/
https://www.ncbi.nlm.nih.gov/pubmed/22829718
http://dx.doi.org/10.6026/97320630008498
_version_ 1782238320246390784
author Saranya, Nallusamy
Selvaraj, Samuel
author_facet Saranya, Nallusamy
Selvaraj, Samuel
author_sort Saranya, Nallusamy
collection PubMed
description Atomic level molecular similarity and diversity studies have gained considerable importance through their wide application in Bioinformatics and Chemo-informatics for drug design. The availability of large volumes of data on chemical compounds requires new methodologies for efficient and effective searching of its archives in less time with optimal computational power. We describe an alphabetic algorithm for similarity searching based on atom-atom bonding preference for ligands. We represented 170 cyclindependent kinase 2 inhibitors using strings of pre-defined alphabets for searching using known protein sequence alignment tools. Thus, a common pattern was extracted using this set of compounds for database searching to retrieve similar active compounds. Area under the receiver operating characteristic (ROC) curve was used for the discrimination of similar and dissimilar compounds in the databases. An average retrieval rate of about 60% is obtained in cross-validation using the home-grown dataset and the directory of useful decoys (DUD, formally known as the ZINC database) data. This will help in the effective retrieval of similar compounds using database search.
format Online
Article
Text
id pubmed-3398777
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-33987772012-07-24 An alphabetic code based atomic level molecular similarity search in databases Saranya, Nallusamy Selvaraj, Samuel Bioinformation Hypothesis Atomic level molecular similarity and diversity studies have gained considerable importance through their wide application in Bioinformatics and Chemo-informatics for drug design. The availability of large volumes of data on chemical compounds requires new methodologies for efficient and effective searching of its archives in less time with optimal computational power. We describe an alphabetic algorithm for similarity searching based on atom-atom bonding preference for ligands. We represented 170 cyclindependent kinase 2 inhibitors using strings of pre-defined alphabets for searching using known protein sequence alignment tools. Thus, a common pattern was extracted using this set of compounds for database searching to retrieve similar active compounds. Area under the receiver operating characteristic (ROC) curve was used for the discrimination of similar and dissimilar compounds in the databases. An average retrieval rate of about 60% is obtained in cross-validation using the home-grown dataset and the directory of useful decoys (DUD, formally known as the ZINC database) data. This will help in the effective retrieval of similar compounds using database search. Biomedical Informatics 2012-06-16 /pmc/articles/PMC3398777/ /pubmed/22829718 http://dx.doi.org/10.6026/97320630008498 Text en © 2012 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Hypothesis
Saranya, Nallusamy
Selvaraj, Samuel
An alphabetic code based atomic level molecular similarity search in databases
title An alphabetic code based atomic level molecular similarity search in databases
title_full An alphabetic code based atomic level molecular similarity search in databases
title_fullStr An alphabetic code based atomic level molecular similarity search in databases
title_full_unstemmed An alphabetic code based atomic level molecular similarity search in databases
title_short An alphabetic code based atomic level molecular similarity search in databases
title_sort alphabetic code based atomic level molecular similarity search in databases
topic Hypothesis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3398777/
https://www.ncbi.nlm.nih.gov/pubmed/22829718
http://dx.doi.org/10.6026/97320630008498
work_keys_str_mv AT saranyanallusamy analphabeticcodebasedatomiclevelmolecularsimilaritysearchindatabases
AT selvarajsamuel analphabeticcodebasedatomiclevelmolecularsimilaritysearchindatabases
AT saranyanallusamy alphabeticcodebasedatomiclevelmolecularsimilaritysearchindatabases
AT selvarajsamuel alphabeticcodebasedatomiclevelmolecularsimilaritysearchindatabases