Cargando…
An alphabetic code based atomic level molecular similarity search in databases
Atomic level molecular similarity and diversity studies have gained considerable importance through their wide application in Bioinformatics and Chemo-informatics for drug design. The availability of large volumes of data on chemical compounds requires new methodologies for efficient and effective s...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Biomedical Informatics
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3398777/ https://www.ncbi.nlm.nih.gov/pubmed/22829718 http://dx.doi.org/10.6026/97320630008498 |
_version_ | 1782238320246390784 |
---|---|
author | Saranya, Nallusamy Selvaraj, Samuel |
author_facet | Saranya, Nallusamy Selvaraj, Samuel |
author_sort | Saranya, Nallusamy |
collection | PubMed |
description | Atomic level molecular similarity and diversity studies have gained considerable importance through their wide application in Bioinformatics and Chemo-informatics for drug design. The availability of large volumes of data on chemical compounds requires new methodologies for efficient and effective searching of its archives in less time with optimal computational power. We describe an alphabetic algorithm for similarity searching based on atom-atom bonding preference for ligands. We represented 170 cyclindependent kinase 2 inhibitors using strings of pre-defined alphabets for searching using known protein sequence alignment tools. Thus, a common pattern was extracted using this set of compounds for database searching to retrieve similar active compounds. Area under the receiver operating characteristic (ROC) curve was used for the discrimination of similar and dissimilar compounds in the databases. An average retrieval rate of about 60% is obtained in cross-validation using the home-grown dataset and the directory of useful decoys (DUD, formally known as the ZINC database) data. This will help in the effective retrieval of similar compounds using database search. |
format | Online Article Text |
id | pubmed-3398777 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Biomedical Informatics |
record_format | MEDLINE/PubMed |
spelling | pubmed-33987772012-07-24 An alphabetic code based atomic level molecular similarity search in databases Saranya, Nallusamy Selvaraj, Samuel Bioinformation Hypothesis Atomic level molecular similarity and diversity studies have gained considerable importance through their wide application in Bioinformatics and Chemo-informatics for drug design. The availability of large volumes of data on chemical compounds requires new methodologies for efficient and effective searching of its archives in less time with optimal computational power. We describe an alphabetic algorithm for similarity searching based on atom-atom bonding preference for ligands. We represented 170 cyclindependent kinase 2 inhibitors using strings of pre-defined alphabets for searching using known protein sequence alignment tools. Thus, a common pattern was extracted using this set of compounds for database searching to retrieve similar active compounds. Area under the receiver operating characteristic (ROC) curve was used for the discrimination of similar and dissimilar compounds in the databases. An average retrieval rate of about 60% is obtained in cross-validation using the home-grown dataset and the directory of useful decoys (DUD, formally known as the ZINC database) data. This will help in the effective retrieval of similar compounds using database search. Biomedical Informatics 2012-06-16 /pmc/articles/PMC3398777/ /pubmed/22829718 http://dx.doi.org/10.6026/97320630008498 Text en © 2012 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited. |
spellingShingle | Hypothesis Saranya, Nallusamy Selvaraj, Samuel An alphabetic code based atomic level molecular similarity search in databases |
title | An alphabetic code based atomic level molecular similarity search in databases |
title_full | An alphabetic code based atomic level molecular similarity search in databases |
title_fullStr | An alphabetic code based atomic level molecular similarity search in databases |
title_full_unstemmed | An alphabetic code based atomic level molecular similarity search in databases |
title_short | An alphabetic code based atomic level molecular similarity search in databases |
title_sort | alphabetic code based atomic level molecular similarity search in databases |
topic | Hypothesis |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3398777/ https://www.ncbi.nlm.nih.gov/pubmed/22829718 http://dx.doi.org/10.6026/97320630008498 |
work_keys_str_mv | AT saranyanallusamy analphabeticcodebasedatomiclevelmolecularsimilaritysearchindatabases AT selvarajsamuel analphabeticcodebasedatomiclevelmolecularsimilaritysearchindatabases AT saranyanallusamy alphabeticcodebasedatomiclevelmolecularsimilaritysearchindatabases AT selvarajsamuel alphabeticcodebasedatomiclevelmolecularsimilaritysearchindatabases |