Cargando…

Multi-view methods for protein structure comparison using latent dirichlet allocation

Motivation: With rapidly expanding protein structure databases, efficiently retrieving structures similar to a given protein is an important problem. It involves two major issues: (i) effective protein structure representation that captures inherent relationship between fragments and facilitates eff...

Descripción completa

Detalles Bibliográficos
Autores principales: Shivashankar, S., Srivathsan, S., Ravindran, B., Tendulkar, Ashish V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117356/
https://www.ncbi.nlm.nih.gov/pubmed/21685102
http://dx.doi.org/10.1093/bioinformatics/btr249
Descripción
Sumario:Motivation: With rapidly expanding protein structure databases, efficiently retrieving structures similar to a given protein is an important problem. It involves two major issues: (i) effective protein structure representation that captures inherent relationship between fragments and facilitates efficient comparison between the structures and (ii) effective framework to address different retrieval requirements. Recently, researchers proposed vector space model of proteins using bag of fragments representation (FragBag), which corresponds to the basic information retrieval model. Results: In this article, we propose an improved representation of protein structures using latent dirichlet allocation topic model. Another important requirement is to retrieve proteins, whether they are either close or remote homologs. In order to meet diverse objectives, we propose multi-viewpoint based framework that combines multiple representations and retrieval techniques. We compare the proposed representation and retrieval framework on the benchmark dataset developed by Kolodny and co-workers. The results indicate that the proposed techniques outperform state-of-the-art methods. Availability: http://www.cse.iitm.ac.in/~ashishvt/research/protein-lda/. Contact: ashishvt@cse.iitm.ac.in