Cargando…

TS-AMIR: a topology string alignment method for intensive rapid protein structure comparison

BACKGROUND: In structural biology, similarity analysis of protein structure is a crucial step in studying the relationship between proteins. Despite the considerable number of techniques that have been explored within the past two decades, the development of new alternative methods is still an activ...

Descripción completa

Detalles Bibliográficos
Autores principales: Razmara, Jafar, Deris, Safaai, Parvizpour, Sepideh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3298807/
https://www.ncbi.nlm.nih.gov/pubmed/22336468
http://dx.doi.org/10.1186/1748-7188-7-4
_version_ 1782226043288944640
author Razmara, Jafar
Deris, Safaai
Parvizpour, Sepideh
author_facet Razmara, Jafar
Deris, Safaai
Parvizpour, Sepideh
author_sort Razmara, Jafar
collection PubMed
description BACKGROUND: In structural biology, similarity analysis of protein structure is a crucial step in studying the relationship between proteins. Despite the considerable number of techniques that have been explored within the past two decades, the development of new alternative methods is still an active research area due to the need for high performance tools. RESULTS: In this paper, we present TS-AMIR, a Topology String Alignment Method for Intensive Rapid comparison of protein structures. The proposed method works in two stages: In the first stage, the method generates a topology string based on the geometric details of secondary structure elements, and then, utilizes an n-gram modelling technique over entropy concept to capture similarities in these strings. This initial correspondence map between secondary structure elements is submitted to the second stage in order to obtain the alignment at the residue level. Applying the Kabsch method, a heuristic step-by-step algorithm is adopted in the second stage to align the residues, resulting in an optimal rotation matrix and minimized RMSD. The performance of the method was assessed in different information retrieval tests and the results were compared with those of CE and TM-align, representing two geometrical tools, and YAKUSA, 3D-BLAST and SARST as three representatives of linear encoding schemes. It is shown that the method obtains a high running speed similar to that of the linear encoding schemes. In addition, the method runs about 800 and 7200 times faster than TM-align and CE respectively, while maintaining a competitive accuracy with TM-align and CE. CONCLUSIONS: The experimental results demonstrate that linear encoding techniques are capable of reaching the same high degree of accuracy as that achieved by geometrical methods, while generally running hundreds of times faster than conventional programs.
format Online
Article
Text
id pubmed-3298807
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32988072012-03-12 TS-AMIR: a topology string alignment method for intensive rapid protein structure comparison Razmara, Jafar Deris, Safaai Parvizpour, Sepideh Algorithms Mol Biol Research BACKGROUND: In structural biology, similarity analysis of protein structure is a crucial step in studying the relationship between proteins. Despite the considerable number of techniques that have been explored within the past two decades, the development of new alternative methods is still an active research area due to the need for high performance tools. RESULTS: In this paper, we present TS-AMIR, a Topology String Alignment Method for Intensive Rapid comparison of protein structures. The proposed method works in two stages: In the first stage, the method generates a topology string based on the geometric details of secondary structure elements, and then, utilizes an n-gram modelling technique over entropy concept to capture similarities in these strings. This initial correspondence map between secondary structure elements is submitted to the second stage in order to obtain the alignment at the residue level. Applying the Kabsch method, a heuristic step-by-step algorithm is adopted in the second stage to align the residues, resulting in an optimal rotation matrix and minimized RMSD. The performance of the method was assessed in different information retrieval tests and the results were compared with those of CE and TM-align, representing two geometrical tools, and YAKUSA, 3D-BLAST and SARST as three representatives of linear encoding schemes. It is shown that the method obtains a high running speed similar to that of the linear encoding schemes. In addition, the method runs about 800 and 7200 times faster than TM-align and CE respectively, while maintaining a competitive accuracy with TM-align and CE. CONCLUSIONS: The experimental results demonstrate that linear encoding techniques are capable of reaching the same high degree of accuracy as that achieved by geometrical methods, while generally running hundreds of times faster than conventional programs. BioMed Central 2012-02-15 /pmc/articles/PMC3298807/ /pubmed/22336468 http://dx.doi.org/10.1186/1748-7188-7-4 Text en Copyright ©2012 Razmara et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Razmara, Jafar
Deris, Safaai
Parvizpour, Sepideh
TS-AMIR: a topology string alignment method for intensive rapid protein structure comparison
title TS-AMIR: a topology string alignment method for intensive rapid protein structure comparison
title_full TS-AMIR: a topology string alignment method for intensive rapid protein structure comparison
title_fullStr TS-AMIR: a topology string alignment method for intensive rapid protein structure comparison
title_full_unstemmed TS-AMIR: a topology string alignment method for intensive rapid protein structure comparison
title_short TS-AMIR: a topology string alignment method for intensive rapid protein structure comparison
title_sort ts-amir: a topology string alignment method for intensive rapid protein structure comparison
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3298807/
https://www.ncbi.nlm.nih.gov/pubmed/22336468
http://dx.doi.org/10.1186/1748-7188-7-4
work_keys_str_mv AT razmarajafar tsamiratopologystringalignmentmethodforintensiverapidproteinstructurecomparison
AT derissafaai tsamiratopologystringalignmentmethodforintensiverapidproteinstructurecomparison
AT parvizpoursepideh tsamiratopologystringalignmentmethodforintensiverapidproteinstructurecomparison