Cargando…

Efficient protein alignment algorithm for protein search

BACKGROUND: Proteins show a great variety of 3D conformations, which can be used to infer their evolutionary relationship and to classify them into more general groups; therefore protein structure alignment algorithms are very helpful for protein biologists. However, an accurate alignment algorithm...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Zaixin, Zhao, Zhiyu, Fu, Bin
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009506/
https://www.ncbi.nlm.nih.gov/pubmed/20122207
http://dx.doi.org/10.1186/1471-2105-11-S1-S34
_version_ 1782194694291193856
author Lu, Zaixin
Zhao, Zhiyu
Fu, Bin
author_facet Lu, Zaixin
Zhao, Zhiyu
Fu, Bin
author_sort Lu, Zaixin
collection PubMed
description BACKGROUND: Proteins show a great variety of 3D conformations, which can be used to infer their evolutionary relationship and to classify them into more general groups; therefore protein structure alignment algorithms are very helpful for protein biologists. However, an accurate alignment algorithm itself may be insufficient for effective discovering of structural relationships among tens of thousands of proteins. Due to the exponentially increasing amount of protein structural data, a fast and accurate structure alignment tool is necessary to access protein classification and protein similarity search; however, the complexity of current alignment algorithms are usually too high to make a fully alignment-based classification and search practical. RESULTS: We have developed an efficient protein pairwise alignment algorithm and applied it to our protein search tool, which aligns a query protein structure in the pairwise manner with all protein structures in the Protein Data Bank (PDB) to output similar protein structures. The algorithm can align hundreds of pairs of protein structures in one second. Given a protein structure, the tool efficiently discovers similar structures from tens of thousands of structures stored in the PDB always in 2 minutes in a single machine and 20 seconds in our cluster of 6 machines. The algorithm has been fully implemented and is accessible online at our webserver, which is supported by a cluster of computers. CONCLUSION: Our algorithm can work out hundreds of pairs of protein alignments in one second. Therefore, it is very suitable for protein search. Our experimental results show that it is more accurate than other well known protein search systems in finding proteins which are structurally similar at SCOP family and superfamily levels, and its speed is also competitive with those systems. In terms of the pairwise alignment performance, it is as good as some well known alignment algorithms.
format Text
id pubmed-3009506
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30095062010-12-23 Efficient protein alignment algorithm for protein search Lu, Zaixin Zhao, Zhiyu Fu, Bin BMC Bioinformatics Research BACKGROUND: Proteins show a great variety of 3D conformations, which can be used to infer their evolutionary relationship and to classify them into more general groups; therefore protein structure alignment algorithms are very helpful for protein biologists. However, an accurate alignment algorithm itself may be insufficient for effective discovering of structural relationships among tens of thousands of proteins. Due to the exponentially increasing amount of protein structural data, a fast and accurate structure alignment tool is necessary to access protein classification and protein similarity search; however, the complexity of current alignment algorithms are usually too high to make a fully alignment-based classification and search practical. RESULTS: We have developed an efficient protein pairwise alignment algorithm and applied it to our protein search tool, which aligns a query protein structure in the pairwise manner with all protein structures in the Protein Data Bank (PDB) to output similar protein structures. The algorithm can align hundreds of pairs of protein structures in one second. Given a protein structure, the tool efficiently discovers similar structures from tens of thousands of structures stored in the PDB always in 2 minutes in a single machine and 20 seconds in our cluster of 6 machines. The algorithm has been fully implemented and is accessible online at our webserver, which is supported by a cluster of computers. CONCLUSION: Our algorithm can work out hundreds of pairs of protein alignments in one second. Therefore, it is very suitable for protein search. Our experimental results show that it is more accurate than other well known protein search systems in finding proteins which are structurally similar at SCOP family and superfamily levels, and its speed is also competitive with those systems. In terms of the pairwise alignment performance, it is as good as some well known alignment algorithms. BioMed Central 2010-01-18 /pmc/articles/PMC3009506/ /pubmed/20122207 http://dx.doi.org/10.1186/1471-2105-11-S1-S34 Text en Copyright ©2010 Lu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Lu, Zaixin
Zhao, Zhiyu
Fu, Bin
Efficient protein alignment algorithm for protein search
title Efficient protein alignment algorithm for protein search
title_full Efficient protein alignment algorithm for protein search
title_fullStr Efficient protein alignment algorithm for protein search
title_full_unstemmed Efficient protein alignment algorithm for protein search
title_short Efficient protein alignment algorithm for protein search
title_sort efficient protein alignment algorithm for protein search
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009506/
https://www.ncbi.nlm.nih.gov/pubmed/20122207
http://dx.doi.org/10.1186/1471-2105-11-S1-S34
work_keys_str_mv AT luzaixin efficientproteinalignmentalgorithmforproteinsearch
AT zhaozhiyu efficientproteinalignmentalgorithmforproteinsearch
AT fubin efficientproteinalignmentalgorithmforproteinsearch