Cargando…

Multiple structure alignment and consensus identification for proteins

BACKGROUND: An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ilinkin, Ivaylo, Ye, Jieping, Janardan, Ravi
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2010
Materias:	Software
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2829528/ https://www.ncbi.nlm.nih.gov/pubmed/20122279 http://dx.doi.org/10.1186/1471-2105-11-71

_version_	1782178104380227584
author	Ilinkin, Ivaylo Ye, Jieping Janardan, Ravi
author_facet	Ilinkin, Ivaylo Ye, Jieping Janardan, Ravi
author_sort	Ilinkin, Ivaylo
collection	PubMed
description	BACKGROUND: An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then computes iteratively a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the consensus. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins. RESULTS: Experimental results show that the algorithm converges quite rapidly and generates consensus structures that are visually similar to the input proteins. A comparison with other coordinate-based alignment algorithms (MAMMOTH and MATT) shows that the proposed algorithm is competitive in terms of speed and the sizes of the conserved regions discovered in an extensive benchmark dataset derived from the HOMSTRAD and SABmark databases. The algorithm has been implemented in C++ and can be downloaded from the project's web page. Alternatively, the algorithm can be used via a web server which makes it possible to align protein structures by uploading files from local disk or by downloading protein data from the RCSB Protein Data Bank. CONCLUSIONS: An algorithm is presented to compute a multiple structure alignment for a set of proteins, together with their consensus structure. Experimental results show its effectiveness in terms of the quality of the alignment and computational cost.
format	Text
id	pubmed-2829528
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-28295282010-02-28 Multiple structure alignment and consensus identification for proteins Ilinkin, Ivaylo Ye, Jieping Janardan, Ravi BMC Bioinformatics Software BACKGROUND: An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then computes iteratively a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the consensus. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins. RESULTS: Experimental results show that the algorithm converges quite rapidly and generates consensus structures that are visually similar to the input proteins. A comparison with other coordinate-based alignment algorithms (MAMMOTH and MATT) shows that the proposed algorithm is competitive in terms of speed and the sizes of the conserved regions discovered in an extensive benchmark dataset derived from the HOMSTRAD and SABmark databases. The algorithm has been implemented in C++ and can be downloaded from the project's web page. Alternatively, the algorithm can be used via a web server which makes it possible to align protein structures by uploading files from local disk or by downloading protein data from the RCSB Protein Data Bank. CONCLUSIONS: An algorithm is presented to compute a multiple structure alignment for a set of proteins, together with their consensus structure. Experimental results show its effectiveness in terms of the quality of the alignment and computational cost. BioMed Central 2010-02-02 /pmc/articles/PMC2829528/ /pubmed/20122279 http://dx.doi.org/10.1186/1471-2105-11-71 Text en Copyright ©2010 Ilinkin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Software Ilinkin, Ivaylo Ye, Jieping Janardan, Ravi Multiple structure alignment and consensus identification for proteins
title	Multiple structure alignment and consensus identification for proteins
title_full	Multiple structure alignment and consensus identification for proteins
title_fullStr	Multiple structure alignment and consensus identification for proteins
title_full_unstemmed	Multiple structure alignment and consensus identification for proteins
title_short	Multiple structure alignment and consensus identification for proteins
title_sort	multiple structure alignment and consensus identification for proteins
topic	Software
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2829528/ https://www.ncbi.nlm.nih.gov/pubmed/20122279 http://dx.doi.org/10.1186/1471-2105-11-71
work_keys_str_mv	AT ilinkinivaylo multiplestructurealignmentandconsensusidentificationforproteins AT yejieping multiplestructurealignmentandconsensusidentificationforproteins AT janardanravi multiplestructurealignmentandconsensusidentificationforproteins

Multiple structure alignment and consensus identification for proteins

Ejemplares similares