Cargando…

Topology independent protein structural alignment

BACKGROUND: Identifying structurally similar proteins with different chain topologies can aid studies in homology modeling, protein folding, protein design, and protein evolution. These include circular permuted protein structures, and the more general cases of non-cyclic permutations between simila...

Descripción completa

Detalles Bibliográficos
Autores principales: Dundas, Joe, Binkowski, TA, DasGupta, Bhaskar, Liang, Jie
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2096629/
https://www.ncbi.nlm.nih.gov/pubmed/17937816
http://dx.doi.org/10.1186/1471-2105-8-388
_version_ 1782138246121127936
author Dundas, Joe
Binkowski, TA
DasGupta, Bhaskar
Liang, Jie
author_facet Dundas, Joe
Binkowski, TA
DasGupta, Bhaskar
Liang, Jie
author_sort Dundas, Joe
collection PubMed
description BACKGROUND: Identifying structurally similar proteins with different chain topologies can aid studies in homology modeling, protein folding, protein design, and protein evolution. These include circular permuted protein structures, and the more general cases of non-cyclic permutations between similar structures, which are related by non-topological rearrangement beyond circular permutation. We present a method based on an approximation algorithm that finds sequence-order independent structural alignments that are close to optimal. We formulate the structural alignment problem as a special case of the maximum-weight independent set problem, and solve this computationally intensive problem approximately by iteratively solving relaxations of a corresponding integer programming problem. The resulting structural alignment is sequence order independent. Our method is also insensitive to insertions, deletions, and gaps. RESULTS: Using a novel similarity score and a statistical model for significance p-value, we are able to discover previously unknown circular permuted proteins between nucleoplasmin-core protein and auxin binding protein, between aspartate rasemase and 3-dehydrogenate dehydralase, as well as between migration inhibition factor and arginine repressor which involves an additional strand-swapping. We also report the finding of non-cyclic permuted protein structures existing in nature between AML1/core binding factor and ribofalvin synthase. Our method can be used for large scale alignment of protein structures regardless of the topology. CONCLUSION: The approximation algorithm introduced in this work can find good solutions for the problem of protein structure alignment. Furthermore, this algorithm can detect topological differences between two spatially similar protein structures. The alignment between MIF and the arginine repressor demonstrates our algorithm's ability to detect structural similarities even when spatial rearrangement of structural units has occurred. The effectiveness of our method is also demonstrated by the discovery of previously unknown circular permutations. In addition, we report in this study the finding of a naturally occurring non-cyclic permuted protein between AML1/Core Binding Factor chain F and riboflavin synthase chain A.
format Text
id pubmed-2096629
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-20966292007-12-03 Topology independent protein structural alignment Dundas, Joe Binkowski, TA DasGupta, Bhaskar Liang, Jie BMC Bioinformatics Research Article BACKGROUND: Identifying structurally similar proteins with different chain topologies can aid studies in homology modeling, protein folding, protein design, and protein evolution. These include circular permuted protein structures, and the more general cases of non-cyclic permutations between similar structures, which are related by non-topological rearrangement beyond circular permutation. We present a method based on an approximation algorithm that finds sequence-order independent structural alignments that are close to optimal. We formulate the structural alignment problem as a special case of the maximum-weight independent set problem, and solve this computationally intensive problem approximately by iteratively solving relaxations of a corresponding integer programming problem. The resulting structural alignment is sequence order independent. Our method is also insensitive to insertions, deletions, and gaps. RESULTS: Using a novel similarity score and a statistical model for significance p-value, we are able to discover previously unknown circular permuted proteins between nucleoplasmin-core protein and auxin binding protein, between aspartate rasemase and 3-dehydrogenate dehydralase, as well as between migration inhibition factor and arginine repressor which involves an additional strand-swapping. We also report the finding of non-cyclic permuted protein structures existing in nature between AML1/core binding factor and ribofalvin synthase. Our method can be used for large scale alignment of protein structures regardless of the topology. CONCLUSION: The approximation algorithm introduced in this work can find good solutions for the problem of protein structure alignment. Furthermore, this algorithm can detect topological differences between two spatially similar protein structures. The alignment between MIF and the arginine repressor demonstrates our algorithm's ability to detect structural similarities even when spatial rearrangement of structural units has occurred. The effectiveness of our method is also demonstrated by the discovery of previously unknown circular permutations. In addition, we report in this study the finding of a naturally occurring non-cyclic permuted protein between AML1/Core Binding Factor chain F and riboflavin synthase chain A. BioMed Central 2007-10-15 /pmc/articles/PMC2096629/ /pubmed/17937816 http://dx.doi.org/10.1186/1471-2105-8-388 Text en Copyright © 2007 Dundas et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dundas, Joe
Binkowski, TA
DasGupta, Bhaskar
Liang, Jie
Topology independent protein structural alignment
title Topology independent protein structural alignment
title_full Topology independent protein structural alignment
title_fullStr Topology independent protein structural alignment
title_full_unstemmed Topology independent protein structural alignment
title_short Topology independent protein structural alignment
title_sort topology independent protein structural alignment
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2096629/
https://www.ncbi.nlm.nih.gov/pubmed/17937816
http://dx.doi.org/10.1186/1471-2105-8-388
work_keys_str_mv AT dundasjoe topologyindependentproteinstructuralalignment
AT binkowskita topologyindependentproteinstructuralalignment
AT dasguptabhaskar topologyindependentproteinstructuralalignment
AT liangjie topologyindependentproteinstructuralalignment