Cargando…
Topology independent protein structural alignment
BACKGROUND: Identifying structurally similar proteins with different chain topologies can aid studies in homology modeling, protein folding, protein design, and protein evolution. These include circular permuted protein structures, and the more general cases of non-cyclic permutations between simila...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2096629/ https://www.ncbi.nlm.nih.gov/pubmed/17937816 http://dx.doi.org/10.1186/1471-2105-8-388 |
_version_ | 1782138246121127936 |
---|---|
author | Dundas, Joe Binkowski, TA DasGupta, Bhaskar Liang, Jie |
author_facet | Dundas, Joe Binkowski, TA DasGupta, Bhaskar Liang, Jie |
author_sort | Dundas, Joe |
collection | PubMed |
description | BACKGROUND: Identifying structurally similar proteins with different chain topologies can aid studies in homology modeling, protein folding, protein design, and protein evolution. These include circular permuted protein structures, and the more general cases of non-cyclic permutations between similar structures, which are related by non-topological rearrangement beyond circular permutation. We present a method based on an approximation algorithm that finds sequence-order independent structural alignments that are close to optimal. We formulate the structural alignment problem as a special case of the maximum-weight independent set problem, and solve this computationally intensive problem approximately by iteratively solving relaxations of a corresponding integer programming problem. The resulting structural alignment is sequence order independent. Our method is also insensitive to insertions, deletions, and gaps. RESULTS: Using a novel similarity score and a statistical model for significance p-value, we are able to discover previously unknown circular permuted proteins between nucleoplasmin-core protein and auxin binding protein, between aspartate rasemase and 3-dehydrogenate dehydralase, as well as between migration inhibition factor and arginine repressor which involves an additional strand-swapping. We also report the finding of non-cyclic permuted protein structures existing in nature between AML1/core binding factor and ribofalvin synthase. Our method can be used for large scale alignment of protein structures regardless of the topology. CONCLUSION: The approximation algorithm introduced in this work can find good solutions for the problem of protein structure alignment. Furthermore, this algorithm can detect topological differences between two spatially similar protein structures. The alignment between MIF and the arginine repressor demonstrates our algorithm's ability to detect structural similarities even when spatial rearrangement of structural units has occurred. The effectiveness of our method is also demonstrated by the discovery of previously unknown circular permutations. In addition, we report in this study the finding of a naturally occurring non-cyclic permuted protein between AML1/Core Binding Factor chain F and riboflavin synthase chain A. |
format | Text |
id | pubmed-2096629 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-20966292007-12-03 Topology independent protein structural alignment Dundas, Joe Binkowski, TA DasGupta, Bhaskar Liang, Jie BMC Bioinformatics Research Article BACKGROUND: Identifying structurally similar proteins with different chain topologies can aid studies in homology modeling, protein folding, protein design, and protein evolution. These include circular permuted protein structures, and the more general cases of non-cyclic permutations between similar structures, which are related by non-topological rearrangement beyond circular permutation. We present a method based on an approximation algorithm that finds sequence-order independent structural alignments that are close to optimal. We formulate the structural alignment problem as a special case of the maximum-weight independent set problem, and solve this computationally intensive problem approximately by iteratively solving relaxations of a corresponding integer programming problem. The resulting structural alignment is sequence order independent. Our method is also insensitive to insertions, deletions, and gaps. RESULTS: Using a novel similarity score and a statistical model for significance p-value, we are able to discover previously unknown circular permuted proteins between nucleoplasmin-core protein and auxin binding protein, between aspartate rasemase and 3-dehydrogenate dehydralase, as well as between migration inhibition factor and arginine repressor which involves an additional strand-swapping. We also report the finding of non-cyclic permuted protein structures existing in nature between AML1/core binding factor and ribofalvin synthase. Our method can be used for large scale alignment of protein structures regardless of the topology. CONCLUSION: The approximation algorithm introduced in this work can find good solutions for the problem of protein structure alignment. Furthermore, this algorithm can detect topological differences between two spatially similar protein structures. The alignment between MIF and the arginine repressor demonstrates our algorithm's ability to detect structural similarities even when spatial rearrangement of structural units has occurred. The effectiveness of our method is also demonstrated by the discovery of previously unknown circular permutations. In addition, we report in this study the finding of a naturally occurring non-cyclic permuted protein between AML1/Core Binding Factor chain F and riboflavin synthase chain A. BioMed Central 2007-10-15 /pmc/articles/PMC2096629/ /pubmed/17937816 http://dx.doi.org/10.1186/1471-2105-8-388 Text en Copyright © 2007 Dundas et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Dundas, Joe Binkowski, TA DasGupta, Bhaskar Liang, Jie Topology independent protein structural alignment |
title | Topology independent protein structural alignment |
title_full | Topology independent protein structural alignment |
title_fullStr | Topology independent protein structural alignment |
title_full_unstemmed | Topology independent protein structural alignment |
title_short | Topology independent protein structural alignment |
title_sort | topology independent protein structural alignment |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2096629/ https://www.ncbi.nlm.nih.gov/pubmed/17937816 http://dx.doi.org/10.1186/1471-2105-8-388 |
work_keys_str_mv | AT dundasjoe topologyindependentproteinstructuralalignment AT binkowskita topologyindependentproteinstructuralalignment AT dasguptabhaskar topologyindependentproteinstructuralalignment AT liangjie topologyindependentproteinstructuralalignment |