Cargando…

Efficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment

Computational alignment of a biopolymer sequence (e.g., an RNA or a protein) to a structure is an effective approach to predict and search for the structure of new sequences. To identify the structure of remote homologs, the structure-sequence alignment has to consider not only sequence similarity b...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Yinglei, Liu, Chunmei, Huang, Xiuzhen, Malmberg, Russell L., Xu, Ying, Cai, Liming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7121179/
http://dx.doi.org/10.1007/11557067_31
_version_ 1783515144745123840
author Song, Yinglei
Liu, Chunmei
Huang, Xiuzhen
Malmberg, Russell L.
Xu, Ying
Cai, Liming
author_facet Song, Yinglei
Liu, Chunmei
Huang, Xiuzhen
Malmberg, Russell L.
Xu, Ying
Cai, Liming
author_sort Song, Yinglei
collection PubMed
description Computational alignment of a biopolymer sequence (e.g., an RNA or a protein) to a structure is an effective approach to predict and search for the structure of new sequences. To identify the structure of remote homologs, the structure-sequence alignment has to consider not only sequence similarity but also spatially conserved conformations caused by residue interactions, and consequently is computationally intractable. It is difficult to cope with the inefficiency without compromising alignment accuracy, especially for structure search in genomes or large databases. This paper introduces a novel method and a parameterized algorithm for structure-sequence alignment. Both the structure and the sequence are represented as graphs, where in general the graph for a biopolymer structure has a naturally small tree width. The algorithm constructs an optimal alignment by finding in the sequence graph the maximum valued subgraph isomorphic to the structure graph. It has the computational time complexity O(k (t) N (2)) for the structure of N residues and its tree decomposition of width t. The parameter k, small in nature, is determined by a statistical cutoff for the correspondence between the structure and the sequence. The paper demonstrates a successful application of the algorithm to developing a fast program for RNA structural homology search.
format Online
Article
Text
id pubmed-7121179
institution National Center for Biotechnology Information
language English
publishDate 2005
record_format MEDLINE/PubMed
spelling pubmed-71211792020-04-06 Efficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment Song, Yinglei Liu, Chunmei Huang, Xiuzhen Malmberg, Russell L. Xu, Ying Cai, Liming Algorithms in Bioinformatics Article Computational alignment of a biopolymer sequence (e.g., an RNA or a protein) to a structure is an effective approach to predict and search for the structure of new sequences. To identify the structure of remote homologs, the structure-sequence alignment has to consider not only sequence similarity but also spatially conserved conformations caused by residue interactions, and consequently is computationally intractable. It is difficult to cope with the inefficiency without compromising alignment accuracy, especially for structure search in genomes or large databases. This paper introduces a novel method and a parameterized algorithm for structure-sequence alignment. Both the structure and the sequence are represented as graphs, where in general the graph for a biopolymer structure has a naturally small tree width. The algorithm constructs an optimal alignment by finding in the sequence graph the maximum valued subgraph isomorphic to the structure graph. It has the computational time complexity O(k (t) N (2)) for the structure of N residues and its tree decomposition of width t. The parameter k, small in nature, is determined by a statistical cutoff for the correspondence between the structure and the sequence. The paper demonstrates a successful application of the algorithm to developing a fast program for RNA structural homology search. 2005 /pmc/articles/PMC7121179/ http://dx.doi.org/10.1007/11557067_31 Text en © Springer-Verlag Berlin Heidelberg 2005 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Song, Yinglei
Liu, Chunmei
Huang, Xiuzhen
Malmberg, Russell L.
Xu, Ying
Cai, Liming
Efficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment
title Efficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment
title_full Efficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment
title_fullStr Efficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment
title_full_unstemmed Efficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment
title_short Efficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment
title_sort efficient parameterized algorithm for biopolymer structure-sequence alignment
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7121179/
http://dx.doi.org/10.1007/11557067_31
work_keys_str_mv AT songyinglei efficientparameterizedalgorithmforbiopolymerstructuresequencealignment
AT liuchunmei efficientparameterizedalgorithmforbiopolymerstructuresequencealignment
AT huangxiuzhen efficientparameterizedalgorithmforbiopolymerstructuresequencealignment
AT malmbergrusselll efficientparameterizedalgorithmforbiopolymerstructuresequencealignment
AT xuying efficientparameterizedalgorithmforbiopolymerstructuresequencealignment
AT cailiming efficientparameterizedalgorithmforbiopolymerstructuresequencealignment