Cargando…

A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure

BACKGROUND: Covariance models (CMs) are probabilistic models of RNA secondary structure, analogous to profile hidden Markov models of linear sequence. The dynamic programming algorithm for aligning a CM to an RNA sequence of length N is O(N(3)) in memory. This is only practical for small RNAs. RESUL...

Descripción completa

Detalles Bibliográficos
Autor principal: Eddy, Sean R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC119854/
https://www.ncbi.nlm.nih.gov/pubmed/12095421
http://dx.doi.org/10.1186/1471-2105-3-18
_version_ 1782120296983035904
author Eddy, Sean R
author_facet Eddy, Sean R
author_sort Eddy, Sean R
collection PubMed
description BACKGROUND: Covariance models (CMs) are probabilistic models of RNA secondary structure, analogous to profile hidden Markov models of linear sequence. The dynamic programming algorithm for aligning a CM to an RNA sequence of length N is O(N(3)) in memory. This is only practical for small RNAs. RESULTS: I describe a divide and conquer variant of the alignment algorithm that is analogous to memory-efficient Myers/Miller dynamic programming algorithms for linear sequence alignment. The new algorithm has an O(N(2) log N) memory complexity, at the expense of a small constant factor in time. CONCLUSIONS: Optimal ribosomal RNA structural alignments that previously required up to 150 GB of memory now require less than 270 MB.
format Text
id pubmed-119854
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1198542002-09-04 A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure Eddy, Sean R BMC Bioinformatics Research article BACKGROUND: Covariance models (CMs) are probabilistic models of RNA secondary structure, analogous to profile hidden Markov models of linear sequence. The dynamic programming algorithm for aligning a CM to an RNA sequence of length N is O(N(3)) in memory. This is only practical for small RNAs. RESULTS: I describe a divide and conquer variant of the alignment algorithm that is analogous to memory-efficient Myers/Miller dynamic programming algorithms for linear sequence alignment. The new algorithm has an O(N(2) log N) memory complexity, at the expense of a small constant factor in time. CONCLUSIONS: Optimal ribosomal RNA structural alignments that previously required up to 150 GB of memory now require less than 270 MB. BioMed Central 2002-07-02 /pmc/articles/PMC119854/ /pubmed/12095421 http://dx.doi.org/10.1186/1471-2105-3-18 Text en Copyright ©2002 Eddy; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Research article
Eddy, Sean R
A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure
title A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure
title_full A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure
title_fullStr A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure
title_full_unstemmed A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure
title_short A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure
title_sort memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an rna secondary structure
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC119854/
https://www.ncbi.nlm.nih.gov/pubmed/12095421
http://dx.doi.org/10.1186/1471-2105-3-18
work_keys_str_mv AT eddyseanr amemoryefficientdynamicprogrammingalgorithmforoptimalalignmentofasequencetoanrnasecondarystructure
AT eddyseanr memoryefficientdynamicprogrammingalgorithmforoptimalalignmentofasequencetoanrnasecondarystructure