Cargando…

A simple, practical and complete O[Image: see text]-time Algorithm for RNA folding using the Four-Russians Speedup

BACKGROUND: The problem of computationally predicting the secondary structure (or folding) of RNA molecules was first introduced more than thirty years ago and yet continues to be an area of active research and development. The basic RNA-folding problem of finding a maximum cardinality, non-crossing...

Descripción completa

Detalles Bibliográficos
Autores principales: Frid, Yelena, Gusfield, Dan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2823755/
https://www.ncbi.nlm.nih.gov/pubmed/20047670
http://dx.doi.org/10.1186/1748-7188-5-13
_version_ 1782177679991111680
author Frid, Yelena
Gusfield, Dan
author_facet Frid, Yelena
Gusfield, Dan
author_sort Frid, Yelena
collection PubMed
description BACKGROUND: The problem of computationally predicting the secondary structure (or folding) of RNA molecules was first introduced more than thirty years ago and yet continues to be an area of active research and development. The basic RNA-folding problem of finding a maximum cardinality, non-crossing, matching of complimentary nucleotides in an RNA sequence of length n, has an O(n(3))-time dynamic programming solution that is widely applied. It is known that an o(n(3)) worst-case time solution is possible, but the published and suggested methods are complex and have not been established to be practical. Significant practical improvements to the original dynamic programming method have been introduced, but they retain the O(n(3)) worst-case time bound when n is the only problem-parameter used in the bound. Surprisingly, the most widely-used, general technique to achieve a worst-case (and often practical) speed up of dynamic programming, the Four-Russians technique, has not been previously applied to the RNA-folding problem. This is perhaps due to technical issues in adapting the technique to RNA-folding. RESULTS: In this paper, we give a simple, complete, and practical Four-Russians algorithm for the basic RNA-folding problem, achieving a worst-case time-bound of O(n(3)/log(n)). CONCLUSIONS: We show that this time-bound can also be obtained for richer nucleotide matching scoring-schemes, and that the method achieves consistent speed-ups in practice. The contribution is both theoretical and practical, since the basic RNA-folding problem is often solved multiple times in the inner-loop of more complex algorithms, and for long RNA molecules in the study of RNA virus genomes.
format Text
id pubmed-2823755
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28237552010-02-18 A simple, practical and complete O[Image: see text]-time Algorithm for RNA folding using the Four-Russians Speedup Frid, Yelena Gusfield, Dan Algorithms Mol Biol Research BACKGROUND: The problem of computationally predicting the secondary structure (or folding) of RNA molecules was first introduced more than thirty years ago and yet continues to be an area of active research and development. The basic RNA-folding problem of finding a maximum cardinality, non-crossing, matching of complimentary nucleotides in an RNA sequence of length n, has an O(n(3))-time dynamic programming solution that is widely applied. It is known that an o(n(3)) worst-case time solution is possible, but the published and suggested methods are complex and have not been established to be practical. Significant practical improvements to the original dynamic programming method have been introduced, but they retain the O(n(3)) worst-case time bound when n is the only problem-parameter used in the bound. Surprisingly, the most widely-used, general technique to achieve a worst-case (and often practical) speed up of dynamic programming, the Four-Russians technique, has not been previously applied to the RNA-folding problem. This is perhaps due to technical issues in adapting the technique to RNA-folding. RESULTS: In this paper, we give a simple, complete, and practical Four-Russians algorithm for the basic RNA-folding problem, achieving a worst-case time-bound of O(n(3)/log(n)). CONCLUSIONS: We show that this time-bound can also be obtained for richer nucleotide matching scoring-schemes, and that the method achieves consistent speed-ups in practice. The contribution is both theoretical and practical, since the basic RNA-folding problem is often solved multiple times in the inner-loop of more complex algorithms, and for long RNA molecules in the study of RNA virus genomes. BioMed Central 2010-01-04 /pmc/articles/PMC2823755/ /pubmed/20047670 http://dx.doi.org/10.1186/1748-7188-5-13 Text en Copyright ©2010 Frid and Gusfield; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Frid, Yelena
Gusfield, Dan
A simple, practical and complete O[Image: see text]-time Algorithm for RNA folding using the Four-Russians Speedup
title A simple, practical and complete O[Image: see text]-time Algorithm for RNA folding using the Four-Russians Speedup
title_full A simple, practical and complete O[Image: see text]-time Algorithm for RNA folding using the Four-Russians Speedup
title_fullStr A simple, practical and complete O[Image: see text]-time Algorithm for RNA folding using the Four-Russians Speedup
title_full_unstemmed A simple, practical and complete O[Image: see text]-time Algorithm for RNA folding using the Four-Russians Speedup
title_short A simple, practical and complete O[Image: see text]-time Algorithm for RNA folding using the Four-Russians Speedup
title_sort simple, practical and complete o[image: see text]-time algorithm for rna folding using the four-russians speedup
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2823755/
https://www.ncbi.nlm.nih.gov/pubmed/20047670
http://dx.doi.org/10.1186/1748-7188-5-13
work_keys_str_mv AT fridyelena asimplepracticalandcompleteoimageseetexttimealgorithmforrnafoldingusingthefourrussiansspeedup
AT gusfielddan asimplepracticalandcompleteoimageseetexttimealgorithmforrnafoldingusingthefourrussiansspeedup
AT fridyelena simplepracticalandcompleteoimageseetexttimealgorithmforrnafoldingusingthefourrussiansspeedup
AT gusfielddan simplepracticalandcompleteoimageseetexttimealgorithmforrnafoldingusingthefourrussiansspeedup