Cargando…
RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application
The accuracy of RNA secondary structure prediction decreases with the span of a base pair, i.e., the number of nucleotides that it encloses. The dynamic programming algorithms for RNA folding can be easily specialized in order to consider only base pairs with a limited span L, reducing the memory re...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7823788/ https://www.ncbi.nlm.nih.gov/pubmed/33374382 http://dx.doi.org/10.3390/genes12010014 |
_version_ | 1783639919527198720 |
---|---|
author | Lorenz, Ronny Stadler, Peter F. |
author_facet | Lorenz, Ronny Stadler, Peter F. |
author_sort | Lorenz, Ronny |
collection | PubMed |
description | The accuracy of RNA secondary structure prediction decreases with the span of a base pair, i.e., the number of nucleotides that it encloses. The dynamic programming algorithms for RNA folding can be easily specialized in order to consider only base pairs with a limited span L, reducing the memory requirements to [Formula: see text] , and further to [Formula: see text] by interleaving backtracking. However, the latter is an approximation that precludes the retrieval of the globally optimal structure. So far, the ViennaRNA package therefore does not provide a tool for computing optimal, span-restricted minimum energy structure. Here, we report on an efficient backtracking algorithm that reconstructs the globally optimal structure from the locally optimal fragments that are produced by the interleaved backtracking implemented in RNALfold. An implementation is integrated into the ViennaRNA package. The forward and the backtracking recursions of RNALfold are both easily constrained to structural components with a sufficiently negative z-scores. This provides a convenient method in order to identify hyper-stable structural elements. A screen of the C. elegans genome shows that such features are more abundant in real genomic sequences when compared to a di-nucleotide shuffled background model. |
format | Online Article Text |
id | pubmed-7823788 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-78237882021-01-24 RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application Lorenz, Ronny Stadler, Peter F. Genes (Basel) Article The accuracy of RNA secondary structure prediction decreases with the span of a base pair, i.e., the number of nucleotides that it encloses. The dynamic programming algorithms for RNA folding can be easily specialized in order to consider only base pairs with a limited span L, reducing the memory requirements to [Formula: see text] , and further to [Formula: see text] by interleaving backtracking. However, the latter is an approximation that precludes the retrieval of the globally optimal structure. So far, the ViennaRNA package therefore does not provide a tool for computing optimal, span-restricted minimum energy structure. Here, we report on an efficient backtracking algorithm that reconstructs the globally optimal structure from the locally optimal fragments that are produced by the interleaved backtracking implemented in RNALfold. An implementation is integrated into the ViennaRNA package. The forward and the backtracking recursions of RNALfold are both easily constrained to structural components with a sufficiently negative z-scores. This provides a convenient method in order to identify hyper-stable structural elements. A screen of the C. elegans genome shows that such features are more abundant in real genomic sequences when compared to a di-nucleotide shuffled background model. MDPI 2020-12-24 /pmc/articles/PMC7823788/ /pubmed/33374382 http://dx.doi.org/10.3390/genes12010014 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lorenz, Ronny Stadler, Peter F. RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application |
title | RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application |
title_full | RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application |
title_fullStr | RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application |
title_full_unstemmed | RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application |
title_short | RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application |
title_sort | rna secondary structures with limited base pair span: exact backtracking and an application |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7823788/ https://www.ncbi.nlm.nih.gov/pubmed/33374382 http://dx.doi.org/10.3390/genes12010014 |
work_keys_str_mv | AT lorenzronny rnasecondarystructureswithlimitedbasepairspanexactbacktrackingandanapplication AT stadlerpeterf rnasecondarystructureswithlimitedbasepairspanexactbacktrackingandanapplication |