Cargando…

RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application

The accuracy of RNA secondary structure prediction decreases with the span of a base pair, i.e., the number of nucleotides that it encloses. The dynamic programming algorithms for RNA folding can be easily specialized in order to consider only base pairs with a limited span L, reducing the memory re...

Descripción completa

Detalles Bibliográficos
Autores principales: Lorenz, Ronny, Stadler, Peter F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7823788/
https://www.ncbi.nlm.nih.gov/pubmed/33374382
http://dx.doi.org/10.3390/genes12010014
_version_ 1783639919527198720
author Lorenz, Ronny
Stadler, Peter F.
author_facet Lorenz, Ronny
Stadler, Peter F.
author_sort Lorenz, Ronny
collection PubMed
description The accuracy of RNA secondary structure prediction decreases with the span of a base pair, i.e., the number of nucleotides that it encloses. The dynamic programming algorithms for RNA folding can be easily specialized in order to consider only base pairs with a limited span L, reducing the memory requirements to [Formula: see text] , and further to [Formula: see text] by interleaving backtracking. However, the latter is an approximation that precludes the retrieval of the globally optimal structure. So far, the ViennaRNA package therefore does not provide a tool for computing optimal, span-restricted minimum energy structure. Here, we report on an efficient backtracking algorithm that reconstructs the globally optimal structure from the locally optimal fragments that are produced by the interleaved backtracking implemented in RNALfold. An implementation is integrated into the ViennaRNA package. The forward and the backtracking recursions of RNALfold are both easily constrained to structural components with a sufficiently negative z-scores. This provides a convenient method in order to identify hyper-stable structural elements. A screen of the C. elegans genome shows that such features are more abundant in real genomic sequences when compared to a di-nucleotide shuffled background model.
format Online
Article
Text
id pubmed-7823788
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-78237882021-01-24 RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application Lorenz, Ronny Stadler, Peter F. Genes (Basel) Article The accuracy of RNA secondary structure prediction decreases with the span of a base pair, i.e., the number of nucleotides that it encloses. The dynamic programming algorithms for RNA folding can be easily specialized in order to consider only base pairs with a limited span L, reducing the memory requirements to [Formula: see text] , and further to [Formula: see text] by interleaving backtracking. However, the latter is an approximation that precludes the retrieval of the globally optimal structure. So far, the ViennaRNA package therefore does not provide a tool for computing optimal, span-restricted minimum energy structure. Here, we report on an efficient backtracking algorithm that reconstructs the globally optimal structure from the locally optimal fragments that are produced by the interleaved backtracking implemented in RNALfold. An implementation is integrated into the ViennaRNA package. The forward and the backtracking recursions of RNALfold are both easily constrained to structural components with a sufficiently negative z-scores. This provides a convenient method in order to identify hyper-stable structural elements. A screen of the C. elegans genome shows that such features are more abundant in real genomic sequences when compared to a di-nucleotide shuffled background model. MDPI 2020-12-24 /pmc/articles/PMC7823788/ /pubmed/33374382 http://dx.doi.org/10.3390/genes12010014 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lorenz, Ronny
Stadler, Peter F.
RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application
title RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application
title_full RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application
title_fullStr RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application
title_full_unstemmed RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application
title_short RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application
title_sort rna secondary structures with limited base pair span: exact backtracking and an application
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7823788/
https://www.ncbi.nlm.nih.gov/pubmed/33374382
http://dx.doi.org/10.3390/genes12010014
work_keys_str_mv AT lorenzronny rnasecondarystructureswithlimitedbasepairspanexactbacktrackingandanapplication
AT stadlerpeterf rnasecondarystructureswithlimitedbasepairspanexactbacktrackingandanapplication