Cargando…

Faster computation of exact RNA shape probabilities

Motivation: Abstract shape analysis allows efficient computation of a representative sample of low-energy foldings of an RNA molecule. More comprehensive information is obtained by computing shape probabilities, accumulating the Boltzmann probabilities of all structures within each abstract shape. S...

Descripción completa

Detalles Bibliográficos
Autores principales: Janssen, Stefan, Giegerich, Robert
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828121/
https://www.ncbi.nlm.nih.gov/pubmed/20080511
http://dx.doi.org/10.1093/bioinformatics/btq014
_version_ 1782177994764189696
author Janssen, Stefan
Giegerich, Robert
author_facet Janssen, Stefan
Giegerich, Robert
author_sort Janssen, Stefan
collection PubMed
description Motivation: Abstract shape analysis allows efficient computation of a representative sample of low-energy foldings of an RNA molecule. More comprehensive information is obtained by computing shape probabilities, accumulating the Boltzmann probabilities of all structures within each abstract shape. Such information is superior to free energies because it is independent of sequence length and base composition. However, up to this point, computation of shape probabilities evaluates all shapes simultaneously and comes with a computation cost which is exponential in the length of the sequence. Results: We device an approach called RapidShapes that computes the shapes above a specified probability threshold T by generating a list of promising shapes and constructing specialized folding programs for each shape to compute its share of Boltzmann probability. This aims at a heuristic improvement of runtime, while still computing exact probability values. Conclusion: Evaluating this approach and several substrategies, we find that only a small proportion of shapes have to be actually computed. For an RNA sequence of length 400, this leads, depending on the threshold, to a 10–138 fold speed-up compared with the previous complete method. Thus, probabilistic shape analysis has become feasible in medium-scale applications, such as the screening of RNA transcripts in a bacterial genome. Availability: RapidShapes is available via http://bibiserv.cebitec.uni-bielefeld.de/rnashapes Contact: robert@techfak.uni-bielefeld.de Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2828121
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28281212010-02-25 Faster computation of exact RNA shape probabilities Janssen, Stefan Giegerich, Robert Bioinformatics Original Papers Motivation: Abstract shape analysis allows efficient computation of a representative sample of low-energy foldings of an RNA molecule. More comprehensive information is obtained by computing shape probabilities, accumulating the Boltzmann probabilities of all structures within each abstract shape. Such information is superior to free energies because it is independent of sequence length and base composition. However, up to this point, computation of shape probabilities evaluates all shapes simultaneously and comes with a computation cost which is exponential in the length of the sequence. Results: We device an approach called RapidShapes that computes the shapes above a specified probability threshold T by generating a list of promising shapes and constructing specialized folding programs for each shape to compute its share of Boltzmann probability. This aims at a heuristic improvement of runtime, while still computing exact probability values. Conclusion: Evaluating this approach and several substrategies, we find that only a small proportion of shapes have to be actually computed. For an RNA sequence of length 400, this leads, depending on the threshold, to a 10–138 fold speed-up compared with the previous complete method. Thus, probabilistic shape analysis has become feasible in medium-scale applications, such as the screening of RNA transcripts in a bacterial genome. Availability: RapidShapes is available via http://bibiserv.cebitec.uni-bielefeld.de/rnashapes Contact: robert@techfak.uni-bielefeld.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-03-01 2010-01-14 /pmc/articles/PMC2828121/ /pubmed/20080511 http://dx.doi.org/10.1093/bioinformatics/btq014 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Janssen, Stefan
Giegerich, Robert
Faster computation of exact RNA shape probabilities
title Faster computation of exact RNA shape probabilities
title_full Faster computation of exact RNA shape probabilities
title_fullStr Faster computation of exact RNA shape probabilities
title_full_unstemmed Faster computation of exact RNA shape probabilities
title_short Faster computation of exact RNA shape probabilities
title_sort faster computation of exact rna shape probabilities
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828121/
https://www.ncbi.nlm.nih.gov/pubmed/20080511
http://dx.doi.org/10.1093/bioinformatics/btq014
work_keys_str_mv AT janssenstefan fastercomputationofexactrnashapeprobabilities
AT giegerichrobert fastercomputationofexactrnashapeprobabilities