Cargando…

Maximum expected accuracy structural neighbors of an RNA secondary structure

BACKGROUND: Since RNA molecules regulate genes and control alternative splicing by allostery, it is important to develop algorithms to predict RNA conformational switches. Some tools, such as paRNAss, RNAshapes and RNAbor, can be used to predict potential conformational switches; nevertheless, no ex...

Descripción completa

Detalles Bibliográficos
Autores principales: Clote, Peter, Lou, Feng, Lorenz, William A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3358666/
https://www.ncbi.nlm.nih.gov/pubmed/22537010
http://dx.doi.org/10.1186/1471-2105-13-S5-S6
_version_ 1782233796926504960
author Clote, Peter
Lou, Feng
Lorenz, William A
author_facet Clote, Peter
Lou, Feng
Lorenz, William A
author_sort Clote, Peter
collection PubMed
description BACKGROUND: Since RNA molecules regulate genes and control alternative splicing by allostery, it is important to develop algorithms to predict RNA conformational switches. Some tools, such as paRNAss, RNAshapes and RNAbor, can be used to predict potential conformational switches; nevertheless, no existent tool can detect general (i.e., not family specific) entire riboswitches (both aptamer and expression platform) with accuracy. Thus, the development of additional algorithms to detect conformational switches seems important, especially since the difference in free energy between the two metastable secondary structures may be as large as 15-20 kcal/mol. It has recently emerged that RNA secondary structure can be more accurately predicted by computing the maximum expected accuracy (MEA) structure, rather than the minimum free energy (MFE) structure. RESULTS: Given an arbitrary RNA secondary structure S(0 )for an RNA nucleotide sequence a = a(1),..., a(n), we say that another secondary structure S of a is a k-neighbor of S(0), if the base pair distance between S(0 )and S is k. In this paper, we prove that the Boltzmann probability of all k-neighbors of the minimum free energy structure S(0 )can be approximated with accuracy ε and confidence 1 - p, simultaneously for all 0 ≤ k < K, by a relative frequency count over N sampled structures, provided that [Formula: see text] , where Φ(z) is the cumulative distribution function (CDF) for the standard normal distribution. We go on to describe the algorithm RNAborMEA, which for an arbitrary initial structure S(0 )and for all values 0 ≤ k < K, computes the secondary structure MEA(k), having maximum expected accuracy over all k-neighbors of S(0). Computation time is O(n(3 )· K(2)), and memory requirements are O(n(2 )· K). We analyze a sample TPP riboswitch, and apply our algorithm to the class of purine riboswitches. CONCLUSIONS: The approximation of RNAbor by sampling, with rigorous bound on accuracy, together with the computation of maximum expected accuracy k-neighbors by RNAborMEA, provide additional tools toward conformational switch detection. Results from RNAborMEA are quite distinct from other tools, such as RNAbor, RNAshapes and paRNAss, hence may provide orthogonal information when looking for suboptimal structures or conformational switches. Source code for RNAborMEA can be downloaded from http://sourceforge.net/projects/rnabormea/ or http://bioinformatics.bc.edu/clotelab/RNAborMEA/.
format Online
Article
Text
id pubmed-3358666
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33586662012-05-31 Maximum expected accuracy structural neighbors of an RNA secondary structure Clote, Peter Lou, Feng Lorenz, William A BMC Bioinformatics Research BACKGROUND: Since RNA molecules regulate genes and control alternative splicing by allostery, it is important to develop algorithms to predict RNA conformational switches. Some tools, such as paRNAss, RNAshapes and RNAbor, can be used to predict potential conformational switches; nevertheless, no existent tool can detect general (i.e., not family specific) entire riboswitches (both aptamer and expression platform) with accuracy. Thus, the development of additional algorithms to detect conformational switches seems important, especially since the difference in free energy between the two metastable secondary structures may be as large as 15-20 kcal/mol. It has recently emerged that RNA secondary structure can be more accurately predicted by computing the maximum expected accuracy (MEA) structure, rather than the minimum free energy (MFE) structure. RESULTS: Given an arbitrary RNA secondary structure S(0 )for an RNA nucleotide sequence a = a(1),..., a(n), we say that another secondary structure S of a is a k-neighbor of S(0), if the base pair distance between S(0 )and S is k. In this paper, we prove that the Boltzmann probability of all k-neighbors of the minimum free energy structure S(0 )can be approximated with accuracy ε and confidence 1 - p, simultaneously for all 0 ≤ k < K, by a relative frequency count over N sampled structures, provided that [Formula: see text] , where Φ(z) is the cumulative distribution function (CDF) for the standard normal distribution. We go on to describe the algorithm RNAborMEA, which for an arbitrary initial structure S(0 )and for all values 0 ≤ k < K, computes the secondary structure MEA(k), having maximum expected accuracy over all k-neighbors of S(0). Computation time is O(n(3 )· K(2)), and memory requirements are O(n(2 )· K). We analyze a sample TPP riboswitch, and apply our algorithm to the class of purine riboswitches. CONCLUSIONS: The approximation of RNAbor by sampling, with rigorous bound on accuracy, together with the computation of maximum expected accuracy k-neighbors by RNAborMEA, provide additional tools toward conformational switch detection. Results from RNAborMEA are quite distinct from other tools, such as RNAbor, RNAshapes and paRNAss, hence may provide orthogonal information when looking for suboptimal structures or conformational switches. Source code for RNAborMEA can be downloaded from http://sourceforge.net/projects/rnabormea/ or http://bioinformatics.bc.edu/clotelab/RNAborMEA/. BioMed Central 2012-04-12 /pmc/articles/PMC3358666/ /pubmed/22537010 http://dx.doi.org/10.1186/1471-2105-13-S5-S6 Text en Copyright ©2012 Clote et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Clote, Peter
Lou, Feng
Lorenz, William A
Maximum expected accuracy structural neighbors of an RNA secondary structure
title Maximum expected accuracy structural neighbors of an RNA secondary structure
title_full Maximum expected accuracy structural neighbors of an RNA secondary structure
title_fullStr Maximum expected accuracy structural neighbors of an RNA secondary structure
title_full_unstemmed Maximum expected accuracy structural neighbors of an RNA secondary structure
title_short Maximum expected accuracy structural neighbors of an RNA secondary structure
title_sort maximum expected accuracy structural neighbors of an rna secondary structure
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3358666/
https://www.ncbi.nlm.nih.gov/pubmed/22537010
http://dx.doi.org/10.1186/1471-2105-13-S5-S6
work_keys_str_mv AT clotepeter maximumexpectedaccuracystructuralneighborsofanrnasecondarystructure
AT loufeng maximumexpectedaccuracystructuralneighborsofanrnasecondarystructure
AT lorenzwilliama maximumexpectedaccuracystructuralneighborsofanrnasecondarystructure