Cargando…

RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences

Motivation: RNA secondary structure plays an important role in the function of many RNAs, and structural features are often key to their interaction with other cellular components. Thus, there has been considerable interest in the prediction of secondary structures for RNA families. In this article,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Donglai, Alpert, Lauren V., Lawrence, Charles E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3167047/
https://www.ncbi.nlm.nih.gov/pubmed/21788211
http://dx.doi.org/10.1093/bioinformatics/btr421
_version_ 1782211217001021440
author Wei, Donglai
Alpert, Lauren V.
Lawrence, Charles E.
author_facet Wei, Donglai
Alpert, Lauren V.
Lawrence, Charles E.
author_sort Wei, Donglai
collection PubMed
description Motivation: RNA secondary structure plays an important role in the function of many RNAs, and structural features are often key to their interaction with other cellular components. Thus, there has been considerable interest in the prediction of secondary structures for RNA families. In this article, we present a new global structural alignment algorithm, RNAG, to predict consensus secondary structures for unaligned sequences. It uses a blocked Gibbs sampling algorithm, which has a theoretical advantage in convergence time. This algorithm iteratively samples from the conditional probability distributions P(Structure | Alignment) and P(Alignment | Structure). Not surprisingly, there is considerable uncertainly in the high-dimensional space of this difficult problem, which has so far received limited attention in this field. We show how the samples drawn from this algorithm can be used to more fully characterize the posterior space and to assess the uncertainty of predictions. Results: Our analysis of three publically available datasets showed a substantial improvement in RNA structure prediction by RNAG over extant prediction methods. Additionally, our analysis of 17 RNA families showed that the RNAG sampled structures were generally compact around their ensemble centroids, and at least 11 families had at least two well-separated clusters of predicted structures. In general, the distance between a reference structure and our predicted structure was large relative to the variation among structures within an ensemble. Availability: The Perl implementation of the RNAG algorithm and the data necessary to reproduce the results described in Sections 3.1 and 3.2 are available at http://ccmbweb.ccv.brown.edu/rnag.html Contact: charles_lawrence@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3167047
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31670472011-09-06 RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences Wei, Donglai Alpert, Lauren V. Lawrence, Charles E. Bioinformatics Original Papers Motivation: RNA secondary structure plays an important role in the function of many RNAs, and structural features are often key to their interaction with other cellular components. Thus, there has been considerable interest in the prediction of secondary structures for RNA families. In this article, we present a new global structural alignment algorithm, RNAG, to predict consensus secondary structures for unaligned sequences. It uses a blocked Gibbs sampling algorithm, which has a theoretical advantage in convergence time. This algorithm iteratively samples from the conditional probability distributions P(Structure | Alignment) and P(Alignment | Structure). Not surprisingly, there is considerable uncertainly in the high-dimensional space of this difficult problem, which has so far received limited attention in this field. We show how the samples drawn from this algorithm can be used to more fully characterize the posterior space and to assess the uncertainty of predictions. Results: Our analysis of three publically available datasets showed a substantial improvement in RNA structure prediction by RNAG over extant prediction methods. Additionally, our analysis of 17 RNA families showed that the RNAG sampled structures were generally compact around their ensemble centroids, and at least 11 families had at least two well-separated clusters of predicted structures. In general, the distance between a reference structure and our predicted structure was large relative to the variation among structures within an ensemble. Availability: The Perl implementation of the RNAG algorithm and the data necessary to reproduce the results described in Sections 3.1 and 3.2 are available at http://ccmbweb.ccv.brown.edu/rnag.html Contact: charles_lawrence@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-09-15 2011-07-24 /pmc/articles/PMC3167047/ /pubmed/21788211 http://dx.doi.org/10.1093/bioinformatics/btr421 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Wei, Donglai
Alpert, Lauren V.
Lawrence, Charles E.
RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences
title RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences
title_full RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences
title_fullStr RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences
title_full_unstemmed RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences
title_short RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences
title_sort rnag: a new gibbs sampler for predicting rna secondary structure for unaligned sequences
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3167047/
https://www.ncbi.nlm.nih.gov/pubmed/21788211
http://dx.doi.org/10.1093/bioinformatics/btr421
work_keys_str_mv AT weidonglai rnaganewgibbssamplerforpredictingrnasecondarystructureforunalignedsequences
AT alpertlaurenv rnaganewgibbssamplerforpredictingrnasecondarystructureforunalignedsequences
AT lawrencecharlese rnaganewgibbssamplerforpredictingrnasecondarystructureforunalignedsequences