Cargando…
RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences
Motivation: RNA secondary structure plays an important role in the function of many RNAs, and structural features are often key to their interaction with other cellular components. Thus, there has been considerable interest in the prediction of secondary structures for RNA families. In this article,...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3167047/ https://www.ncbi.nlm.nih.gov/pubmed/21788211 http://dx.doi.org/10.1093/bioinformatics/btr421 |
_version_ | 1782211217001021440 |
---|---|
author | Wei, Donglai Alpert, Lauren V. Lawrence, Charles E. |
author_facet | Wei, Donglai Alpert, Lauren V. Lawrence, Charles E. |
author_sort | Wei, Donglai |
collection | PubMed |
description | Motivation: RNA secondary structure plays an important role in the function of many RNAs, and structural features are often key to their interaction with other cellular components. Thus, there has been considerable interest in the prediction of secondary structures for RNA families. In this article, we present a new global structural alignment algorithm, RNAG, to predict consensus secondary structures for unaligned sequences. It uses a blocked Gibbs sampling algorithm, which has a theoretical advantage in convergence time. This algorithm iteratively samples from the conditional probability distributions P(Structure | Alignment) and P(Alignment | Structure). Not surprisingly, there is considerable uncertainly in the high-dimensional space of this difficult problem, which has so far received limited attention in this field. We show how the samples drawn from this algorithm can be used to more fully characterize the posterior space and to assess the uncertainty of predictions. Results: Our analysis of three publically available datasets showed a substantial improvement in RNA structure prediction by RNAG over extant prediction methods. Additionally, our analysis of 17 RNA families showed that the RNAG sampled structures were generally compact around their ensemble centroids, and at least 11 families had at least two well-separated clusters of predicted structures. In general, the distance between a reference structure and our predicted structure was large relative to the variation among structures within an ensemble. Availability: The Perl implementation of the RNAG algorithm and the data necessary to reproduce the results described in Sections 3.1 and 3.2 are available at http://ccmbweb.ccv.brown.edu/rnag.html Contact: charles_lawrence@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3167047 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-31670472011-09-06 RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences Wei, Donglai Alpert, Lauren V. Lawrence, Charles E. Bioinformatics Original Papers Motivation: RNA secondary structure plays an important role in the function of many RNAs, and structural features are often key to their interaction with other cellular components. Thus, there has been considerable interest in the prediction of secondary structures for RNA families. In this article, we present a new global structural alignment algorithm, RNAG, to predict consensus secondary structures for unaligned sequences. It uses a blocked Gibbs sampling algorithm, which has a theoretical advantage in convergence time. This algorithm iteratively samples from the conditional probability distributions P(Structure | Alignment) and P(Alignment | Structure). Not surprisingly, there is considerable uncertainly in the high-dimensional space of this difficult problem, which has so far received limited attention in this field. We show how the samples drawn from this algorithm can be used to more fully characterize the posterior space and to assess the uncertainty of predictions. Results: Our analysis of three publically available datasets showed a substantial improvement in RNA structure prediction by RNAG over extant prediction methods. Additionally, our analysis of 17 RNA families showed that the RNAG sampled structures were generally compact around their ensemble centroids, and at least 11 families had at least two well-separated clusters of predicted structures. In general, the distance between a reference structure and our predicted structure was large relative to the variation among structures within an ensemble. Availability: The Perl implementation of the RNAG algorithm and the data necessary to reproduce the results described in Sections 3.1 and 3.2 are available at http://ccmbweb.ccv.brown.edu/rnag.html Contact: charles_lawrence@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-09-15 2011-07-24 /pmc/articles/PMC3167047/ /pubmed/21788211 http://dx.doi.org/10.1093/bioinformatics/btr421 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Wei, Donglai Alpert, Lauren V. Lawrence, Charles E. RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences |
title | RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences |
title_full | RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences |
title_fullStr | RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences |
title_full_unstemmed | RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences |
title_short | RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences |
title_sort | rnag: a new gibbs sampler for predicting rna secondary structure for unaligned sequences |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3167047/ https://www.ncbi.nlm.nih.gov/pubmed/21788211 http://dx.doi.org/10.1093/bioinformatics/btr421 |
work_keys_str_mv | AT weidonglai rnaganewgibbssamplerforpredictingrnasecondarystructureforunalignedsequences AT alpertlaurenv rnaganewgibbssamplerforpredictingrnasecondarystructureforunalignedsequences AT lawrencecharlese rnaganewgibbssamplerforpredictingrnasecondarystructureforunalignedsequences |