Cargando…
Modeling RNA secondary structure folding ensembles using SHAPE mapping data
RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pai...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5758915/ https://www.ncbi.nlm.nih.gov/pubmed/29177466 http://dx.doi.org/10.1093/nar/gkx1057 |
_version_ | 1783291091194216448 |
---|---|
author | Spasic, Aleksandar Assmann, Sarah M Bevilacqua, Philip C Mathews, David H |
author_facet | Spasic, Aleksandar Assmann, Sarah M Bevilacqua, Philip C Mathews, David H |
author_sort | Spasic, Aleksandar |
collection | PubMed |
description | RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pairing status of single nucleotides, and these data can now be acquired for whole transcriptomes using high-throughput sequencing. Prior methods for using these experimental data focused on predicting structures for sequences assuming that they populate a single structure. Most RNAs populate multiple structures, however, where the ensemble of strands populates structures with different sets of canonical base pairs. The focus on modeling single structures has been a bottleneck for accurately modeling RNA structure. In this work, we introduce Rsample, an algorithm for using experimental data to predict more than one RNA structure for sequences that populate multiple structures at equilibrium. We demonstrate, using SHAPE mapping data, that we can accurately model RNA sequences that populate multiple structures, including the relative probabilities of those structures. This program is freely available as part of the RNAstructure software package. |
format | Online Article Text |
id | pubmed-5758915 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-57589152018-01-16 Modeling RNA secondary structure folding ensembles using SHAPE mapping data Spasic, Aleksandar Assmann, Sarah M Bevilacqua, Philip C Mathews, David H Nucleic Acids Res RNA and RNA-protein complexes RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pairing status of single nucleotides, and these data can now be acquired for whole transcriptomes using high-throughput sequencing. Prior methods for using these experimental data focused on predicting structures for sequences assuming that they populate a single structure. Most RNAs populate multiple structures, however, where the ensemble of strands populates structures with different sets of canonical base pairs. The focus on modeling single structures has been a bottleneck for accurately modeling RNA structure. In this work, we introduce Rsample, an algorithm for using experimental data to predict more than one RNA structure for sequences that populate multiple structures at equilibrium. We demonstrate, using SHAPE mapping data, that we can accurately model RNA sequences that populate multiple structures, including the relative probabilities of those structures. This program is freely available as part of the RNAstructure software package. Oxford University Press 2018-01-09 2017-11-21 /pmc/articles/PMC5758915/ /pubmed/29177466 http://dx.doi.org/10.1093/nar/gkx1057 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | RNA and RNA-protein complexes Spasic, Aleksandar Assmann, Sarah M Bevilacqua, Philip C Mathews, David H Modeling RNA secondary structure folding ensembles using SHAPE mapping data |
title | Modeling RNA secondary structure folding ensembles using SHAPE mapping data |
title_full | Modeling RNA secondary structure folding ensembles using SHAPE mapping data |
title_fullStr | Modeling RNA secondary structure folding ensembles using SHAPE mapping data |
title_full_unstemmed | Modeling RNA secondary structure folding ensembles using SHAPE mapping data |
title_short | Modeling RNA secondary structure folding ensembles using SHAPE mapping data |
title_sort | modeling rna secondary structure folding ensembles using shape mapping data |
topic | RNA and RNA-protein complexes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5758915/ https://www.ncbi.nlm.nih.gov/pubmed/29177466 http://dx.doi.org/10.1093/nar/gkx1057 |
work_keys_str_mv | AT spasicaleksandar modelingrnasecondarystructurefoldingensemblesusingshapemappingdata AT assmannsarahm modelingrnasecondarystructurefoldingensemblesusingshapemappingdata AT bevilacquaphilipc modelingrnasecondarystructurefoldingensemblesusingshapemappingdata AT mathewsdavidh modelingrnasecondarystructurefoldingensemblesusingshapemappingdata |