Cargando…
SeqFold: Genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data
We present an integrative approach, SeqFold, that combines high-throughput RNA structure profiling data with computational prediction for genome-scale reconstruction of RNA secondary structures. SeqFold transforms experimental RNA structure information into a structure preference profile (SPP) and u...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561878/ https://www.ncbi.nlm.nih.gov/pubmed/23064747 http://dx.doi.org/10.1101/gr.138545.112 |
_version_ | 1782258013071998976 |
---|---|
author | Ouyang, Zhengqing Snyder, Michael P. Chang, Howard Y. |
author_facet | Ouyang, Zhengqing Snyder, Michael P. Chang, Howard Y. |
author_sort | Ouyang, Zhengqing |
collection | PubMed |
description | We present an integrative approach, SeqFold, that combines high-throughput RNA structure profiling data with computational prediction for genome-scale reconstruction of RNA secondary structures. SeqFold transforms experimental RNA structure information into a structure preference profile (SPP) and uses it to select stable RNA structure candidates representing the structure ensemble. Under a high-dimensional classification framework, SeqFold efficiently matches a given SPP to the most likely cluster of structures sampled from the Boltzmann-weighted ensemble. SeqFold is able to incorporate diverse types of RNA structure profiling data, including parallel analysis of RNA structure (PARS), selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq), fragmentation sequencing (FragSeq) data generated by deep sequencing, and conventional SHAPE data. Using the known structures of a wide range of mRNAs and noncoding RNAs as benchmarks, we demonstrate that SeqFold outperforms or matches existing approaches in accuracy and is more robust to noise in experimental data. Application of SeqFold to reconstruct the secondary structures of the yeast transcriptome reveals the diverse impact of RNA secondary structure on gene regulation, including translation efficiency, transcription initiation, and protein-RNA interactions. SeqFold can be easily adapted to incorporate any new types of high-throughput RNA structure profiling data and is widely applicable to analyze RNA structures in any transcriptome. |
format | Online Article Text |
id | pubmed-3561878 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-35618782013-04-03 SeqFold: Genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data Ouyang, Zhengqing Snyder, Michael P. Chang, Howard Y. Genome Res Method We present an integrative approach, SeqFold, that combines high-throughput RNA structure profiling data with computational prediction for genome-scale reconstruction of RNA secondary structures. SeqFold transforms experimental RNA structure information into a structure preference profile (SPP) and uses it to select stable RNA structure candidates representing the structure ensemble. Under a high-dimensional classification framework, SeqFold efficiently matches a given SPP to the most likely cluster of structures sampled from the Boltzmann-weighted ensemble. SeqFold is able to incorporate diverse types of RNA structure profiling data, including parallel analysis of RNA structure (PARS), selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq), fragmentation sequencing (FragSeq) data generated by deep sequencing, and conventional SHAPE data. Using the known structures of a wide range of mRNAs and noncoding RNAs as benchmarks, we demonstrate that SeqFold outperforms or matches existing approaches in accuracy and is more robust to noise in experimental data. Application of SeqFold to reconstruct the secondary structures of the yeast transcriptome reveals the diverse impact of RNA secondary structure on gene regulation, including translation efficiency, transcription initiation, and protein-RNA interactions. SeqFold can be easily adapted to incorporate any new types of high-throughput RNA structure profiling data and is widely applicable to analyze RNA structures in any transcriptome. Cold Spring Harbor Laboratory Press 2013-02 /pmc/articles/PMC3561878/ /pubmed/23064747 http://dx.doi.org/10.1101/gr.138545.112 Text en © 2013, Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/3.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/. |
spellingShingle | Method Ouyang, Zhengqing Snyder, Michael P. Chang, Howard Y. SeqFold: Genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data |
title | SeqFold: Genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data |
title_full | SeqFold: Genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data |
title_fullStr | SeqFold: Genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data |
title_full_unstemmed | SeqFold: Genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data |
title_short | SeqFold: Genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data |
title_sort | seqfold: genome-scale reconstruction of rna secondary structure integrating high-throughput sequencing data |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561878/ https://www.ncbi.nlm.nih.gov/pubmed/23064747 http://dx.doi.org/10.1101/gr.138545.112 |
work_keys_str_mv | AT ouyangzhengqing seqfoldgenomescalereconstructionofrnasecondarystructureintegratinghighthroughputsequencingdata AT snydermichaelp seqfoldgenomescalereconstructionofrnasecondarystructureintegratinghighthroughputsequencingdata AT changhowardy seqfoldgenomescalereconstructionofrnasecondarystructureintegratinghighthroughputsequencingdata |