Cargando…

Structure-based whole-genome realignment reveals many novel noncoding RNAs

Recent genome-wide computational screens that search for conservation of RNA secondary structure in whole-genome alignments (WGAs) have predicted thousands of structural noncoding RNAs (ncRNAs). The sensitivity of such approaches, however, is limited, due to their reliance on sequence-based whole-ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Will, Sebastian, Yu, Michael, Berger, Bonnie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668356/
https://www.ncbi.nlm.nih.gov/pubmed/23296921
http://dx.doi.org/10.1101/gr.137091.111
_version_ 1782271617068433408
author Will, Sebastian
Yu, Michael
Berger, Bonnie
author_facet Will, Sebastian
Yu, Michael
Berger, Bonnie
author_sort Will, Sebastian
collection PubMed
description Recent genome-wide computational screens that search for conservation of RNA secondary structure in whole-genome alignments (WGAs) have predicted thousands of structural noncoding RNAs (ncRNAs). The sensitivity of such approaches, however, is limited, due to their reliance on sequence-based whole-genome aligners, which regularly misalign structural ncRNAs. This suggests that many more structural ncRNAs may remain undetected. Structure-based alignment, which could increase the sensitivity, has been prohibitive for genome-wide screens due to its extreme computational costs. Breaking this barrier, we present the pipeline REAPR (RE-Alignment for Prediction of structural ncRNA), which efficiently realigns whole genomes based on RNA sequence and structure, thus allowing us to boost the performance of de novo ncRNA predictors, such as RNAz. Key to the pipeline's efficiency is the development of a novel banding technique for multiple RNA alignment. REAPR significantly outperforms the widely used predictors RNAz and EvoFold in genome-wide screens; in direct comparison to the most recent RNAz screen on D. melanogaster, REAPR predicts twice as many high-confidence ncRNA candidates. Moreover, modENCODE RNA-seq experiments confirm a substantial number of its predictions as transcripts. REAPR's advancement of de novo structural characterization of ncRNAs complements the identification of transcripts from rapidly accumulating RNA-seq data.
format Online
Article
Text
id pubmed-3668356
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-36683562013-06-07 Structure-based whole-genome realignment reveals many novel noncoding RNAs Will, Sebastian Yu, Michael Berger, Bonnie Genome Res Method Recent genome-wide computational screens that search for conservation of RNA secondary structure in whole-genome alignments (WGAs) have predicted thousands of structural noncoding RNAs (ncRNAs). The sensitivity of such approaches, however, is limited, due to their reliance on sequence-based whole-genome aligners, which regularly misalign structural ncRNAs. This suggests that many more structural ncRNAs may remain undetected. Structure-based alignment, which could increase the sensitivity, has been prohibitive for genome-wide screens due to its extreme computational costs. Breaking this barrier, we present the pipeline REAPR (RE-Alignment for Prediction of structural ncRNA), which efficiently realigns whole genomes based on RNA sequence and structure, thus allowing us to boost the performance of de novo ncRNA predictors, such as RNAz. Key to the pipeline's efficiency is the development of a novel banding technique for multiple RNA alignment. REAPR significantly outperforms the widely used predictors RNAz and EvoFold in genome-wide screens; in direct comparison to the most recent RNAz screen on D. melanogaster, REAPR predicts twice as many high-confidence ncRNA candidates. Moreover, modENCODE RNA-seq experiments confirm a substantial number of its predictions as transcripts. REAPR's advancement of de novo structural characterization of ncRNAs complements the identification of transcripts from rapidly accumulating RNA-seq data. Cold Spring Harbor Laboratory Press 2013-06 /pmc/articles/PMC3668356/ /pubmed/23296921 http://dx.doi.org/10.1101/gr.137091.111 Text en © 2013, Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/3.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/.
spellingShingle Method
Will, Sebastian
Yu, Michael
Berger, Bonnie
Structure-based whole-genome realignment reveals many novel noncoding RNAs
title Structure-based whole-genome realignment reveals many novel noncoding RNAs
title_full Structure-based whole-genome realignment reveals many novel noncoding RNAs
title_fullStr Structure-based whole-genome realignment reveals many novel noncoding RNAs
title_full_unstemmed Structure-based whole-genome realignment reveals many novel noncoding RNAs
title_short Structure-based whole-genome realignment reveals many novel noncoding RNAs
title_sort structure-based whole-genome realignment reveals many novel noncoding rnas
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668356/
https://www.ncbi.nlm.nih.gov/pubmed/23296921
http://dx.doi.org/10.1101/gr.137091.111
work_keys_str_mv AT willsebastian structurebasedwholegenomerealignmentrevealsmanynovelnoncodingrnas
AT yumichael structurebasedwholegenomerealignmentrevealsmanynovelnoncodingrnas
AT bergerbonnie structurebasedwholegenomerealignmentrevealsmanynovelnoncodingrnas