Cargando…

Computational RNomics of Drosophilids

BACKGROUND: Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function a...

Descripción completa

Detalles Bibliográficos
Autores principales: Rose, Dominic, Hackermüller, Jörg, Washietl, Stefan, Reiche, Kristin, Hertel, Jana, Findeiß, Sven, Stadler, Peter F, Prohaska, Sonja J
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2216035/
https://www.ncbi.nlm.nih.gov/pubmed/17996037
http://dx.doi.org/10.1186/1471-2164-8-406
_version_ 1782149097278406656
author Rose, Dominic
Hackermüller, Jörg
Washietl, Stefan
Reiche, Kristin
Hertel, Jana
Findeiß, Sven
Stadler, Peter F
Prohaska, Sonja J
author_facet Rose, Dominic
Hackermüller, Jörg
Washietl, Stefan
Reiche, Kristin
Hertel, Jana
Findeiß, Sven
Stadler, Peter F
Prohaska, Sonja J
author_sort Rose, Dominic
collection PubMed
description BACKGROUND: Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set of comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz. RESULTS: We obtain 16 000 high quality predictions among which we recover the majority of the known ncRNAs. Taking a pessimistically estimated false discovery rate of 40% into account, this implies that at least some ten thousand loci in the Drosophila genome show the hallmarks of stabilizing selection action of RNA structure, and hence are most likely functional at the RNA level. A subset of RNAz predictions overlapping with TRF1 and BRF binding sites [Isogai et al., EMBO J. 26: 79–89 (2007)], which are plausible candidates of Pol III transcripts, have been studied in more detail. Among these sequences we identify several "clusters" of ncRNA candidates with striking structural similarities. CONCLUSION: The statistical evaluation of the RNAz predictions in comparison with a similar analysis of vertebrate genomes [Washietl et al., Nat. Biotech. 23: 1383–1390 (2005)] shows that qualitatively similar fractions of structured RNAs are found in introns, UTRs, and intergenic regions. The intergenic RNA structures, however, are concentrated much more closely around known protein-coding loci, suggesting that flies have significantly smaller complement of independent structured ncRNAs compared to mammals.
format Text
id pubmed-2216035
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22160352008-01-29 Computational RNomics of Drosophilids Rose, Dominic Hackermüller, Jörg Washietl, Stefan Reiche, Kristin Hertel, Jana Findeiß, Sven Stadler, Peter F Prohaska, Sonja J BMC Genomics Research Article BACKGROUND: Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set of comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz. RESULTS: We obtain 16 000 high quality predictions among which we recover the majority of the known ncRNAs. Taking a pessimistically estimated false discovery rate of 40% into account, this implies that at least some ten thousand loci in the Drosophila genome show the hallmarks of stabilizing selection action of RNA structure, and hence are most likely functional at the RNA level. A subset of RNAz predictions overlapping with TRF1 and BRF binding sites [Isogai et al., EMBO J. 26: 79–89 (2007)], which are plausible candidates of Pol III transcripts, have been studied in more detail. Among these sequences we identify several "clusters" of ncRNA candidates with striking structural similarities. CONCLUSION: The statistical evaluation of the RNAz predictions in comparison with a similar analysis of vertebrate genomes [Washietl et al., Nat. Biotech. 23: 1383–1390 (2005)] shows that qualitatively similar fractions of structured RNAs are found in introns, UTRs, and intergenic regions. The intergenic RNA structures, however, are concentrated much more closely around known protein-coding loci, suggesting that flies have significantly smaller complement of independent structured ncRNAs compared to mammals. BioMed Central 2007-11-08 /pmc/articles/PMC2216035/ /pubmed/17996037 http://dx.doi.org/10.1186/1471-2164-8-406 Text en Copyright © 2007 Rose et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Rose, Dominic
Hackermüller, Jörg
Washietl, Stefan
Reiche, Kristin
Hertel, Jana
Findeiß, Sven
Stadler, Peter F
Prohaska, Sonja J
Computational RNomics of Drosophilids
title Computational RNomics of Drosophilids
title_full Computational RNomics of Drosophilids
title_fullStr Computational RNomics of Drosophilids
title_full_unstemmed Computational RNomics of Drosophilids
title_short Computational RNomics of Drosophilids
title_sort computational rnomics of drosophilids
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2216035/
https://www.ncbi.nlm.nih.gov/pubmed/17996037
http://dx.doi.org/10.1186/1471-2164-8-406
work_keys_str_mv AT rosedominic computationalrnomicsofdrosophilids
AT hackermullerjorg computationalrnomicsofdrosophilids
AT washietlstefan computationalrnomicsofdrosophilids
AT reichekristin computationalrnomicsofdrosophilids
AT herteljana computationalrnomicsofdrosophilids
AT findeißsven computationalrnomicsofdrosophilids
AT stadlerpeterf computationalrnomicsofdrosophilids
AT prohaskasonjaj computationalrnomicsofdrosophilids