Cargando…

Identification and characterization of novel conserved RNA structures in Drosophila

BACKGROUND: Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes – and thus for conserved secondary structures...

Descripción completa

Detalles Bibliográficos
Autores principales: Kirsch, Rebecca, Seemann, Stefan E., Ruzzo, Walter L., Cohen, Stephen M., Stadler, Peter F., Gorodkin, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6288889/
https://www.ncbi.nlm.nih.gov/pubmed/30537930
http://dx.doi.org/10.1186/s12864-018-5234-4
_version_ 1783379879119552512
author Kirsch, Rebecca
Seemann, Stefan E.
Ruzzo, Walter L.
Cohen, Stephen M.
Stadler, Peter F.
Gorodkin, Jan
author_facet Kirsch, Rebecca
Seemann, Stefan E.
Ruzzo, Walter L.
Cohen, Stephen M.
Stadler, Peter F.
Gorodkin, Jan
author_sort Kirsch, Rebecca
collection PubMed
description BACKGROUND: Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes – and thus for conserved secondary structures – even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome. RESULTS: To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ∼30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, most statistically significant enrichment of CRSs were observed in pupae, mainly in exons of untranslated regions, promotors, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein coding genes, which suggests that most intergenic CRSs are independent genetic units. CONCLUSIONS: This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5234-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6288889
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62888892018-12-14 Identification and characterization of novel conserved RNA structures in Drosophila Kirsch, Rebecca Seemann, Stefan E. Ruzzo, Walter L. Cohen, Stephen M. Stadler, Peter F. Gorodkin, Jan BMC Genomics Research Article BACKGROUND: Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes – and thus for conserved secondary structures – even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome. RESULTS: To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ∼30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, most statistically significant enrichment of CRSs were observed in pupae, mainly in exons of untranslated regions, promotors, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein coding genes, which suggests that most intergenic CRSs are independent genetic units. CONCLUSIONS: This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5234-4) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-11 /pmc/articles/PMC6288889/ /pubmed/30537930 http://dx.doi.org/10.1186/s12864-018-5234-4 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Kirsch, Rebecca
Seemann, Stefan E.
Ruzzo, Walter L.
Cohen, Stephen M.
Stadler, Peter F.
Gorodkin, Jan
Identification and characterization of novel conserved RNA structures in Drosophila
title Identification and characterization of novel conserved RNA structures in Drosophila
title_full Identification and characterization of novel conserved RNA structures in Drosophila
title_fullStr Identification and characterization of novel conserved RNA structures in Drosophila
title_full_unstemmed Identification and characterization of novel conserved RNA structures in Drosophila
title_short Identification and characterization of novel conserved RNA structures in Drosophila
title_sort identification and characterization of novel conserved rna structures in drosophila
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6288889/
https://www.ncbi.nlm.nih.gov/pubmed/30537930
http://dx.doi.org/10.1186/s12864-018-5234-4
work_keys_str_mv AT kirschrebecca identificationandcharacterizationofnovelconservedrnastructuresindrosophila
AT seemannstefane identificationandcharacterizationofnovelconservedrnastructuresindrosophila
AT ruzzowalterl identificationandcharacterizationofnovelconservedrnastructuresindrosophila
AT cohenstephenm identificationandcharacterizationofnovelconservedrnastructuresindrosophila
AT stadlerpeterf identificationandcharacterizationofnovelconservedrnastructuresindrosophila
AT gorodkinjan identificationandcharacterizationofnovelconservedrnastructuresindrosophila