Cargando…

The distribution and evolution of Arabidopsis thaliana cis natural antisense transcripts

BACKGROUND: Natural antisense transcripts (NATs) are regulatory RNAs that contain sequence complementary to other RNAs, these other RNAs usually being messenger RNAs. In eukaryotic genomes, cis-NATs overlap the gene they complement. RESULTS: Here, our goal is to analyze the distribution and evolutio...

Descripción completa

Detalles Bibliográficos
Autores principales: Bouchard, Johnathan, Oliver, Carlos, Harrison, Paul M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4467840/
https://www.ncbi.nlm.nih.gov/pubmed/26054753
http://dx.doi.org/10.1186/s12864-015-1587-0
_version_ 1782376419109634048
author Bouchard, Johnathan
Oliver, Carlos
Harrison, Paul M
author_facet Bouchard, Johnathan
Oliver, Carlos
Harrison, Paul M
author_sort Bouchard, Johnathan
collection PubMed
description BACKGROUND: Natural antisense transcripts (NATs) are regulatory RNAs that contain sequence complementary to other RNAs, these other RNAs usually being messenger RNAs. In eukaryotic genomes, cis-NATs overlap the gene they complement. RESULTS: Here, our goal is to analyze the distribution and evolutionary conservation of cis-NATs for a variety of available data sets for Arabidopsis thaliana, to gain insights into cis-NAT functional mechanisms and their significance. Cis-NATs derived from traditional sequencing are largely validated by other data sets, although different cis-NAT data sets have different prevalent cis-NAT topologies with respect to overlapping protein-coding genes. A. thaliana cis-NATs have substantial conservation (28-35% in the three substantive data sets analyzed) of expression in A. lyrata. We examined evolutionary sequence conservation at cis-NAT loci in Arabidopsis thaliana across nine sequenced Brassicaceae species (picked for optimal discernment of purifying selection), focussing on the parts of their sequences not overlapping protein-coding transcripts (dubbed ‘NOLPs’). We found significant NOLP sequence conservation for 28-34% NATs across different cis-NAT sets. This NAT NOLP sequence conservation versus A. lyrata is generally significantly correlated with conservation of expression. We discover a significant enrichment of transcription factor binding sites (as evidenced by CHIP-seq data) in NOLPs compared to randomly sampled near-gene NOLP-like DNA , that is linked to significant sequence conservation. Conversely, there is no such evidence for a general significant link between NOLPs and formation of small interfering RNAs (siRNAs), with the substantial majority of unique siRNAs arising from the overlapping portions of the cis-NATs. CONCLUSIONS: In aggregate, our results suggest that many cis-NAT NOLPs function in the regulation of conserved promoter/regulatory elements that they ‘over-hang’. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1587-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4467840
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44678402015-06-16 The distribution and evolution of Arabidopsis thaliana cis natural antisense transcripts Bouchard, Johnathan Oliver, Carlos Harrison, Paul M BMC Genomics Research Article BACKGROUND: Natural antisense transcripts (NATs) are regulatory RNAs that contain sequence complementary to other RNAs, these other RNAs usually being messenger RNAs. In eukaryotic genomes, cis-NATs overlap the gene they complement. RESULTS: Here, our goal is to analyze the distribution and evolutionary conservation of cis-NATs for a variety of available data sets for Arabidopsis thaliana, to gain insights into cis-NAT functional mechanisms and their significance. Cis-NATs derived from traditional sequencing are largely validated by other data sets, although different cis-NAT data sets have different prevalent cis-NAT topologies with respect to overlapping protein-coding genes. A. thaliana cis-NATs have substantial conservation (28-35% in the three substantive data sets analyzed) of expression in A. lyrata. We examined evolutionary sequence conservation at cis-NAT loci in Arabidopsis thaliana across nine sequenced Brassicaceae species (picked for optimal discernment of purifying selection), focussing on the parts of their sequences not overlapping protein-coding transcripts (dubbed ‘NOLPs’). We found significant NOLP sequence conservation for 28-34% NATs across different cis-NAT sets. This NAT NOLP sequence conservation versus A. lyrata is generally significantly correlated with conservation of expression. We discover a significant enrichment of transcription factor binding sites (as evidenced by CHIP-seq data) in NOLPs compared to randomly sampled near-gene NOLP-like DNA , that is linked to significant sequence conservation. Conversely, there is no such evidence for a general significant link between NOLPs and formation of small interfering RNAs (siRNAs), with the substantial majority of unique siRNAs arising from the overlapping portions of the cis-NATs. CONCLUSIONS: In aggregate, our results suggest that many cis-NAT NOLPs function in the regulation of conserved promoter/regulatory elements that they ‘over-hang’. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1587-0) contains supplementary material, which is available to authorized users. BioMed Central 2015-06-09 /pmc/articles/PMC4467840/ /pubmed/26054753 http://dx.doi.org/10.1186/s12864-015-1587-0 Text en © Bouchard et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Bouchard, Johnathan
Oliver, Carlos
Harrison, Paul M
The distribution and evolution of Arabidopsis thaliana cis natural antisense transcripts
title The distribution and evolution of Arabidopsis thaliana cis natural antisense transcripts
title_full The distribution and evolution of Arabidopsis thaliana cis natural antisense transcripts
title_fullStr The distribution and evolution of Arabidopsis thaliana cis natural antisense transcripts
title_full_unstemmed The distribution and evolution of Arabidopsis thaliana cis natural antisense transcripts
title_short The distribution and evolution of Arabidopsis thaliana cis natural antisense transcripts
title_sort distribution and evolution of arabidopsis thaliana cis natural antisense transcripts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4467840/
https://www.ncbi.nlm.nih.gov/pubmed/26054753
http://dx.doi.org/10.1186/s12864-015-1587-0
work_keys_str_mv AT bouchardjohnathan thedistributionandevolutionofarabidopsisthalianacisnaturalantisensetranscripts
AT olivercarlos thedistributionandevolutionofarabidopsisthalianacisnaturalantisensetranscripts
AT harrisonpaulm thedistributionandevolutionofarabidopsisthalianacisnaturalantisensetranscripts
AT bouchardjohnathan distributionandevolutionofarabidopsisthalianacisnaturalantisensetranscripts
AT olivercarlos distributionandevolutionofarabidopsisthalianacisnaturalantisensetranscripts
AT harrisonpaulm distributionandevolutionofarabidopsisthalianacisnaturalantisensetranscripts