Cargando…

Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes

BACKGROUND: Class 2 transposable elements (TEs) are the predominant elements in and around plant genes where they generate significant allelic diversity. Using the complete sequences of four grasses, we have performed a novel comparative analysis of class 2 TEs. To ensure consistent comparative anal...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Yujun, Qin, Shanshan, Wessler, Susan R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3579700/
https://www.ncbi.nlm.nih.gov/pubmed/23369001
http://dx.doi.org/10.1186/1471-2164-14-71
_version_ 1782260145567301632
author Han, Yujun
Qin, Shanshan
Wessler, Susan R
author_facet Han, Yujun
Qin, Shanshan
Wessler, Susan R
author_sort Han, Yujun
collection PubMed
description BACKGROUND: Class 2 transposable elements (TEs) are the predominant elements in and around plant genes where they generate significant allelic diversity. Using the complete sequences of four grasses, we have performed a novel comparative analysis of class 2 TEs. To ensure consistent comparative analyses, we re-annotated class 2 TEs in Brachypodium distachyon, Oryza sativa (rice), Sorghum bicolor and Zea mays and assigned them to one of the five cut-and-paste superfamilies found in plant genomes (Tc1/mariner, PIF/Harbinger, hAT, Mutator, CACTA). We have focused on noncoding elements because of their abundance, and compared superfamily copy number, size and genomic distribution as well as correlation with the level of nearby gene expression. RESULTS: Our comparison revealed both unique and conserved features. First, the average length or size distribution of elements in each superfamily is largely conserved, with the shortest always being Tc1/mariner elements, followed by PIF/Harbinger, hAT, Mutator and CACTA. This order also holds for the ratio of the copy numbers of noncoding to coding elements. Second, with the exception of CACTAs, noncoding TEs are enriched within and flanking genes, where they display conserved distribution patterns, having the highest peak in the promoter region. Finally, our analysis of microarray data revealed that genes associated with Tc1/mariner and PIF/Harbinger noncoding elements have significantly higher expression levels than genes without class 2 TEs. In contrast, genes with CACTA elements have significantly lower expression than genes without class 2 TEs. CONCLUSIONS: We have achieved the most comprehensive annotation of class 2 TEs to date in these four grass genomes. Comparative analysis of this robust dataset led to the identification of several previously unknown features of each superfamily related to copy number, element size, genomic distribution and correlation with the expression levels of nearby genes. These results highlight the importance of distinguishing TE superfamilies when assessing their impact on gene and genome evolution.
format Online
Article
Text
id pubmed-3579700
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35797002013-02-23 Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes Han, Yujun Qin, Shanshan Wessler, Susan R BMC Genomics Research Article BACKGROUND: Class 2 transposable elements (TEs) are the predominant elements in and around plant genes where they generate significant allelic diversity. Using the complete sequences of four grasses, we have performed a novel comparative analysis of class 2 TEs. To ensure consistent comparative analyses, we re-annotated class 2 TEs in Brachypodium distachyon, Oryza sativa (rice), Sorghum bicolor and Zea mays and assigned them to one of the five cut-and-paste superfamilies found in plant genomes (Tc1/mariner, PIF/Harbinger, hAT, Mutator, CACTA). We have focused on noncoding elements because of their abundance, and compared superfamily copy number, size and genomic distribution as well as correlation with the level of nearby gene expression. RESULTS: Our comparison revealed both unique and conserved features. First, the average length or size distribution of elements in each superfamily is largely conserved, with the shortest always being Tc1/mariner elements, followed by PIF/Harbinger, hAT, Mutator and CACTA. This order also holds for the ratio of the copy numbers of noncoding to coding elements. Second, with the exception of CACTAs, noncoding TEs are enriched within and flanking genes, where they display conserved distribution patterns, having the highest peak in the promoter region. Finally, our analysis of microarray data revealed that genes associated with Tc1/mariner and PIF/Harbinger noncoding elements have significantly higher expression levels than genes without class 2 TEs. In contrast, genes with CACTA elements have significantly lower expression than genes without class 2 TEs. CONCLUSIONS: We have achieved the most comprehensive annotation of class 2 TEs to date in these four grass genomes. Comparative analysis of this robust dataset led to the identification of several previously unknown features of each superfamily related to copy number, element size, genomic distribution and correlation with the expression levels of nearby genes. These results highlight the importance of distinguishing TE superfamilies when assessing their impact on gene and genome evolution. BioMed Central 2013-01-31 /pmc/articles/PMC3579700/ /pubmed/23369001 http://dx.doi.org/10.1186/1471-2164-14-71 Text en Copyright ©2013 Han et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Han, Yujun
Qin, Shanshan
Wessler, Susan R
Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes
title Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes
title_full Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes
title_fullStr Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes
title_full_unstemmed Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes
title_short Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes
title_sort comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3579700/
https://www.ncbi.nlm.nih.gov/pubmed/23369001
http://dx.doi.org/10.1186/1471-2164-14-71
work_keys_str_mv AT hanyujun comparisonofclass2transposableelementsatsuperfamilyresolutionrevealsconservedanddistinctfeaturesincerealgrassgenomes
AT qinshanshan comparisonofclass2transposableelementsatsuperfamilyresolutionrevealsconservedanddistinctfeaturesincerealgrassgenomes
AT wesslersusanr comparisonofclass2transposableelementsatsuperfamilyresolutionrevealsconservedanddistinctfeaturesincerealgrassgenomes