Cargando…
Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome
BACKGROUND: Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatis...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3544724/ https://www.ncbi.nlm.nih.gov/pubmed/23171245 http://dx.doi.org/10.1186/1471-2164-13-644 |
_version_ | 1782255838013947904 |
---|---|
author | González, Leonardo Galindo Deyholos, Michael K |
author_facet | González, Leonardo Galindo Deyholos, Michael K |
author_sort | González, Leonardo Galindo |
collection | PubMed |
description | BACKGROUND: Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. RESULTS: Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. CONCLUSIONS: The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated. |
format | Online Article Text |
id | pubmed-3544724 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35447242013-01-15 Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome González, Leonardo Galindo Deyholos, Michael K BMC Genomics Research Article BACKGROUND: Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. RESULTS: Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. CONCLUSIONS: The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated. BioMed Central 2012-11-21 /pmc/articles/PMC3544724/ /pubmed/23171245 http://dx.doi.org/10.1186/1471-2164-13-644 Text en Copyright ©2012 González and Deyholos; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article González, Leonardo Galindo Deyholos, Michael K Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome |
title | Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome |
title_full | Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome |
title_fullStr | Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome |
title_full_unstemmed | Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome |
title_short | Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome |
title_sort | identification, characterization and distribution of transposable elements in the flax (linum usitatissimum l.) genome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3544724/ https://www.ncbi.nlm.nih.gov/pubmed/23171245 http://dx.doi.org/10.1186/1471-2164-13-644 |
work_keys_str_mv | AT gonzalezleonardogalindo identificationcharacterizationanddistributionoftransposableelementsintheflaxlinumusitatissimumlgenome AT deyholosmichaelk identificationcharacterizationanddistributionoftransposableelementsintheflaxlinumusitatissimumlgenome |