Cargando…

Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome

BACKGROUND: Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatis...

Descripción completa

Detalles Bibliográficos
Autores principales: González, Leonardo Galindo, Deyholos, Michael K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3544724/
https://www.ncbi.nlm.nih.gov/pubmed/23171245
http://dx.doi.org/10.1186/1471-2164-13-644
_version_ 1782255838013947904
author González, Leonardo Galindo
Deyholos, Michael K
author_facet González, Leonardo Galindo
Deyholos, Michael K
author_sort González, Leonardo Galindo
collection PubMed
description BACKGROUND: Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. RESULTS: Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. CONCLUSIONS: The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated.
format Online
Article
Text
id pubmed-3544724
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35447242013-01-15 Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome González, Leonardo Galindo Deyholos, Michael K BMC Genomics Research Article BACKGROUND: Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. RESULTS: Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. CONCLUSIONS: The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated. BioMed Central 2012-11-21 /pmc/articles/PMC3544724/ /pubmed/23171245 http://dx.doi.org/10.1186/1471-2164-13-644 Text en Copyright ©2012 González and Deyholos; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
González, Leonardo Galindo
Deyholos, Michael K
Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome
title Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome
title_full Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome
title_fullStr Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome
title_full_unstemmed Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome
title_short Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome
title_sort identification, characterization and distribution of transposable elements in the flax (linum usitatissimum l.) genome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3544724/
https://www.ncbi.nlm.nih.gov/pubmed/23171245
http://dx.doi.org/10.1186/1471-2164-13-644
work_keys_str_mv AT gonzalezleonardogalindo identificationcharacterizationanddistributionoftransposableelementsintheflaxlinumusitatissimumlgenome
AT deyholosmichaelk identificationcharacterizationanddistributionoftransposableelementsintheflaxlinumusitatissimumlgenome