Cargando…

Characterization of repetitive DNA landscape in wheat homeologous group 4 chromosomes

BACKGROUND: The number and complexity of repetitive elements varies between species, being in general most represented in those with larger genomes. Combining the flow-sorted chromosome arms approach to genome analysis with second generation DNA sequencing technologies provides a unique opportunity...

Descripción completa

Detalles Bibliográficos
Autores principales: Garbus, Ingrid, Romero, José R, Valarik, Miroslav, Vanžurová, Hana, Karafiátová, Miroslava, Cáccamo, Mario, Doležel, Jaroslav, Tranquilli, Gabriela, Helguera, Marcelo, Echenique, Viviana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4440537/
https://www.ncbi.nlm.nih.gov/pubmed/25962417
http://dx.doi.org/10.1186/s12864-015-1579-0
Descripción
Sumario:BACKGROUND: The number and complexity of repetitive elements varies between species, being in general most represented in those with larger genomes. Combining the flow-sorted chromosome arms approach to genome analysis with second generation DNA sequencing technologies provides a unique opportunity to study the repetitive portion of each chromosome, enabling comparisons among them. Additionally, different sequencing approaches may produce different depth of insight to repeatome content and structure. In this work we analyze and characterize the repetitive sequences of Triticum aestivum cv. Chinese Spring homeologous group 4 chromosome arms, obtained through Roche 454 and Illumina sequencing technologies, hereinafter marked by subscripts 454 and I, respectively. Repetitive sequences were identified with the RepeatMasker software using the interspersed repeat database mips-REdat_v9.0p. The input sequences consisted of our 4DS(454) and 4DL(454) scaffolds and 4AS(I), 4AL(I), 4BS(I), 4BL(I), 4DS(I) and 4DL(I) contigs, downloaded from the International Wheat Genome Sequencing Consortium (IWGSC). RESULTS: Repetitive sequences content varied from 55% to 63% for all chromosome arm assemblies except for 4DL(I), in which the repeat content was 38%. Transposable elements, small RNA, satellites, simple repeats and low complexity sequences were analyzed. SSR frequency was found one per 24 to 27 kb for all chromosome assemblies except 4DL(I), where it was three times higher. Dinucleotides and trinucleotides were the most abundant SSR repeat units. (GA)n/(TC)n was the most abundant SSR except for 4DL(I) where the most frequently identified SSR was (CCG/CGG)n. Retrotransposons followed by DNA transposons were the most highly represented sequence repeats, mainly composed of CACTA/En-Spm and Gypsy superfamilies, respectively. This whole chromosome sequence analysis allowed identification of three new LTR retrotransposon families belonging to the Copia superfamily, one belonging to the Gypsy superfamily and two TRIM retrotransposon families. Their physical distribution in wheat genome was analyzed by fluorescent in situ hybridization (FISH) and one of them, the Carmen retrotransposon, was found specific for centromeric regions of all wheat chromosomes. CONCLUSION: The presented work is the first deep report of wheat repetitive sequences analyzed at the chromosome arm level, revealing the first insight into the repeatome of T. aestivum chromosomes of homeologous group 4. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1579-0) contains supplementary material, which is available to authorized users.