Cargando…

Comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite DNA repeats

BACKGROUND: Repetitive DNA sequences, including transposable elements (TEs) and tandemly repeated satellite DNA (satDNAs), collectively called the “repeatome”, are found in high proportion in organisms across the Tree of Life. Grasshoppers have large genomes, averaging 9 Gb, that contain a high prop...

Descripción completa

Detalles Bibliográficos
Autores principales: Palacios-Gimenez, Octavio M., Koelman, Julia, Palmada-Flores, Marc, Bradford, Tessa M., Jones, Karl K., Cooper, Steven J. B., Kawakami, Takeshi, Suh, Alexander
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7754599/
https://www.ncbi.nlm.nih.gov/pubmed/33349252
http://dx.doi.org/10.1186/s12915-020-00925-x
_version_ 1783626228444430336
author Palacios-Gimenez, Octavio M.
Koelman, Julia
Palmada-Flores, Marc
Bradford, Tessa M.
Jones, Karl K.
Cooper, Steven J. B.
Kawakami, Takeshi
Suh, Alexander
author_facet Palacios-Gimenez, Octavio M.
Koelman, Julia
Palmada-Flores, Marc
Bradford, Tessa M.
Jones, Karl K.
Cooper, Steven J. B.
Kawakami, Takeshi
Suh, Alexander
author_sort Palacios-Gimenez, Octavio M.
collection PubMed
description BACKGROUND: Repetitive DNA sequences, including transposable elements (TEs) and tandemly repeated satellite DNA (satDNAs), collectively called the “repeatome”, are found in high proportion in organisms across the Tree of Life. Grasshoppers have large genomes, averaging 9 Gb, that contain a high proportion of repetitive DNA, which has hampered progress in assembling reference genomes. Here we combined linked-read genomics with transcriptomics to assemble, characterize, and compare the structure of repetitive DNA sequences in four chromosomal races of the morabine grasshopper Vandiemenella viatica species complex and determine their contribution to genome evolution. RESULTS: We obtained linked-read genome assemblies of 2.73–3.27 Gb from estimated genome sizes of 4.26–5.07 Gb DNA per haploid genome of the four chromosomal races of V. viatica. These constitute the third largest insect genomes assembled so far. Combining complementary annotation tools and manual curation, we found a large diversity of TEs and satDNAs, constituting 66 to 75% per genome assembly. A comparison of sequence divergence within the TE classes revealed massive accumulation of recent TEs in all four races (314–463 Mb per assembly), indicating that their large genome sizes are likely due to similar rates of TE accumulation. Transcriptome sequencing showed more biased TE expression in reproductive tissues than somatic tissues, implying permissive transcription in gametogenesis. Out of 129 satDNA families, 102 satDNA families were shared among the four chromosomal races, which likely represent a diversity of satDNA families in the ancestor of the V. viatica chromosomal races. Notably, 50 of these shared satDNA families underwent differential proliferation since the recent diversification of the V. viatica species complex. CONCLUSION: This in-depth annotation of the repeatome in morabine grasshoppers provided new insights into the genome evolution of Orthoptera. Our TEs analysis revealed a massive recent accumulation of TEs equivalent to the size of entire Drosophila genomes, which likely explains the large genome sizes in grasshoppers. Despite an overall high similarity of the TE and satDNA diversity between races, the patterns of TE expression and satDNA proliferation suggest rapid evolution of grasshopper genomes on recent timescales. SUPPLEMENTARY INFORMATION: Supplementary information accompanies this paper at 10.1186/s12915-020-00925-x.
format Online
Article
Text
id pubmed-7754599
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77545992020-12-22 Comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite DNA repeats Palacios-Gimenez, Octavio M. Koelman, Julia Palmada-Flores, Marc Bradford, Tessa M. Jones, Karl K. Cooper, Steven J. B. Kawakami, Takeshi Suh, Alexander BMC Biol Research Article BACKGROUND: Repetitive DNA sequences, including transposable elements (TEs) and tandemly repeated satellite DNA (satDNAs), collectively called the “repeatome”, are found in high proportion in organisms across the Tree of Life. Grasshoppers have large genomes, averaging 9 Gb, that contain a high proportion of repetitive DNA, which has hampered progress in assembling reference genomes. Here we combined linked-read genomics with transcriptomics to assemble, characterize, and compare the structure of repetitive DNA sequences in four chromosomal races of the morabine grasshopper Vandiemenella viatica species complex and determine their contribution to genome evolution. RESULTS: We obtained linked-read genome assemblies of 2.73–3.27 Gb from estimated genome sizes of 4.26–5.07 Gb DNA per haploid genome of the four chromosomal races of V. viatica. These constitute the third largest insect genomes assembled so far. Combining complementary annotation tools and manual curation, we found a large diversity of TEs and satDNAs, constituting 66 to 75% per genome assembly. A comparison of sequence divergence within the TE classes revealed massive accumulation of recent TEs in all four races (314–463 Mb per assembly), indicating that their large genome sizes are likely due to similar rates of TE accumulation. Transcriptome sequencing showed more biased TE expression in reproductive tissues than somatic tissues, implying permissive transcription in gametogenesis. Out of 129 satDNA families, 102 satDNA families were shared among the four chromosomal races, which likely represent a diversity of satDNA families in the ancestor of the V. viatica chromosomal races. Notably, 50 of these shared satDNA families underwent differential proliferation since the recent diversification of the V. viatica species complex. CONCLUSION: This in-depth annotation of the repeatome in morabine grasshoppers provided new insights into the genome evolution of Orthoptera. Our TEs analysis revealed a massive recent accumulation of TEs equivalent to the size of entire Drosophila genomes, which likely explains the large genome sizes in grasshoppers. Despite an overall high similarity of the TE and satDNA diversity between races, the patterns of TE expression and satDNA proliferation suggest rapid evolution of grasshopper genomes on recent timescales. SUPPLEMENTARY INFORMATION: Supplementary information accompanies this paper at 10.1186/s12915-020-00925-x. BioMed Central 2020-12-21 /pmc/articles/PMC7754599/ /pubmed/33349252 http://dx.doi.org/10.1186/s12915-020-00925-x Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Palacios-Gimenez, Octavio M.
Koelman, Julia
Palmada-Flores, Marc
Bradford, Tessa M.
Jones, Karl K.
Cooper, Steven J. B.
Kawakami, Takeshi
Suh, Alexander
Comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite DNA repeats
title Comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite DNA repeats
title_full Comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite DNA repeats
title_fullStr Comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite DNA repeats
title_full_unstemmed Comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite DNA repeats
title_short Comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite DNA repeats
title_sort comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite dna repeats
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7754599/
https://www.ncbi.nlm.nih.gov/pubmed/33349252
http://dx.doi.org/10.1186/s12915-020-00925-x
work_keys_str_mv AT palaciosgimenezoctaviom comparativeanalysisofmorabinegrasshoppergenomesrevealshighlyabundanttransposableelementsandrapidlyproliferatingsatellitednarepeats
AT koelmanjulia comparativeanalysisofmorabinegrasshoppergenomesrevealshighlyabundanttransposableelementsandrapidlyproliferatingsatellitednarepeats
AT palmadafloresmarc comparativeanalysisofmorabinegrasshoppergenomesrevealshighlyabundanttransposableelementsandrapidlyproliferatingsatellitednarepeats
AT bradfordtessam comparativeanalysisofmorabinegrasshoppergenomesrevealshighlyabundanttransposableelementsandrapidlyproliferatingsatellitednarepeats
AT joneskarlk comparativeanalysisofmorabinegrasshoppergenomesrevealshighlyabundanttransposableelementsandrapidlyproliferatingsatellitednarepeats
AT cooperstevenjb comparativeanalysisofmorabinegrasshoppergenomesrevealshighlyabundanttransposableelementsandrapidlyproliferatingsatellitednarepeats
AT kawakamitakeshi comparativeanalysisofmorabinegrasshoppergenomesrevealshighlyabundanttransposableelementsandrapidlyproliferatingsatellitednarepeats
AT suhalexander comparativeanalysisofmorabinegrasshoppergenomesrevealshighlyabundanttransposableelementsandrapidlyproliferatingsatellitednarepeats