Cargando…

DeviaTE: Assembly‐free analysis and visualization of mobile genetic element composition

Transposable elements (TEs) are selfish DNA sequences that multiply within host genomes. They are present in most species investigated so far at varying degrees of abundance and sequence diversity. The TE composition may not only vary between but also within species and could have important biologic...

Descripción completa

Detalles Bibliográficos
Autores principales: Weilguny, Lukas, Kofler, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6791034/
https://www.ncbi.nlm.nih.gov/pubmed/31056858
http://dx.doi.org/10.1111/1755-0998.13030
_version_ 1783458902926426112
author Weilguny, Lukas
Kofler, Robert
author_facet Weilguny, Lukas
Kofler, Robert
author_sort Weilguny, Lukas
collection PubMed
description Transposable elements (TEs) are selfish DNA sequences that multiply within host genomes. They are present in most species investigated so far at varying degrees of abundance and sequence diversity. The TE composition may not only vary between but also within species and could have important biological implications. Variation in prevalence among populations may for example indicate a recent TE invasion, whereas sequence variation could indicate the presence of hyperactive or inactive forms. Gaining unbiased estimates of TE composition is thus vital for understanding the evolutionary dynamics of transposons. To this end, we developed DeviaTE, a tool to analyse and visualize TE abundance using Illumina or Sanger sequencing reads. Our tool requires sequencing reads of one or more samples (tissue, individual or population) and consensus sequences of TEs. It generates a table and a visual representation of TE composition. This allows for an intuitive assessment of coverage, sequence divergence, segregating SNPs and indels, as well as the presence of internal and terminal deletions. By contrasting the coverage between TEs and single copy genes, DeviaTE derives unbiased estimates of TE abundance. We show that naive approaches, which do not consider regions spanned by internal deletions, may substantially underestimate TE abundance. Using published data we demonstrate that DeviaTE can be used to study the TE composition within samples, identify clinal variation in TEs, compare TE diversity among species, and monitor TE invasions. Finally we present careful validations with publicly available and simulated data. DeviaTE is implemented in Python and distributed under the GPLv3 (https://github.com/W-L/deviaTE).
format Online
Article
Text
id pubmed-6791034
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-67910342019-10-21 DeviaTE: Assembly‐free analysis and visualization of mobile genetic element composition Weilguny, Lukas Kofler, Robert Mol Ecol Resour RESOURCE ARTICLES Transposable elements (TEs) are selfish DNA sequences that multiply within host genomes. They are present in most species investigated so far at varying degrees of abundance and sequence diversity. The TE composition may not only vary between but also within species and could have important biological implications. Variation in prevalence among populations may for example indicate a recent TE invasion, whereas sequence variation could indicate the presence of hyperactive or inactive forms. Gaining unbiased estimates of TE composition is thus vital for understanding the evolutionary dynamics of transposons. To this end, we developed DeviaTE, a tool to analyse and visualize TE abundance using Illumina or Sanger sequencing reads. Our tool requires sequencing reads of one or more samples (tissue, individual or population) and consensus sequences of TEs. It generates a table and a visual representation of TE composition. This allows for an intuitive assessment of coverage, sequence divergence, segregating SNPs and indels, as well as the presence of internal and terminal deletions. By contrasting the coverage between TEs and single copy genes, DeviaTE derives unbiased estimates of TE abundance. We show that naive approaches, which do not consider regions spanned by internal deletions, may substantially underestimate TE abundance. Using published data we demonstrate that DeviaTE can be used to study the TE composition within samples, identify clinal variation in TEs, compare TE diversity among species, and monitor TE invasions. Finally we present careful validations with publicly available and simulated data. DeviaTE is implemented in Python and distributed under the GPLv3 (https://github.com/W-L/deviaTE). John Wiley and Sons Inc. 2019-07-03 2019-09 /pmc/articles/PMC6791034/ /pubmed/31056858 http://dx.doi.org/10.1111/1755-0998.13030 Text en © 2019 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle RESOURCE ARTICLES
Weilguny, Lukas
Kofler, Robert
DeviaTE: Assembly‐free analysis and visualization of mobile genetic element composition
title DeviaTE: Assembly‐free analysis and visualization of mobile genetic element composition
title_full DeviaTE: Assembly‐free analysis and visualization of mobile genetic element composition
title_fullStr DeviaTE: Assembly‐free analysis and visualization of mobile genetic element composition
title_full_unstemmed DeviaTE: Assembly‐free analysis and visualization of mobile genetic element composition
title_short DeviaTE: Assembly‐free analysis and visualization of mobile genetic element composition
title_sort deviate: assembly‐free analysis and visualization of mobile genetic element composition
topic RESOURCE ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6791034/
https://www.ncbi.nlm.nih.gov/pubmed/31056858
http://dx.doi.org/10.1111/1755-0998.13030
work_keys_str_mv AT weilgunylukas deviateassemblyfreeanalysisandvisualizationofmobilegeneticelementcomposition
AT koflerrobert deviateassemblyfreeanalysisandvisualizationofmobilegeneticelementcomposition