Cargando…

MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies

BACKGROUND: Genomes of eukaryotes are inhabited by myriads of mobile genetic elements (MGEs) – transposons and retrotransposons - which play a great role in genome plasticity and evolution. A lot of computational tools were developed to annotate them either in genomic assemblies or raw reads using d...

Descripción completa

Detalles Bibliográficos
Autores principales: Guliaev, Andrei S., Semyenova, Seraphima K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6515669/
https://www.ncbi.nlm.nih.gov/pubmed/31114637
http://dx.doi.org/10.1186/s13100-019-0163-6
_version_ 1783418128670130176
author Guliaev, Andrei S.
Semyenova, Seraphima K.
author_facet Guliaev, Andrei S.
Semyenova, Seraphima K.
author_sort Guliaev, Andrei S.
collection PubMed
description BACKGROUND: Genomes of eukaryotes are inhabited by myriads of mobile genetic elements (MGEs) – transposons and retrotransposons - which play a great role in genome plasticity and evolution. A lot of computational tools were developed to annotate them either in genomic assemblies or raw reads using de novo or homology-based approaches. But there has been no pipeline enabling users to get coding and flanking sequences of MGEs suitable for a downstream analysis from genome assemblies. RESULTS: We developed a new pipeline, MGERT (Mobile Genetic Elements Retrieving Tool), that automates all the steps necessary to obtain protein-coding sequences of mobile genetic elements from genomic assemblies even if no previous knowledge on MGE content of a particular genome is available. CONCLUSIONS: Using MGERT, researchers can easily find MGEs, their coding and flanking sequences in the genome of interest. Thus, this pipeline helps researchers to focus on the biological analysis of MGEs rather than excessive scripting and pipelining. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13100-019-0163-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6515669
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65156692019-05-21 MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies Guliaev, Andrei S. Semyenova, Seraphima K. Mob DNA Software BACKGROUND: Genomes of eukaryotes are inhabited by myriads of mobile genetic elements (MGEs) – transposons and retrotransposons - which play a great role in genome plasticity and evolution. A lot of computational tools were developed to annotate them either in genomic assemblies or raw reads using de novo or homology-based approaches. But there has been no pipeline enabling users to get coding and flanking sequences of MGEs suitable for a downstream analysis from genome assemblies. RESULTS: We developed a new pipeline, MGERT (Mobile Genetic Elements Retrieving Tool), that automates all the steps necessary to obtain protein-coding sequences of mobile genetic elements from genomic assemblies even if no previous knowledge on MGE content of a particular genome is available. CONCLUSIONS: Using MGERT, researchers can easily find MGEs, their coding and flanking sequences in the genome of interest. Thus, this pipeline helps researchers to focus on the biological analysis of MGEs rather than excessive scripting and pipelining. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13100-019-0163-6) contains supplementary material, which is available to authorized users. BioMed Central 2019-05-14 /pmc/articles/PMC6515669/ /pubmed/31114637 http://dx.doi.org/10.1186/s13100-019-0163-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Guliaev, Andrei S.
Semyenova, Seraphima K.
MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies
title MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies
title_full MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies
title_fullStr MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies
title_full_unstemmed MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies
title_short MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies
title_sort mgert: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6515669/
https://www.ncbi.nlm.nih.gov/pubmed/31114637
http://dx.doi.org/10.1186/s13100-019-0163-6
work_keys_str_mv AT guliaevandreis mgertapipelinetoretrievecodingsequencesofmobilegeneticelementsfromgenomeassemblies
AT semyenovaseraphimak mgertapipelinetoretrievecodingsequencesofmobilegeneticelementsfromgenomeassemblies