Cargando…

TransposonUltimate: software for transposon classification, annotation and detection

Most genomes harbor a large number of transposons, and they play an important role in evolution and gene regulation. They are also of interest to clinicians as they are involved in several diseases, including cancer and neurodegeneration. Although several methods for transposon identification are av...

Descripción completa

Detalles Bibliográficos
Autores principales: Riehl, Kevin, Riccio, Cristian, Miska, Eric A, Hemberg, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9226531/
https://www.ncbi.nlm.nih.gov/pubmed/35234904
http://dx.doi.org/10.1093/nar/gkac136
_version_ 1784733921406091264
author Riehl, Kevin
Riccio, Cristian
Miska, Eric A
Hemberg, Martin
author_facet Riehl, Kevin
Riccio, Cristian
Miska, Eric A
Hemberg, Martin
author_sort Riehl, Kevin
collection PubMed
description Most genomes harbor a large number of transposons, and they play an important role in evolution and gene regulation. They are also of interest to clinicians as they are involved in several diseases, including cancer and neurodegeneration. Although several methods for transposon identification are available, they are often highly specialised towards specific tasks or classes of transposons, and they lack common standards such as a unified taxonomy scheme and output file format. We present TransposonUltimate, a powerful bundle of three modules for transposon classification, annotation, and detection of transposition events. TransposonUltimate comes as a Conda package under the GPL-3.0 licence, is well documented and it is easy to install through https://github.com/DerKevinRiehl/TransposonUltimate. We benchmark the classification module on the large TransposonDB covering 891,051 sequences to demonstrate that it outperforms the currently best existing solutions. The annotation and detection modules combine sixteen existing softwares, and we illustrate its use by annotating Caenorhabditis elegans, Rhizophagus irregularis and Oryza sativa subs. japonica genomes. Finally, we use the detection module to discover 29 554 transposition events in the genomes of 20 wild type strains of C. elegans. Databases, assemblies, annotations and further findings can be downloaded from (https://doi.org/10.5281/zenodo.5518085).
format Online
Article
Text
id pubmed-9226531
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92265312022-06-28 TransposonUltimate: software for transposon classification, annotation and detection Riehl, Kevin Riccio, Cristian Miska, Eric A Hemberg, Martin Nucleic Acids Res Methods Online Most genomes harbor a large number of transposons, and they play an important role in evolution and gene regulation. They are also of interest to clinicians as they are involved in several diseases, including cancer and neurodegeneration. Although several methods for transposon identification are available, they are often highly specialised towards specific tasks or classes of transposons, and they lack common standards such as a unified taxonomy scheme and output file format. We present TransposonUltimate, a powerful bundle of three modules for transposon classification, annotation, and detection of transposition events. TransposonUltimate comes as a Conda package under the GPL-3.0 licence, is well documented and it is easy to install through https://github.com/DerKevinRiehl/TransposonUltimate. We benchmark the classification module on the large TransposonDB covering 891,051 sequences to demonstrate that it outperforms the currently best existing solutions. The annotation and detection modules combine sixteen existing softwares, and we illustrate its use by annotating Caenorhabditis elegans, Rhizophagus irregularis and Oryza sativa subs. japonica genomes. Finally, we use the detection module to discover 29 554 transposition events in the genomes of 20 wild type strains of C. elegans. Databases, assemblies, annotations and further findings can be downloaded from (https://doi.org/10.5281/zenodo.5518085). Oxford University Press 2022-03-02 /pmc/articles/PMC9226531/ /pubmed/35234904 http://dx.doi.org/10.1093/nar/gkac136 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Riehl, Kevin
Riccio, Cristian
Miska, Eric A
Hemberg, Martin
TransposonUltimate: software for transposon classification, annotation and detection
title TransposonUltimate: software for transposon classification, annotation and detection
title_full TransposonUltimate: software for transposon classification, annotation and detection
title_fullStr TransposonUltimate: software for transposon classification, annotation and detection
title_full_unstemmed TransposonUltimate: software for transposon classification, annotation and detection
title_short TransposonUltimate: software for transposon classification, annotation and detection
title_sort transposonultimate: software for transposon classification, annotation and detection
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9226531/
https://www.ncbi.nlm.nih.gov/pubmed/35234904
http://dx.doi.org/10.1093/nar/gkac136
work_keys_str_mv AT riehlkevin transposonultimatesoftwarefortransposonclassificationannotationanddetection
AT ricciocristian transposonultimatesoftwarefortransposonclassificationannotationanddetection
AT miskaerica transposonultimatesoftwarefortransposonclassificationannotationanddetection
AT hembergmartin transposonultimatesoftwarefortransposonclassificationannotationanddetection