Cargando…

MITE Tracker: an accurate approach to identify miniature inverted-repeat transposable elements in large genomes

BACKGROUND: Miniature inverted-repeat transposable elements (MITEs) are short, non-autonomous class II transposable elements present in a high number of conserved copies in eukaryote genomes. An accurate identification of these elements can help to shed light on the mechanisms controlling genome evo...

Descripción completa

Detalles Bibliográficos
Autores principales: Crescente, Juan Manuel, Zavallo, Diego, Helguera, Marcelo, Vanzetti, Leonardo Sebastián
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6171319/
https://www.ncbi.nlm.nih.gov/pubmed/30285604
http://dx.doi.org/10.1186/s12859-018-2376-y
_version_ 1783360771423469568
author Crescente, Juan Manuel
Zavallo, Diego
Helguera, Marcelo
Vanzetti, Leonardo Sebastián
author_facet Crescente, Juan Manuel
Zavallo, Diego
Helguera, Marcelo
Vanzetti, Leonardo Sebastián
author_sort Crescente, Juan Manuel
collection PubMed
description BACKGROUND: Miniature inverted-repeat transposable elements (MITEs) are short, non-autonomous class II transposable elements present in a high number of conserved copies in eukaryote genomes. An accurate identification of these elements can help to shed light on the mechanisms controlling genome evolution and gene regulation. The structure and distribution of these elements are well-defined and therefore computational approaches can be used to identify MITEs sequences. RESULTS: Here we describe MITE Tracker, a novel, open source software program that finds and classifies MITEs using an efficient alignment strategy to retrieve nearby inverted-repeat sequences from large genomes. This program groups them into high sequence homology families using a fast clustering algorithm and finally filters only those elements that were likely transposed from different genomic locations because of their low scoring flanking sequence alignment. CONCLUSIONS: Many programs have been proposed to find MITEs hidden in genomes. However, none of them are able to process large-scale genomes such as that of bread wheat. Furthermore, in many cases the existing methods perform high false-positive rates (or miss rates). The rice genome was used as reference to compare MITE Tracker against known tools. Our method turned out to be the most reliable in our tests. Indeed, it revealed more known elements, presented the lowest false-positive number and was the only program able to run with the bread wheat genome as input. In wheat, MITE Tracker discovered 6013 MITE families and allowed the first structural exploration of MITEs in the complete bread wheat genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2376-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6171319
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61713192018-10-10 MITE Tracker: an accurate approach to identify miniature inverted-repeat transposable elements in large genomes Crescente, Juan Manuel Zavallo, Diego Helguera, Marcelo Vanzetti, Leonardo Sebastián BMC Bioinformatics Software BACKGROUND: Miniature inverted-repeat transposable elements (MITEs) are short, non-autonomous class II transposable elements present in a high number of conserved copies in eukaryote genomes. An accurate identification of these elements can help to shed light on the mechanisms controlling genome evolution and gene regulation. The structure and distribution of these elements are well-defined and therefore computational approaches can be used to identify MITEs sequences. RESULTS: Here we describe MITE Tracker, a novel, open source software program that finds and classifies MITEs using an efficient alignment strategy to retrieve nearby inverted-repeat sequences from large genomes. This program groups them into high sequence homology families using a fast clustering algorithm and finally filters only those elements that were likely transposed from different genomic locations because of their low scoring flanking sequence alignment. CONCLUSIONS: Many programs have been proposed to find MITEs hidden in genomes. However, none of them are able to process large-scale genomes such as that of bread wheat. Furthermore, in many cases the existing methods perform high false-positive rates (or miss rates). The rice genome was used as reference to compare MITE Tracker against known tools. Our method turned out to be the most reliable in our tests. Indeed, it revealed more known elements, presented the lowest false-positive number and was the only program able to run with the bread wheat genome as input. In wheat, MITE Tracker discovered 6013 MITE families and allowed the first structural exploration of MITEs in the complete bread wheat genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2376-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-10-03 /pmc/articles/PMC6171319/ /pubmed/30285604 http://dx.doi.org/10.1186/s12859-018-2376-y Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Crescente, Juan Manuel
Zavallo, Diego
Helguera, Marcelo
Vanzetti, Leonardo Sebastián
MITE Tracker: an accurate approach to identify miniature inverted-repeat transposable elements in large genomes
title MITE Tracker: an accurate approach to identify miniature inverted-repeat transposable elements in large genomes
title_full MITE Tracker: an accurate approach to identify miniature inverted-repeat transposable elements in large genomes
title_fullStr MITE Tracker: an accurate approach to identify miniature inverted-repeat transposable elements in large genomes
title_full_unstemmed MITE Tracker: an accurate approach to identify miniature inverted-repeat transposable elements in large genomes
title_short MITE Tracker: an accurate approach to identify miniature inverted-repeat transposable elements in large genomes
title_sort mite tracker: an accurate approach to identify miniature inverted-repeat transposable elements in large genomes
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6171319/
https://www.ncbi.nlm.nih.gov/pubmed/30285604
http://dx.doi.org/10.1186/s12859-018-2376-y
work_keys_str_mv AT crescentejuanmanuel mitetrackeranaccurateapproachtoidentifyminiatureinvertedrepeattransposableelementsinlargegenomes
AT zavallodiego mitetrackeranaccurateapproachtoidentifyminiatureinvertedrepeattransposableelementsinlargegenomes
AT helgueramarcelo mitetrackeranaccurateapproachtoidentifyminiatureinvertedrepeattransposableelementsinlargegenomes
AT vanzettileonardosebastian mitetrackeranaccurateapproachtoidentifyminiatureinvertedrepeattransposableelementsinlargegenomes