Cargando…

LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons

BACKGROUND: Transposable elements are abundant in eukaryotic genomes and it is believed that they have a significant impact on the evolution of gene and chromosome structure. While there are several completed eukaryotic genome projects, there are only few high quality genome wide annotations of tran...

Descripción completa

Detalles Bibliográficos
Autores principales: Ellinghaus, David, Kurtz, Stefan, Willhoeft, Ute
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2253517/
https://www.ncbi.nlm.nih.gov/pubmed/18194517
http://dx.doi.org/10.1186/1471-2105-9-18
_version_ 1782151111505870848
author Ellinghaus, David
Kurtz, Stefan
Willhoeft, Ute
author_facet Ellinghaus, David
Kurtz, Stefan
Willhoeft, Ute
author_sort Ellinghaus, David
collection PubMed
description BACKGROUND: Transposable elements are abundant in eukaryotic genomes and it is believed that they have a significant impact on the evolution of gene and chromosome structure. While there are several completed eukaryotic genome projects, there are only few high quality genome wide annotations of transposable elements. Therefore, there is a considerable demand for computational identification of transposable elements. LTR retrotransposons, an important subclass of transposable elements, are well suited for computational identification, as they contain long terminal repeats (LTRs). RESULTS: We have developed a software tool LTRharvest for the de novo detection of full length LTR retrotransposons in large sequence sets. LTRharvest efficiently delivers high quality annotations based on known LTR transposon features like length, distance, and sequence motifs. A quality validation of LTRharvest against a gold standard annotation for Saccharomyces cerevisae and Drosophila melanogaster shows a sensitivity of up to 90% and 97% and specificity of 100% and 72%, respectively. This is comparable or slightly better than annotations for previous software tools. The main advantage of LTRharvest over previous tools is (a) its ability to efficiently handle large datasets from finished or unfinished genome projects, (b) its flexibility in incorporating known sequence features into the prediction, and (c) its availability as an open source software. CONCLUSION: LTRharvest is an efficient software tool delivering high quality annotation of LTR retrotransposons. It can, for example, process the largest human chromosome in approx. 8 minutes on a Linux PC with 4 GB of memory. Its flexibility and small space and run-time requirements makes LTRharvest a very competitive candidate for future LTR retrotransposon annotation projects. Moreover, the structured design and implementation and the availability as open source provides an excellent base for incorporating novel concepts to further improve prediction of LTR retrotransposons.
format Text
id pubmed-2253517
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22535172008-02-23 LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons Ellinghaus, David Kurtz, Stefan Willhoeft, Ute BMC Bioinformatics Software BACKGROUND: Transposable elements are abundant in eukaryotic genomes and it is believed that they have a significant impact on the evolution of gene and chromosome structure. While there are several completed eukaryotic genome projects, there are only few high quality genome wide annotations of transposable elements. Therefore, there is a considerable demand for computational identification of transposable elements. LTR retrotransposons, an important subclass of transposable elements, are well suited for computational identification, as they contain long terminal repeats (LTRs). RESULTS: We have developed a software tool LTRharvest for the de novo detection of full length LTR retrotransposons in large sequence sets. LTRharvest efficiently delivers high quality annotations based on known LTR transposon features like length, distance, and sequence motifs. A quality validation of LTRharvest against a gold standard annotation for Saccharomyces cerevisae and Drosophila melanogaster shows a sensitivity of up to 90% and 97% and specificity of 100% and 72%, respectively. This is comparable or slightly better than annotations for previous software tools. The main advantage of LTRharvest over previous tools is (a) its ability to efficiently handle large datasets from finished or unfinished genome projects, (b) its flexibility in incorporating known sequence features into the prediction, and (c) its availability as an open source software. CONCLUSION: LTRharvest is an efficient software tool delivering high quality annotation of LTR retrotransposons. It can, for example, process the largest human chromosome in approx. 8 minutes on a Linux PC with 4 GB of memory. Its flexibility and small space and run-time requirements makes LTRharvest a very competitive candidate for future LTR retrotransposon annotation projects. Moreover, the structured design and implementation and the availability as open source provides an excellent base for incorporating novel concepts to further improve prediction of LTR retrotransposons. BioMed Central 2008-01-14 /pmc/articles/PMC2253517/ /pubmed/18194517 http://dx.doi.org/10.1186/1471-2105-9-18 Text en Copyright © 2008 Ellinghaus et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Ellinghaus, David
Kurtz, Stefan
Willhoeft, Ute
LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons
title LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons
title_full LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons
title_fullStr LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons
title_full_unstemmed LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons
title_short LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons
title_sort ltrharvest, an efficient and flexible software for de novo detection of ltr retrotransposons
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2253517/
https://www.ncbi.nlm.nih.gov/pubmed/18194517
http://dx.doi.org/10.1186/1471-2105-9-18
work_keys_str_mv AT ellinghausdavid ltrharvestanefficientandflexiblesoftwarefordenovodetectionofltrretrotransposons
AT kurtzstefan ltrharvestanefficientandflexiblesoftwarefordenovodetectionofltrretrotransposons
AT willhoeftute ltrharvestanefficientandflexiblesoftwarefordenovodetectionofltrretrotransposons