Cargando…

Automated paleontology of repetitive DNA with REANNOTATE

BACKGROUND: Dispersed repeats are a major component of eukaryotic genomes and drivers of genome evolution. Annotation of DNA sequences homologous to known repetitive elements has been mainly performed with the program REPEATMASKER. Sequences annotated by REPEATMASKER often correspond to fragments of...

Descripción completa

Detalles Bibliográficos
Autor principal: Pereira, Vini
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2672092/
https://www.ncbi.nlm.nih.gov/pubmed/19094224
http://dx.doi.org/10.1186/1471-2164-9-614
_version_ 1782166473546924032
author Pereira, Vini
author_facet Pereira, Vini
author_sort Pereira, Vini
collection PubMed
description BACKGROUND: Dispersed repeats are a major component of eukaryotic genomes and drivers of genome evolution. Annotation of DNA sequences homologous to known repetitive elements has been mainly performed with the program REPEATMASKER. Sequences annotated by REPEATMASKER often correspond to fragments of repetitive elements resulting from the insertion of younger elements or other rearrangements. Although REPEATMASKER annotation is indispensable for studying genome biology, this annotation does not contain much information on the common origin of fossil fragments that share an insertion event, especially where clusters of nested insertions of repetitive elements have occurred. RESULTS: Here I present REANNOTATE, a computational tool to process REPEATMASKER annotation for automated i) defragmentation of dispersed repetitive elements, ii) resolution of the temporal order of insertions in clusters of nested elements, and iii) estimating the age of the elements, if they have long terminal repeats. I have re-annotated the repetitive content of human chromosomes, providing evidence for a recent expansion of satellite repeats on the Y chromosome and, from the retroviral age distribution, for a higher rate of evolution on the Y relative to autosomes. CONCLUSION: REANNOTATE is ready to process existing annotation for automated evolutionary analysis of all types of complex repeats in any genome. The tool is freely available under the GPL at .
format Text
id pubmed-2672092
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26720922009-04-23 Automated paleontology of repetitive DNA with REANNOTATE Pereira, Vini BMC Genomics Software BACKGROUND: Dispersed repeats are a major component of eukaryotic genomes and drivers of genome evolution. Annotation of DNA sequences homologous to known repetitive elements has been mainly performed with the program REPEATMASKER. Sequences annotated by REPEATMASKER often correspond to fragments of repetitive elements resulting from the insertion of younger elements or other rearrangements. Although REPEATMASKER annotation is indispensable for studying genome biology, this annotation does not contain much information on the common origin of fossil fragments that share an insertion event, especially where clusters of nested insertions of repetitive elements have occurred. RESULTS: Here I present REANNOTATE, a computational tool to process REPEATMASKER annotation for automated i) defragmentation of dispersed repetitive elements, ii) resolution of the temporal order of insertions in clusters of nested elements, and iii) estimating the age of the elements, if they have long terminal repeats. I have re-annotated the repetitive content of human chromosomes, providing evidence for a recent expansion of satellite repeats on the Y chromosome and, from the retroviral age distribution, for a higher rate of evolution on the Y relative to autosomes. CONCLUSION: REANNOTATE is ready to process existing annotation for automated evolutionary analysis of all types of complex repeats in any genome. The tool is freely available under the GPL at . BioMed Central 2008-12-18 /pmc/articles/PMC2672092/ /pubmed/19094224 http://dx.doi.org/10.1186/1471-2164-9-614 Text en Copyright © 2008 Pereira; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Pereira, Vini
Automated paleontology of repetitive DNA with REANNOTATE
title Automated paleontology of repetitive DNA with REANNOTATE
title_full Automated paleontology of repetitive DNA with REANNOTATE
title_fullStr Automated paleontology of repetitive DNA with REANNOTATE
title_full_unstemmed Automated paleontology of repetitive DNA with REANNOTATE
title_short Automated paleontology of repetitive DNA with REANNOTATE
title_sort automated paleontology of repetitive dna with reannotate
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2672092/
https://www.ncbi.nlm.nih.gov/pubmed/19094224
http://dx.doi.org/10.1186/1471-2164-9-614
work_keys_str_mv AT pereiravini automatedpaleontologyofrepetitivednawithreannotate