Cargando…
Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes
Eukaryotic genomes contain large amount of repetitive DNA, most of which is derived from transposable elements (TEs). Progress has been made to develop computational tools for ab initio identification of repeat families, but there is an urgent need to develop tools to automate the annotation of TEs...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817418/ https://www.ncbi.nlm.nih.gov/pubmed/20333191 http://dx.doi.org/10.1093/gbe/evp023 |
_version_ | 1782177191324286976 |
---|---|
author | Feschotte, Cédric Keswani, Umeshkumar Ranganathan, Nirmal Guibotsy, Marcel L. Levine, David |
author_facet | Feschotte, Cédric Keswani, Umeshkumar Ranganathan, Nirmal Guibotsy, Marcel L. Levine, David |
author_sort | Feschotte, Cédric |
collection | PubMed |
description | Eukaryotic genomes contain large amount of repetitive DNA, most of which is derived from transposable elements (TEs). Progress has been made to develop computational tools for ab initio identification of repeat families, but there is an urgent need to develop tools to automate the annotation of TEs in genome sequences. Here we introduce REPCLASS, a tool that automates the classification of TE sequences. Using control repeat libraries, we show that the program can classify accurately virtually any known TE types. Combining REPCLASS to ab initio repeat finding in the genomes of Caenorhabditis elegans and Drosophila melanogaster allowed us to recover the contrasting TE landscape characteristic of these species. Unexpectedly, REPCLASS also uncovered several novel TE families in both genomes, augmenting the TE repertoire of these model species. When applied to the genomes of distant Caenorhabditis and Drosophila species, the approach revealed a remarkable conservation of TE composition profile within each genus, despite substantial interspecific covariations in genome size and in the number of TEs and TE families. Lastly, we applied REPCLASS to analyze 10 fungal genomes from a wide taxonomic range, most of which have not been analyzed for TE content previously. The results showed that TE diversity varies widely across the fungi “kingdom” and appears to positively correlate with genome size, in particular for DNA transposons. Together, these data validate REPCLASS as a powerful tool to explore the repetitive DNA landscapes of eukaryotes and to shed light onto the evolutionary forces shaping TE diversity and genome architecture. |
format | Text |
id | pubmed-2817418 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-28174182010-03-22 Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes Feschotte, Cédric Keswani, Umeshkumar Ranganathan, Nirmal Guibotsy, Marcel L. Levine, David Genome Biol Evol Research Articles Eukaryotic genomes contain large amount of repetitive DNA, most of which is derived from transposable elements (TEs). Progress has been made to develop computational tools for ab initio identification of repeat families, but there is an urgent need to develop tools to automate the annotation of TEs in genome sequences. Here we introduce REPCLASS, a tool that automates the classification of TE sequences. Using control repeat libraries, we show that the program can classify accurately virtually any known TE types. Combining REPCLASS to ab initio repeat finding in the genomes of Caenorhabditis elegans and Drosophila melanogaster allowed us to recover the contrasting TE landscape characteristic of these species. Unexpectedly, REPCLASS also uncovered several novel TE families in both genomes, augmenting the TE repertoire of these model species. When applied to the genomes of distant Caenorhabditis and Drosophila species, the approach revealed a remarkable conservation of TE composition profile within each genus, despite substantial interspecific covariations in genome size and in the number of TEs and TE families. Lastly, we applied REPCLASS to analyze 10 fungal genomes from a wide taxonomic range, most of which have not been analyzed for TE content previously. The results showed that TE diversity varies widely across the fungi “kingdom” and appears to positively correlate with genome size, in particular for DNA transposons. Together, these data validate REPCLASS as a powerful tool to explore the repetitive DNA landscapes of eukaryotes and to shed light onto the evolutionary forces shaping TE diversity and genome architecture. Oxford University Press 2009 2009-07-23 /pmc/articles/PMC2817418/ /pubmed/20333191 http://dx.doi.org/10.1093/gbe/evp023 Text en © The Author(s) 2009. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Feschotte, Cédric Keswani, Umeshkumar Ranganathan, Nirmal Guibotsy, Marcel L. Levine, David Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes |
title | Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes |
title_full | Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes |
title_fullStr | Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes |
title_full_unstemmed | Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes |
title_short | Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes |
title_sort | exploring repetitive dna landscapes using repclass, a tool that automates the classification of transposable elements in eukaryotic genomes |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817418/ https://www.ncbi.nlm.nih.gov/pubmed/20333191 http://dx.doi.org/10.1093/gbe/evp023 |
work_keys_str_mv | AT feschottecedric exploringrepetitivednalandscapesusingrepclassatoolthatautomatestheclassificationoftransposableelementsineukaryoticgenomes AT keswaniumeshkumar exploringrepetitivednalandscapesusingrepclassatoolthatautomatestheclassificationoftransposableelementsineukaryoticgenomes AT ranganathannirmal exploringrepetitivednalandscapesusingrepclassatoolthatautomatestheclassificationoftransposableelementsineukaryoticgenomes AT guibotsymarcell exploringrepetitivednalandscapesusingrepclassatoolthatautomatestheclassificationoftransposableelementsineukaryoticgenomes AT levinedavid exploringrepetitivednalandscapesusingrepclassatoolthatautomatestheclassificationoftransposableelementsineukaryoticgenomes |