Cargando…

RepetDB: a unified resource for transposable element references

BACKGROUND: Thanks to their ability to move around and replicate within genomes, transposable elements (TEs) are perhaps the most important contributors to genome plasticity and evolution. Their detection and annotation are considered essential in any genome sequencing project. The number of fully s...

Descripción completa

Detalles Bibliográficos
Autores principales: Amselem, Joëlle, Cornut, Guillaume, Choisne, Nathalie, Alaux, Michael, Alfama-Depauw, Françoise, Jamilloux, Véronique, Maumus, Florian, Letellier, Thomas, Luyten, Isabelle, Pommier, Cyril, Adam-Blondon, Anne-Françoise, Quesneville, Hadi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6350395/
https://www.ncbi.nlm.nih.gov/pubmed/30719103
http://dx.doi.org/10.1186/s13100-019-0150-y
_version_ 1783390446800601088
author Amselem, Joëlle
Cornut, Guillaume
Choisne, Nathalie
Alaux, Michael
Alfama-Depauw, Françoise
Jamilloux, Véronique
Maumus, Florian
Letellier, Thomas
Luyten, Isabelle
Pommier, Cyril
Adam-Blondon, Anne-Françoise
Quesneville, Hadi
author_facet Amselem, Joëlle
Cornut, Guillaume
Choisne, Nathalie
Alaux, Michael
Alfama-Depauw, Françoise
Jamilloux, Véronique
Maumus, Florian
Letellier, Thomas
Luyten, Isabelle
Pommier, Cyril
Adam-Blondon, Anne-Françoise
Quesneville, Hadi
author_sort Amselem, Joëlle
collection PubMed
description BACKGROUND: Thanks to their ability to move around and replicate within genomes, transposable elements (TEs) are perhaps the most important contributors to genome plasticity and evolution. Their detection and annotation are considered essential in any genome sequencing project. The number of fully sequenced genomes is rapidly increasing with improvements in high-throughput sequencing technologies. A fully automated de novo annotation process for TEs is therefore required to cope with the deluge of sequence data. However, all automated procedures are error-prone, and an automated procedure for TE identification and classification would be no exception. It is therefore crucial to provide not only the TE reference sequences, but also evidence justifying their classification, at the scale of the whole genome. A few TE databases already exist, but none provides evidence to justify TE classification. Moreover, biological information about the sequences remains globally poor. RESULTS: We present here the RepetDB database developed in the framework of GnpIS, a genetic and genomic information system. RepetDB is designed to store and retrieve detected, classified and annotated TEs in a standardized manner. RepetDB is an implementation with extensions of InterMine, an open-source data warehouse framework used here to store, search, browse, analyze and compare all the data recorded for each TE reference sequence. InterMine can display diverse information for each sequence and allows simple to very complex queries. Finally, TE data are displayed via a worldwide data discovery portal. RepetDB is accessible at urgi.versailles.inra.fr/repetdb. CONCLUSIONS: RepetDB is designed to be a TE knowledge base populated with full de novo TE annotations of complete (or near-complete) genome sequences. Indeed, the description and classification of TEs facilitates the exploration of specific TE families, superfamilies or orders across a large range of species. It also makes possible cross-species searches and comparisons of TE family content between genomes.
format Online
Article
Text
id pubmed-6350395
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63503952019-02-04 RepetDB: a unified resource for transposable element references Amselem, Joëlle Cornut, Guillaume Choisne, Nathalie Alaux, Michael Alfama-Depauw, Françoise Jamilloux, Véronique Maumus, Florian Letellier, Thomas Luyten, Isabelle Pommier, Cyril Adam-Blondon, Anne-Françoise Quesneville, Hadi Mob DNA Software BACKGROUND: Thanks to their ability to move around and replicate within genomes, transposable elements (TEs) are perhaps the most important contributors to genome plasticity and evolution. Their detection and annotation are considered essential in any genome sequencing project. The number of fully sequenced genomes is rapidly increasing with improvements in high-throughput sequencing technologies. A fully automated de novo annotation process for TEs is therefore required to cope with the deluge of sequence data. However, all automated procedures are error-prone, and an automated procedure for TE identification and classification would be no exception. It is therefore crucial to provide not only the TE reference sequences, but also evidence justifying their classification, at the scale of the whole genome. A few TE databases already exist, but none provides evidence to justify TE classification. Moreover, biological information about the sequences remains globally poor. RESULTS: We present here the RepetDB database developed in the framework of GnpIS, a genetic and genomic information system. RepetDB is designed to store and retrieve detected, classified and annotated TEs in a standardized manner. RepetDB is an implementation with extensions of InterMine, an open-source data warehouse framework used here to store, search, browse, analyze and compare all the data recorded for each TE reference sequence. InterMine can display diverse information for each sequence and allows simple to very complex queries. Finally, TE data are displayed via a worldwide data discovery portal. RepetDB is accessible at urgi.versailles.inra.fr/repetdb. CONCLUSIONS: RepetDB is designed to be a TE knowledge base populated with full de novo TE annotations of complete (or near-complete) genome sequences. Indeed, the description and classification of TEs facilitates the exploration of specific TE families, superfamilies or orders across a large range of species. It also makes possible cross-species searches and comparisons of TE family content between genomes. BioMed Central 2019-01-29 /pmc/articles/PMC6350395/ /pubmed/30719103 http://dx.doi.org/10.1186/s13100-019-0150-y Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Amselem, Joëlle
Cornut, Guillaume
Choisne, Nathalie
Alaux, Michael
Alfama-Depauw, Françoise
Jamilloux, Véronique
Maumus, Florian
Letellier, Thomas
Luyten, Isabelle
Pommier, Cyril
Adam-Blondon, Anne-Françoise
Quesneville, Hadi
RepetDB: a unified resource for transposable element references
title RepetDB: a unified resource for transposable element references
title_full RepetDB: a unified resource for transposable element references
title_fullStr RepetDB: a unified resource for transposable element references
title_full_unstemmed RepetDB: a unified resource for transposable element references
title_short RepetDB: a unified resource for transposable element references
title_sort repetdb: a unified resource for transposable element references
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6350395/
https://www.ncbi.nlm.nih.gov/pubmed/30719103
http://dx.doi.org/10.1186/s13100-019-0150-y
work_keys_str_mv AT amselemjoelle repetdbaunifiedresourcefortransposableelementreferences
AT cornutguillaume repetdbaunifiedresourcefortransposableelementreferences
AT choisnenathalie repetdbaunifiedresourcefortransposableelementreferences
AT alauxmichael repetdbaunifiedresourcefortransposableelementreferences
AT alfamadepauwfrancoise repetdbaunifiedresourcefortransposableelementreferences
AT jamillouxveronique repetdbaunifiedresourcefortransposableelementreferences
AT maumusflorian repetdbaunifiedresourcefortransposableelementreferences
AT letellierthomas repetdbaunifiedresourcefortransposableelementreferences
AT luytenisabelle repetdbaunifiedresourcefortransposableelementreferences
AT pommiercyril repetdbaunifiedresourcefortransposableelementreferences
AT adamblondonannefrancoise repetdbaunifiedresourcefortransposableelementreferences
AT quesnevillehadi repetdbaunifiedresourcefortransposableelementreferences