Cargando…

Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor

BACKGROUND: Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase,...

Descripción completa

Detalles Bibliográficos
Autores principales: Kohany, Oleksiy, Gentles, Andrew J, Hankus, Lukasz, Jurka, Jerzy
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1634758/
https://www.ncbi.nlm.nih.gov/pubmed/17064419
http://dx.doi.org/10.1186/1471-2105-7-474
_version_ 1782130644082491392
author Kohany, Oleksiy
Gentles, Andrew J
Hankus, Lukasz
Jurka, Jerzy
author_facet Kohany, Oleksiy
Gentles, Andrew J
Hankus, Lukasz
Jurka, Jerzy
author_sort Kohany, Oleksiy
collection PubMed
description BACKGROUND: Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. RESULTS: We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including Pubmed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. CONCLUSION: Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at (RepbaseSubmitter) and (Censor).
format Text
id pubmed-1634758
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16347582006-11-04 Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor Kohany, Oleksiy Gentles, Andrew J Hankus, Lukasz Jurka, Jerzy BMC Bioinformatics Software BACKGROUND: Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. RESULTS: We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including Pubmed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. CONCLUSION: Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at (RepbaseSubmitter) and (Censor). BioMed Central 2006-10-25 /pmc/articles/PMC1634758/ /pubmed/17064419 http://dx.doi.org/10.1186/1471-2105-7-474 Text en Copyright © 2006 Kohany et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Kohany, Oleksiy
Gentles, Andrew J
Hankus, Lukasz
Jurka, Jerzy
Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor
title Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor
title_full Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor
title_fullStr Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor
title_full_unstemmed Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor
title_short Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor
title_sort annotation, submission and screening of repetitive elements in repbase: repbasesubmitter and censor
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1634758/
https://www.ncbi.nlm.nih.gov/pubmed/17064419
http://dx.doi.org/10.1186/1471-2105-7-474
work_keys_str_mv AT kohanyoleksiy annotationsubmissionandscreeningofrepetitiveelementsinrepbaserepbasesubmitterandcensor
AT gentlesandrewj annotationsubmissionandscreeningofrepetitiveelementsinrepbaserepbasesubmitterandcensor
AT hankuslukasz annotationsubmissionandscreeningofrepetitiveelementsinrepbaserepbasesubmitterandcensor
AT jurkajerzy annotationsubmissionandscreeningofrepetitiveelementsinrepbaserepbasesubmitterandcensor