
ProtRepeatsDB: a database of amino acid repeats in genomes

BACKGROUND: Genome wide and cross species comparisons of amino acid repeats is an intriguing problem in biology mainly due to the highly polymorphic nature and diverse functions of amino acid repeats. Innate protein repeats constitute vital functional and structural regions in proteins. Repeats are...


Bibliographic Details
Main Authors: Kalita, Mridul K, Ramasamy, Gowthaman, Duraisamy, Sekhar, Chauhan, Virander S, Gupta, Dinesh
Format: Text
Language: English
Published: BioMed Central 2006
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538635/
https://www.ncbi.nlm.nih.gov/pubmed/16827924
http://dx.doi.org/10.1186/1471-2105-7-336
_version_ 1782129117125148672
author Kalita, Mridul K
Ramasamy, Gowthaman
Duraisamy, Sekhar
Chauhan, Virander S
Gupta, Dinesh
author_facet Kalita, Mridul K
Ramasamy, Gowthaman
Duraisamy, Sekhar
Chauhan, Virander S
Gupta, Dinesh
author_sort Kalita, Mridul K
collection PubMed
description BACKGROUND: Genome wide and cross species comparisons of amino acid repeats is an intriguing problem in biology mainly due to the highly polymorphic nature and diverse functions of amino acid repeats. Innate protein repeats constitute vital functional and structural regions in proteins. Repeats are of great consequence in evolution of proteins, as evident from analysis of repeats in different organisms. In the post genomic era, availability of protein sequences encoded in different genomes provides a unique opportunity to perform large scale comparative studies of amino acid repeats. ProtRepeatsDB is a relational database of perfect and mismatch repeats, access to which is designed as a resource and collection of tools for detection and cross species comparisons of different types of amino acid repeats. DESCRIPTION: ProtRepeatsDB (v1.2) consists of perfect as well as mismatch amino acid repeats in the protein sequences of 141 organisms, the genomes of which are now available. The web interface of ProtRepeatsDB consists of different tools to perform repeat searches based on protein IDs, organism name, repeat sequences, and keywords as in FASTA headers, size, frequency, gene ontology (GO) annotation IDs and regular expressions (REGEXP) describing repeats. These tools also allow formulation of a variety of simple, complex and logical queries to facilitate mining and large-scale cross-species comparisons of amino acid repeats. In addition to this, the database also contains sequence analysis tools to determine repeats in user input sequences. CONCLUSION: ProtRepeatsDB is a multi-organism database of different types of amino acid repeats present in proteins. It integrates useful tools to perform genome wide queries for rapid screening and identification of amino acid repeats and facilitates comparative and evolutionary studies of the repeats. The database is useful for identification of species or organism specific repeat markers, interspecies variations and polymorphism.
format Text
id pubmed-1538635
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1538635 2006-08-10 ProtRepeatsDB: a database of amino acid repeats in genomes Kalita, Mridul K Ramasamy, Gowthaman Duraisamy, Sekhar Chauhan, Virander S Gupta, Dinesh BMC Bioinformatics Database BioMed Central 2006-07-07 /pmc/articles/PMC1538635/ /pubmed/16827924 http://dx.doi.org/10.1186/1471-2105-7-336 Text en Copyright © 2006 Kalita et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Kalita, Mridul K
Ramasamy, Gowthaman
Duraisamy, Sekhar
Chauhan, Virander S
Gupta, Dinesh
ProtRepeatsDB: a database of amino acid repeats in genomes
title ProtRepeatsDB: a database of amino acid repeats in genomes
title_full ProtRepeatsDB: a database of amino acid repeats in genomes
title_fullStr ProtRepeatsDB: a database of amino acid repeats in genomes
title_full_unstemmed ProtRepeatsDB: a database of amino acid repeats in genomes
title_short ProtRepeatsDB: a database of amino acid repeats in genomes
title_sort protrepeatsdb: a database of amino acid repeats in genomes
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1538635/
https://www.ncbi.nlm.nih.gov/pubmed/16827924
http://dx.doi.org/10.1186/1471-2105-7-336
work_keys_str_mv AT kalitamridulk protrepeatsdbadatabaseofaminoacidrepeatsingenomes
AT ramasamygowthaman protrepeatsdbadatabaseofaminoacidrepeatsingenomes
AT duraisamysekhar protrepeatsdbadatabaseofaminoacidrepeatsingenomes
AT chauhanviranders protrepeatsdbadatabaseofaminoacidrepeatsingenomes
AT guptadinesh protrepeatsdbadatabaseofaminoacidrepeatsingenomes