Cargando…

COPASAAR – A database for proteomic analysis of single amino acid repeats

BACKGROUND: Single amino acid repeats make up a significant proportion in all of the proteomes that have currently been determined. They have been shown to be functionally and medically significant, and are associated with cancers and neuro-degenerative diseases such as Huntington's Chorea, whe...

Descripción completa

Detalles Bibliográficos
Autores principales: Depledge, Daniel P, Dalby, Andrew R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1199582/
https://www.ncbi.nlm.nih.gov/pubmed/16078990
http://dx.doi.org/10.1186/1471-2105-6-196
_version_ 1782124871974649856
author Depledge, Daniel P
Dalby, Andrew R
author_facet Depledge, Daniel P
Dalby, Andrew R
author_sort Depledge, Daniel P
collection PubMed
description BACKGROUND: Single amino acid repeats make up a significant proportion in all of the proteomes that have currently been determined. They have been shown to be functionally and medically significant, and are associated with cancers and neuro-degenerative diseases such as Huntington's Chorea, where a poly-glutamine repeat is responsible for causing the disease. The COPASAAR database is a new tool to facilitate the rapid analysis of single amino acid repeats at a proteome level. The database aims to simplify the comparison of repeat distributions between proteomes in order to provide a better understanding of their function and evolution. RESULTS: A comparative analysis of all proteomes in the database (currently 244) shows that single amino acid repeats account for about 12–14% of the proteome of any given species. They are more common in eukaryotes (14%) than in either archaea or bacteria (both 13%). Individual analyses of proteomes show that long single amino acid repeats (6+ residues) are much more common in the Eukaryotes and that longer repeats are usually made up of hydrophilic amino acids such as glutamine, glutamic acid, asparagine, aspartic acid and serine. CONCLUSION: COPASAAR is a useful tool for comparative proteomics that provides rapid access to amino acid repeat data that can be readily data-mined. The COPASAAR database can be queried at the kingdom, proteome or individual protein level. As the amount of available proteome data increases this will be increasingly important in order to automate proteome comparison. The insights gained from these studies will give a better insight into the evolution of protein sequence and function.
format Text
id pubmed-1199582
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11995822005-09-08 COPASAAR – A database for proteomic analysis of single amino acid repeats Depledge, Daniel P Dalby, Andrew R BMC Bioinformatics Database BACKGROUND: Single amino acid repeats make up a significant proportion in all of the proteomes that have currently been determined. They have been shown to be functionally and medically significant, and are associated with cancers and neuro-degenerative diseases such as Huntington's Chorea, where a poly-glutamine repeat is responsible for causing the disease. The COPASAAR database is a new tool to facilitate the rapid analysis of single amino acid repeats at a proteome level. The database aims to simplify the comparison of repeat distributions between proteomes in order to provide a better understanding of their function and evolution. RESULTS: A comparative analysis of all proteomes in the database (currently 244) shows that single amino acid repeats account for about 12–14% of the proteome of any given species. They are more common in eukaryotes (14%) than in either archaea or bacteria (both 13%). Individual analyses of proteomes show that long single amino acid repeats (6+ residues) are much more common in the Eukaryotes and that longer repeats are usually made up of hydrophilic amino acids such as glutamine, glutamic acid, asparagine, aspartic acid and serine. CONCLUSION: COPASAAR is a useful tool for comparative proteomics that provides rapid access to amino acid repeat data that can be readily data-mined. The COPASAAR database can be queried at the kingdom, proteome or individual protein level. As the amount of available proteome data increases this will be increasingly important in order to automate proteome comparison. The insights gained from these studies will give a better insight into the evolution of protein sequence and function. BioMed Central 2005-08-03 /pmc/articles/PMC1199582/ /pubmed/16078990 http://dx.doi.org/10.1186/1471-2105-6-196 Text en Copyright © 2005 Depledge and Dalby; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Depledge, Daniel P
Dalby, Andrew R
COPASAAR – A database for proteomic analysis of single amino acid repeats
title COPASAAR – A database for proteomic analysis of single amino acid repeats
title_full COPASAAR – A database for proteomic analysis of single amino acid repeats
title_fullStr COPASAAR – A database for proteomic analysis of single amino acid repeats
title_full_unstemmed COPASAAR – A database for proteomic analysis of single amino acid repeats
title_short COPASAAR – A database for proteomic analysis of single amino acid repeats
title_sort copasaar – a database for proteomic analysis of single amino acid repeats
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1199582/
https://www.ncbi.nlm.nih.gov/pubmed/16078990
http://dx.doi.org/10.1186/1471-2105-6-196
work_keys_str_mv AT depledgedanielp copasaaradatabaseforproteomicanalysisofsingleaminoacidrepeats
AT dalbyandrewr copasaaradatabaseforproteomicanalysisofsingleaminoacidrepeats