Cargando…

Compressing Proteomes: The Relevance of Medium Range Correlations

We study the nonrandomness of proteome sequences by analysing the correlations that arise between amino acids at a short and medium range, more specifically, between amino acids located 10 or 100 residues apart; respectively. We show that statistical models that consider these two types of correlati...

Descripción completa

Detalles Bibliográficos
Autores principales: Benedetto, Dario, Caglioti, Emanuele, Chica, Claudia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171345/
https://www.ncbi.nlm.nih.gov/pubmed/18256727
http://dx.doi.org/10.1155/2007/60723
Descripción
Sumario:We study the nonrandomness of proteome sequences by analysing the correlations that arise between amino acids at a short and medium range, more specifically, between amino acids located 10 or 100 residues apart; respectively. We show that statistical models that consider these two types of correlation are more likely to seize the information contained in protein sequences and thus achieve good compression rates. Finally, we propose that the cause for this redundancy is related to the evolutionary origin of proteomes and protein sequences.