Cargando…
Compressing Proteomes: The Relevance of Medium Range Correlations
We study the nonrandomness of proteome sequences by analysing the correlations that arise between amino acids at a short and medium range, more specifically, between amino acids located 10 or 100 residues apart; respectively. We show that statistical models that consider these two types of correlati...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171345/ https://www.ncbi.nlm.nih.gov/pubmed/18256727 http://dx.doi.org/10.1155/2007/60723 |
_version_ | 1782211741540679680 |
---|---|
author | Benedetto, Dario Caglioti, Emanuele Chica, Claudia |
author_facet | Benedetto, Dario Caglioti, Emanuele Chica, Claudia |
author_sort | Benedetto, Dario |
collection | PubMed |
description | We study the nonrandomness of proteome sequences by analysing the correlations that arise between amino acids at a short and medium range, more specifically, between amino acids located 10 or 100 residues apart; respectively. We show that statistical models that consider these two types of correlation are more likely to seize the information contained in protein sequences and thus achieve good compression rates. Finally, we propose that the cause for this redundancy is related to the evolutionary origin of proteomes and protein sequences. |
format | Online Article Text |
id | pubmed-3171345 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Springer |
record_format | MEDLINE/PubMed |
spelling | pubmed-31713452011-09-13 Compressing Proteomes: The Relevance of Medium Range Correlations Benedetto, Dario Caglioti, Emanuele Chica, Claudia EURASIP J Bioinform Syst Biol Research Article We study the nonrandomness of proteome sequences by analysing the correlations that arise between amino acids at a short and medium range, more specifically, between amino acids located 10 or 100 residues apart; respectively. We show that statistical models that consider these two types of correlation are more likely to seize the information contained in protein sequences and thus achieve good compression rates. Finally, we propose that the cause for this redundancy is related to the evolutionary origin of proteomes and protein sequences. Springer 2007-10-30 /pmc/articles/PMC3171345/ /pubmed/18256727 http://dx.doi.org/10.1155/2007/60723 Text en Copyright © 2007 Dario Benedetto et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Benedetto, Dario Caglioti, Emanuele Chica, Claudia Compressing Proteomes: The Relevance of Medium Range Correlations |
title | Compressing Proteomes: The Relevance of Medium Range Correlations |
title_full | Compressing Proteomes: The Relevance of Medium Range Correlations |
title_fullStr | Compressing Proteomes: The Relevance of Medium Range Correlations |
title_full_unstemmed | Compressing Proteomes: The Relevance of Medium Range Correlations |
title_short | Compressing Proteomes: The Relevance of Medium Range Correlations |
title_sort | compressing proteomes: the relevance of medium range correlations |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171345/ https://www.ncbi.nlm.nih.gov/pubmed/18256727 http://dx.doi.org/10.1155/2007/60723 |
work_keys_str_mv | AT benedettodario compressingproteomestherelevanceofmediumrangecorrelations AT cagliotiemanuele compressingproteomestherelevanceofmediumrangecorrelations AT chicaclaudia compressingproteomestherelevanceofmediumrangecorrelations |