Cargando…
Simple sequence proteins in prokaryotic proteomes
BACKGROUND: The structural and functional features associated with Simple Sequence Proteins (SSPs) are non-globularity, disease states, signaling and post-translational modification. SSPs are also an important source of genetic and possibly phenotypic variation. Analysis of 249 prokaryotic proteomes...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1524752/ https://www.ncbi.nlm.nih.gov/pubmed/16762057 http://dx.doi.org/10.1186/1471-2164-7-141 |
_version_ | 1782128837402820608 |
---|---|
author | Subramanyam, Mekapati Bala Gnanamani, Muthiah Ramachandran, Srinivasan |
author_facet | Subramanyam, Mekapati Bala Gnanamani, Muthiah Ramachandran, Srinivasan |
author_sort | Subramanyam, Mekapati Bala |
collection | PubMed |
description | BACKGROUND: The structural and functional features associated with Simple Sequence Proteins (SSPs) are non-globularity, disease states, signaling and post-translational modification. SSPs are also an important source of genetic and possibly phenotypic variation. Analysis of 249 prokaryotic proteomes offers a new opportunity to examine the genomic properties of SSPs. RESULTS: SSPs are a minority but they grow with proteome size. This relationship is exhibited across species varying in genomic GC, mutational bias, life style, and pathogenicity. Their proportion in each proteome is strongly influenced by genomic base compositional bias. In most species simple duplications is favoured, but in a few cases such as Mycobacteria, large families of duplications occur. Amino acid preference in SSPs exhibits a trend towards low cost of biosynthesis. In SSPs and in non-SSPs, Alanine, Glycine, Leucine, and Valine are abundant in species widely varying in genomic GC whereas Isoleucine and Lysine are rich only in organisms with low genomic GC. Arginine is abundant in SSPs of two species and in the non-SSPs of Xanthomonas oryzae. Asparagine is abundant only in SSPs of low GC species. Aspartic acid is abundant only in the non-SSPs of Halobacterium sp NRC1. The abundance of Serine in SSPs of 62 species extends over a broader range compared to that of non-SSPs. Threonine(T) is abundant only in SSPs of a couple of species. SSPs exhibit preferential association with Cell surface, Cell membrane and Transport functions and a negative association with Metabolism. Mesophiles and Thermophiles display similar ranges in the content of SSPs. CONCLUSION: Although SSPs are a minority, the genomic forces of base compositional bias and duplications influence their growth and pattern in each species. The preferences and abundance of amino acids are governed by low biosynthetic cost, evolutionary age and base composition of codons. Abundance of charged amino acids Arginine and Aspartic acid is severely restricted. SSPs preferentially associate with cell surface and interface functions as opposed to metabolism, wherein proteins of high sequence complexity with globular structures are preferred. Mesophiles and Thermophiles are similar with respect to the content of SSPs. Our analysis serves to expandthe commonly held views on SSPs. |
format | Text |
id | pubmed-1524752 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15247522006-08-01 Simple sequence proteins in prokaryotic proteomes Subramanyam, Mekapati Bala Gnanamani, Muthiah Ramachandran, Srinivasan BMC Genomics Research Article BACKGROUND: The structural and functional features associated with Simple Sequence Proteins (SSPs) are non-globularity, disease states, signaling and post-translational modification. SSPs are also an important source of genetic and possibly phenotypic variation. Analysis of 249 prokaryotic proteomes offers a new opportunity to examine the genomic properties of SSPs. RESULTS: SSPs are a minority but they grow with proteome size. This relationship is exhibited across species varying in genomic GC, mutational bias, life style, and pathogenicity. Their proportion in each proteome is strongly influenced by genomic base compositional bias. In most species simple duplications is favoured, but in a few cases such as Mycobacteria, large families of duplications occur. Amino acid preference in SSPs exhibits a trend towards low cost of biosynthesis. In SSPs and in non-SSPs, Alanine, Glycine, Leucine, and Valine are abundant in species widely varying in genomic GC whereas Isoleucine and Lysine are rich only in organisms with low genomic GC. Arginine is abundant in SSPs of two species and in the non-SSPs of Xanthomonas oryzae. Asparagine is abundant only in SSPs of low GC species. Aspartic acid is abundant only in the non-SSPs of Halobacterium sp NRC1. The abundance of Serine in SSPs of 62 species extends over a broader range compared to that of non-SSPs. Threonine(T) is abundant only in SSPs of a couple of species. SSPs exhibit preferential association with Cell surface, Cell membrane and Transport functions and a negative association with Metabolism. Mesophiles and Thermophiles display similar ranges in the content of SSPs. CONCLUSION: Although SSPs are a minority, the genomic forces of base compositional bias and duplications influence their growth and pattern in each species. The preferences and abundance of amino acids are governed by low biosynthetic cost, evolutionary age and base composition of codons. Abundance of charged amino acids Arginine and Aspartic acid is severely restricted. SSPs preferentially associate with cell surface and interface functions as opposed to metabolism, wherein proteins of high sequence complexity with globular structures are preferred. Mesophiles and Thermophiles are similar with respect to the content of SSPs. Our analysis serves to expandthe commonly held views on SSPs. BioMed Central 2006-06-08 /pmc/articles/PMC1524752/ /pubmed/16762057 http://dx.doi.org/10.1186/1471-2164-7-141 Text en Copyright © 2006 Subramanyam et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Subramanyam, Mekapati Bala Gnanamani, Muthiah Ramachandran, Srinivasan Simple sequence proteins in prokaryotic proteomes |
title | Simple sequence proteins in prokaryotic proteomes |
title_full | Simple sequence proteins in prokaryotic proteomes |
title_fullStr | Simple sequence proteins in prokaryotic proteomes |
title_full_unstemmed | Simple sequence proteins in prokaryotic proteomes |
title_short | Simple sequence proteins in prokaryotic proteomes |
title_sort | simple sequence proteins in prokaryotic proteomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1524752/ https://www.ncbi.nlm.nih.gov/pubmed/16762057 http://dx.doi.org/10.1186/1471-2164-7-141 |
work_keys_str_mv | AT subramanyammekapatibala simplesequenceproteinsinprokaryoticproteomes AT gnanamanimuthiah simplesequenceproteinsinprokaryoticproteomes AT ramachandransrinivasan simplesequenceproteinsinprokaryoticproteomes |