Cargando…

Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights

Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always r...

Descripción completa

Detalles Bibliográficos
Autores principales: Kumar, Amitha Sampath, Sowpati, Divya Tej, Mishra, Rakesh K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5125637/
https://www.ncbi.nlm.nih.gov/pubmed/27893794
http://dx.doi.org/10.1371/journal.pone.0166854
_version_ 1782469992344715264
author Kumar, Amitha Sampath
Sowpati, Divya Tej
Mishra, Rakesh K.
author_facet Kumar, Amitha Sampath
Sowpati, Divya Tej
Mishra, Rakesh K.
author_sort Kumar, Amitha Sampath
collection PubMed
description Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features.
format Online
Article
Text
id pubmed-5125637
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-51256372016-12-15 Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights Kumar, Amitha Sampath Sowpati, Divya Tej Mishra, Rakesh K. PLoS One Research Article Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. Public Library of Science 2016-11-28 /pmc/articles/PMC5125637/ /pubmed/27893794 http://dx.doi.org/10.1371/journal.pone.0166854 Text en © 2016 Kumar et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kumar, Amitha Sampath
Sowpati, Divya Tej
Mishra, Rakesh K.
Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
title Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
title_full Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
title_fullStr Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
title_full_unstemmed Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
title_short Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
title_sort single amino acid repeats in the proteome world: structural, functional, and evolutionary insights
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5125637/
https://www.ncbi.nlm.nih.gov/pubmed/27893794
http://dx.doi.org/10.1371/journal.pone.0166854
work_keys_str_mv AT kumaramithasampath singleaminoacidrepeatsintheproteomeworldstructuralfunctionalandevolutionaryinsights
AT sowpatidivyatej singleaminoacidrepeatsintheproteomeworldstructuralfunctionalandevolutionaryinsights
AT mishrarakeshk singleaminoacidrepeatsintheproteomeworldstructuralfunctionalandevolutionaryinsights