Cargando…

Serine substitutions are linked to codon usage and differ for variable and conserved protein regions

Serine is the only amino acid that is encoded by two disjoint codon sets (TCN & AGY) so that a tandem substitution of two nucleotides is required to switch between the two sets. We show that these codon sets underlie distinct substitution patterns at positions subject to purifying and diversifyi...

Descripción completa

Detalles Bibliográficos
Autores principales: Schwartz, Gregory W., Shauli, Tair, Linial, Michal, Hershberg, Uri
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6872785/
https://www.ncbi.nlm.nih.gov/pubmed/31754132
http://dx.doi.org/10.1038/s41598-019-53452-3
_version_ 1783472563389726720
author Schwartz, Gregory W.
Shauli, Tair
Linial, Michal
Hershberg, Uri
author_facet Schwartz, Gregory W.
Shauli, Tair
Linial, Michal
Hershberg, Uri
author_sort Schwartz, Gregory W.
collection PubMed
description Serine is the only amino acid that is encoded by two disjoint codon sets (TCN & AGY) so that a tandem substitution of two nucleotides is required to switch between the two sets. We show that these codon sets underlie distinct substitution patterns at positions subject to purifying and diversifying selections. We found that in humans, positions that are conserved among ~100 vertebrates, and thus subjected to purifying selection, are enriched for substitutions involving serine (TCN, denoted S′), proline, and alanine, (S′PA). In contrast, the less conserved positions are enriched for serine encoded with AGY codons (denoted S″), glycine and asparagine, (GS″N). We tested this phenomenon in the HIV envelope glycoprotein (gp120), and the V-gene that encodes B-cell receptors/antibodies. These fast evolving proteins both have hypervariable positions, which are under diversifying selection, closely adjacent to highly conserved structural regions. In both instances, we identified an opposite abundance of two groups of serine substitutions, with enrichment of S′PA in the conserved positions, and GS″N in the hypervariable regions. Finally, we analyzed the substitutions across 60,000 individual human exomes to show that, when serine has a specific functional constraint of phosphorylation capability, S′ codons are 32-folds less prone than S″ to substitutions to Threonine or Tyrosine that could potentially retain the phosphorylation site capacity. Combined, our results, that cover evolutionary signals at different temporal scales, demonstrate that through its encoding by two codon sets, serine allows for the existence of alternating substitution patterns within positions of functional maintenance versus sites of rapid diversification.
format Online
Article
Text
id pubmed-6872785
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-68727852019-12-04 Serine substitutions are linked to codon usage and differ for variable and conserved protein regions Schwartz, Gregory W. Shauli, Tair Linial, Michal Hershberg, Uri Sci Rep Article Serine is the only amino acid that is encoded by two disjoint codon sets (TCN & AGY) so that a tandem substitution of two nucleotides is required to switch between the two sets. We show that these codon sets underlie distinct substitution patterns at positions subject to purifying and diversifying selections. We found that in humans, positions that are conserved among ~100 vertebrates, and thus subjected to purifying selection, are enriched for substitutions involving serine (TCN, denoted S′), proline, and alanine, (S′PA). In contrast, the less conserved positions are enriched for serine encoded with AGY codons (denoted S″), glycine and asparagine, (GS″N). We tested this phenomenon in the HIV envelope glycoprotein (gp120), and the V-gene that encodes B-cell receptors/antibodies. These fast evolving proteins both have hypervariable positions, which are under diversifying selection, closely adjacent to highly conserved structural regions. In both instances, we identified an opposite abundance of two groups of serine substitutions, with enrichment of S′PA in the conserved positions, and GS″N in the hypervariable regions. Finally, we analyzed the substitutions across 60,000 individual human exomes to show that, when serine has a specific functional constraint of phosphorylation capability, S′ codons are 32-folds less prone than S″ to substitutions to Threonine or Tyrosine that could potentially retain the phosphorylation site capacity. Combined, our results, that cover evolutionary signals at different temporal scales, demonstrate that through its encoding by two codon sets, serine allows for the existence of alternating substitution patterns within positions of functional maintenance versus sites of rapid diversification. Nature Publishing Group UK 2019-11-21 /pmc/articles/PMC6872785/ /pubmed/31754132 http://dx.doi.org/10.1038/s41598-019-53452-3 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Schwartz, Gregory W.
Shauli, Tair
Linial, Michal
Hershberg, Uri
Serine substitutions are linked to codon usage and differ for variable and conserved protein regions
title Serine substitutions are linked to codon usage and differ for variable and conserved protein regions
title_full Serine substitutions are linked to codon usage and differ for variable and conserved protein regions
title_fullStr Serine substitutions are linked to codon usage and differ for variable and conserved protein regions
title_full_unstemmed Serine substitutions are linked to codon usage and differ for variable and conserved protein regions
title_short Serine substitutions are linked to codon usage and differ for variable and conserved protein regions
title_sort serine substitutions are linked to codon usage and differ for variable and conserved protein regions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6872785/
https://www.ncbi.nlm.nih.gov/pubmed/31754132
http://dx.doi.org/10.1038/s41598-019-53452-3
work_keys_str_mv AT schwartzgregoryw serinesubstitutionsarelinkedtocodonusageanddifferforvariableandconservedproteinregions
AT shaulitair serinesubstitutionsarelinkedtocodonusageanddifferforvariableandconservedproteinregions
AT linialmichal serinesubstitutionsarelinkedtocodonusageanddifferforvariableandconservedproteinregions
AT hershberguri serinesubstitutionsarelinkedtocodonusageanddifferforvariableandconservedproteinregions