Cargando…
Serine substitutions are linked to codon usage and differ for variable and conserved protein regions
Serine is the only amino acid that is encoded by two disjoint codon sets (TCN & AGY) so that a tandem substitution of two nucleotides is required to switch between the two sets. We show that these codon sets underlie distinct substitution patterns at positions subject to purifying and diversifyi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6872785/ https://www.ncbi.nlm.nih.gov/pubmed/31754132 http://dx.doi.org/10.1038/s41598-019-53452-3 |
_version_ | 1783472563389726720 |
---|---|
author | Schwartz, Gregory W. Shauli, Tair Linial, Michal Hershberg, Uri |
author_facet | Schwartz, Gregory W. Shauli, Tair Linial, Michal Hershberg, Uri |
author_sort | Schwartz, Gregory W. |
collection | PubMed |
description | Serine is the only amino acid that is encoded by two disjoint codon sets (TCN & AGY) so that a tandem substitution of two nucleotides is required to switch between the two sets. We show that these codon sets underlie distinct substitution patterns at positions subject to purifying and diversifying selections. We found that in humans, positions that are conserved among ~100 vertebrates, and thus subjected to purifying selection, are enriched for substitutions involving serine (TCN, denoted S′), proline, and alanine, (S′PA). In contrast, the less conserved positions are enriched for serine encoded with AGY codons (denoted S″), glycine and asparagine, (GS″N). We tested this phenomenon in the HIV envelope glycoprotein (gp120), and the V-gene that encodes B-cell receptors/antibodies. These fast evolving proteins both have hypervariable positions, which are under diversifying selection, closely adjacent to highly conserved structural regions. In both instances, we identified an opposite abundance of two groups of serine substitutions, with enrichment of S′PA in the conserved positions, and GS″N in the hypervariable regions. Finally, we analyzed the substitutions across 60,000 individual human exomes to show that, when serine has a specific functional constraint of phosphorylation capability, S′ codons are 32-folds less prone than S″ to substitutions to Threonine or Tyrosine that could potentially retain the phosphorylation site capacity. Combined, our results, that cover evolutionary signals at different temporal scales, demonstrate that through its encoding by two codon sets, serine allows for the existence of alternating substitution patterns within positions of functional maintenance versus sites of rapid diversification. |
format | Online Article Text |
id | pubmed-6872785 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-68727852019-12-04 Serine substitutions are linked to codon usage and differ for variable and conserved protein regions Schwartz, Gregory W. Shauli, Tair Linial, Michal Hershberg, Uri Sci Rep Article Serine is the only amino acid that is encoded by two disjoint codon sets (TCN & AGY) so that a tandem substitution of two nucleotides is required to switch between the two sets. We show that these codon sets underlie distinct substitution patterns at positions subject to purifying and diversifying selections. We found that in humans, positions that are conserved among ~100 vertebrates, and thus subjected to purifying selection, are enriched for substitutions involving serine (TCN, denoted S′), proline, and alanine, (S′PA). In contrast, the less conserved positions are enriched for serine encoded with AGY codons (denoted S″), glycine and asparagine, (GS″N). We tested this phenomenon in the HIV envelope glycoprotein (gp120), and the V-gene that encodes B-cell receptors/antibodies. These fast evolving proteins both have hypervariable positions, which are under diversifying selection, closely adjacent to highly conserved structural regions. In both instances, we identified an opposite abundance of two groups of serine substitutions, with enrichment of S′PA in the conserved positions, and GS″N in the hypervariable regions. Finally, we analyzed the substitutions across 60,000 individual human exomes to show that, when serine has a specific functional constraint of phosphorylation capability, S′ codons are 32-folds less prone than S″ to substitutions to Threonine or Tyrosine that could potentially retain the phosphorylation site capacity. Combined, our results, that cover evolutionary signals at different temporal scales, demonstrate that through its encoding by two codon sets, serine allows for the existence of alternating substitution patterns within positions of functional maintenance versus sites of rapid diversification. Nature Publishing Group UK 2019-11-21 /pmc/articles/PMC6872785/ /pubmed/31754132 http://dx.doi.org/10.1038/s41598-019-53452-3 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Schwartz, Gregory W. Shauli, Tair Linial, Michal Hershberg, Uri Serine substitutions are linked to codon usage and differ for variable and conserved protein regions |
title | Serine substitutions are linked to codon usage and differ for variable and conserved protein regions |
title_full | Serine substitutions are linked to codon usage and differ for variable and conserved protein regions |
title_fullStr | Serine substitutions are linked to codon usage and differ for variable and conserved protein regions |
title_full_unstemmed | Serine substitutions are linked to codon usage and differ for variable and conserved protein regions |
title_short | Serine substitutions are linked to codon usage and differ for variable and conserved protein regions |
title_sort | serine substitutions are linked to codon usage and differ for variable and conserved protein regions |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6872785/ https://www.ncbi.nlm.nih.gov/pubmed/31754132 http://dx.doi.org/10.1038/s41598-019-53452-3 |
work_keys_str_mv | AT schwartzgregoryw serinesubstitutionsarelinkedtocodonusageanddifferforvariableandconservedproteinregions AT shaulitair serinesubstitutionsarelinkedtocodonusageanddifferforvariableandconservedproteinregions AT linialmichal serinesubstitutionsarelinkedtocodonusageanddifferforvariableandconservedproteinregions AT hershberguri serinesubstitutionsarelinkedtocodonusageanddifferforvariableandconservedproteinregions |