Cargando…

Genes for highly abundant proteins in Escherichia coli avoid 5’ codons that promote ribosomal initiation

In many species highly expressed genes (HEGs) over-employ the synonymous codons that match the more abundant iso-acceptor tRNAs. Bacterial transgene codon randomization experiments report, however, that enrichment with such “translationally optimal” codons has little to no effect on the resultant pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Lewin, Loveday E., Daniels, Kate G., Hurst, Laurence D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10599525/
https://www.ncbi.nlm.nih.gov/pubmed/37878567
http://dx.doi.org/10.1371/journal.pcbi.1011581
_version_ 1785125782786408448
author Lewin, Loveday E.
Daniels, Kate G.
Hurst, Laurence D.
author_facet Lewin, Loveday E.
Daniels, Kate G.
Hurst, Laurence D.
author_sort Lewin, Loveday E.
collection PubMed
description In many species highly expressed genes (HEGs) over-employ the synonymous codons that match the more abundant iso-acceptor tRNAs. Bacterial transgene codon randomization experiments report, however, that enrichment with such “translationally optimal” codons has little to no effect on the resultant protein level. By contrast, consistent with the view that ribosomal initiation is rate limiting, synonymous codon usage following the 5’ ATG greatly influences protein levels, at least in part by modifying RNA stability. For the design of bacterial transgenes, for simple codon based in silico inference of protein levels and for understanding selection on synonymous mutations, it would be valuable to computationally determine initiation optimality (IO) scores for codons for any given species. One attractive approach is to characterize the 5’ codon enrichment of HEGs compared with the most lowly expressed genes, just as translational optimality scores of codons have been similarly defined employing the full gene body. Here we determine the viability of this approach employing a unique opportunity: for Escherichia coli there is both the most extensive protein abundance data for native genes and a unique large-scale transgene codon randomization experiment enabling objective definition of the 5’ codons that cause, rather than just correlate with, high protein abundance (that we equate with initiation optimality, broadly defined). Surprisingly, the 5’ ends of native genes that specify highly abundant proteins avoid such initiation optimal codons. We find that this is probably owing to conflicting selection pressures particular to native HEGs, including selection favouring low initiation rates, this potentially enabling high efficiency of ribosomal usage and low noise. While the classical HEG enrichment approach does not work, rendering simple prediction of native protein abundance from 5’ codon content futile, we report evidence that initiation optimality scores derived from the transgene experiment may hold relevance for in silico transgene design for a broad spectrum of bacteria.
format Online
Article
Text
id pubmed-10599525
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105995252023-10-26 Genes for highly abundant proteins in Escherichia coli avoid 5’ codons that promote ribosomal initiation Lewin, Loveday E. Daniels, Kate G. Hurst, Laurence D. PLoS Comput Biol Research Article In many species highly expressed genes (HEGs) over-employ the synonymous codons that match the more abundant iso-acceptor tRNAs. Bacterial transgene codon randomization experiments report, however, that enrichment with such “translationally optimal” codons has little to no effect on the resultant protein level. By contrast, consistent with the view that ribosomal initiation is rate limiting, synonymous codon usage following the 5’ ATG greatly influences protein levels, at least in part by modifying RNA stability. For the design of bacterial transgenes, for simple codon based in silico inference of protein levels and for understanding selection on synonymous mutations, it would be valuable to computationally determine initiation optimality (IO) scores for codons for any given species. One attractive approach is to characterize the 5’ codon enrichment of HEGs compared with the most lowly expressed genes, just as translational optimality scores of codons have been similarly defined employing the full gene body. Here we determine the viability of this approach employing a unique opportunity: for Escherichia coli there is both the most extensive protein abundance data for native genes and a unique large-scale transgene codon randomization experiment enabling objective definition of the 5’ codons that cause, rather than just correlate with, high protein abundance (that we equate with initiation optimality, broadly defined). Surprisingly, the 5’ ends of native genes that specify highly abundant proteins avoid such initiation optimal codons. We find that this is probably owing to conflicting selection pressures particular to native HEGs, including selection favouring low initiation rates, this potentially enabling high efficiency of ribosomal usage and low noise. While the classical HEG enrichment approach does not work, rendering simple prediction of native protein abundance from 5’ codon content futile, we report evidence that initiation optimality scores derived from the transgene experiment may hold relevance for in silico transgene design for a broad spectrum of bacteria. Public Library of Science 2023-10-25 /pmc/articles/PMC10599525/ /pubmed/37878567 http://dx.doi.org/10.1371/journal.pcbi.1011581 Text en © 2023 Lewin et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lewin, Loveday E.
Daniels, Kate G.
Hurst, Laurence D.
Genes for highly abundant proteins in Escherichia coli avoid 5’ codons that promote ribosomal initiation
title Genes for highly abundant proteins in Escherichia coli avoid 5’ codons that promote ribosomal initiation
title_full Genes for highly abundant proteins in Escherichia coli avoid 5’ codons that promote ribosomal initiation
title_fullStr Genes for highly abundant proteins in Escherichia coli avoid 5’ codons that promote ribosomal initiation
title_full_unstemmed Genes for highly abundant proteins in Escherichia coli avoid 5’ codons that promote ribosomal initiation
title_short Genes for highly abundant proteins in Escherichia coli avoid 5’ codons that promote ribosomal initiation
title_sort genes for highly abundant proteins in escherichia coli avoid 5’ codons that promote ribosomal initiation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10599525/
https://www.ncbi.nlm.nih.gov/pubmed/37878567
http://dx.doi.org/10.1371/journal.pcbi.1011581
work_keys_str_mv AT lewinlovedaye genesforhighlyabundantproteinsinescherichiacoliavoid5codonsthatpromoteribosomalinitiation
AT danielskateg genesforhighlyabundantproteinsinescherichiacoliavoid5codonsthatpromoteribosomalinitiation
AT hurstlaurenced genesforhighlyabundantproteinsinescherichiacoliavoid5codonsthatpromoteribosomalinitiation