Cargando…
Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli
Production of soluble recombinant proteins is crucial to the development of industry and basic research. However, the aggregation due to the incorrect folding of the nascent polypeptides is still a mayor bottleneck. Understanding the factors governing protein solubility is important to grasp the und...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6045634/ https://www.ncbi.nlm.nih.gov/pubmed/30006617 http://dx.doi.org/10.1038/s41598-018-29035-z |
_version_ | 1783339692984369152 |
---|---|
author | Pellizza, Leonardo Smal, Clara Rodrigo, Guido Arán, Martín |
author_facet | Pellizza, Leonardo Smal, Clara Rodrigo, Guido Arán, Martín |
author_sort | Pellizza, Leonardo |
collection | PubMed |
description | Production of soluble recombinant proteins is crucial to the development of industry and basic research. However, the aggregation due to the incorrect folding of the nascent polypeptides is still a mayor bottleneck. Understanding the factors governing protein solubility is important to grasp the underlying mechanisms and improve the design of recombinant proteins. Here we show a quantitative study of the expression and solubility of a set of proteins from Bizionia argentinensis. Through the analysis of different features known to modulate protein production, we defined two parameters based on the %MinMax algorithm to compare codon usage clusters between the host and the target genes. We demonstrate that the absolute difference between all %MinMax frequencies of the host and the target gene is significantly negatively correlated with protein expression levels. But most importantly, a strong positive correlation between solubility and the degree of conservation of codons usage clusters is observed for two independent datasets. Moreover, we evince that this correlation is higher in codon usage clusters involved in less compact protein secondary structure regions. Our results provide important tools for protein design and support the notion that codon usage may dictate translation rate and modulate co-translational folding. |
format | Online Article Text |
id | pubmed-6045634 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-60456342018-07-16 Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli Pellizza, Leonardo Smal, Clara Rodrigo, Guido Arán, Martín Sci Rep Article Production of soluble recombinant proteins is crucial to the development of industry and basic research. However, the aggregation due to the incorrect folding of the nascent polypeptides is still a mayor bottleneck. Understanding the factors governing protein solubility is important to grasp the underlying mechanisms and improve the design of recombinant proteins. Here we show a quantitative study of the expression and solubility of a set of proteins from Bizionia argentinensis. Through the analysis of different features known to modulate protein production, we defined two parameters based on the %MinMax algorithm to compare codon usage clusters between the host and the target genes. We demonstrate that the absolute difference between all %MinMax frequencies of the host and the target gene is significantly negatively correlated with protein expression levels. But most importantly, a strong positive correlation between solubility and the degree of conservation of codons usage clusters is observed for two independent datasets. Moreover, we evince that this correlation is higher in codon usage clusters involved in less compact protein secondary structure regions. Our results provide important tools for protein design and support the notion that codon usage may dictate translation rate and modulate co-translational folding. Nature Publishing Group UK 2018-07-13 /pmc/articles/PMC6045634/ /pubmed/30006617 http://dx.doi.org/10.1038/s41598-018-29035-z Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Pellizza, Leonardo Smal, Clara Rodrigo, Guido Arán, Martín Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli |
title | Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli |
title_full | Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli |
title_fullStr | Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli |
title_full_unstemmed | Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli |
title_short | Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli |
title_sort | codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in e. coli |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6045634/ https://www.ncbi.nlm.nih.gov/pubmed/30006617 http://dx.doi.org/10.1038/s41598-018-29035-z |
work_keys_str_mv | AT pellizzaleonardo codonusageclusterscorrelationtowardsproteinsolubilitypredictioninheterologousexpressionsystemsinecoli AT smalclara codonusageclusterscorrelationtowardsproteinsolubilitypredictioninheterologousexpressionsystemsinecoli AT rodrigoguido codonusageclusterscorrelationtowardsproteinsolubilitypredictioninheterologousexpressionsystemsinecoli AT aranmartin codonusageclusterscorrelationtowardsproteinsolubilitypredictioninheterologousexpressionsystemsinecoli |