Cargando…

Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli

The biochemical and physical factors controlling protein expression level and solubility in vivo remain incompletely characterized. To gain insight into the primary sequence features influencing these outcomes, we performed statistical analyses of results from the high-throughput protein-production...

Descripción completa

Detalles Bibliográficos
Autores principales: Price, W Nicholson, Handelman, Samuel K, Everett, John K, Tong, Saichiu N, Bracic, Ana, Luff, Jon D, Naumov, Victor, Acton, Thomas, Manor, Philip, Xiao, Rong, Rost, Burkhard, Montelione, Gaetano T, Hunt, John F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3372292/
https://www.ncbi.nlm.nih.gov/pubmed/22587847
http://dx.doi.org/10.1186/2042-5783-1-6
_version_ 1782235325240705024
author Price, W Nicholson
Handelman, Samuel K
Everett, John K
Tong, Saichiu N
Bracic, Ana
Luff, Jon D
Naumov, Victor
Acton, Thomas
Manor, Philip
Xiao, Rong
Rost, Burkhard
Montelione, Gaetano T
Hunt, John F
author_facet Price, W Nicholson
Handelman, Samuel K
Everett, John K
Tong, Saichiu N
Bracic, Ana
Luff, Jon D
Naumov, Victor
Acton, Thomas
Manor, Philip
Xiao, Rong
Rost, Burkhard
Montelione, Gaetano T
Hunt, John F
author_sort Price, W Nicholson
collection PubMed
description The biochemical and physical factors controlling protein expression level and solubility in vivo remain incompletely characterized. To gain insight into the primary sequence features influencing these outcomes, we performed statistical analyses of results from the high-throughput protein-production pipeline of the Northeast Structural Genomics Consortium. Proteins expressed in E. coli and consistently purified were scored independently for expression and solubility levels. These parameters nonetheless show a very strong positive correlation. We used logistic regressions to determine whether they are systematically influenced by fractional amino acid composition or several bulk sequence parameters including hydrophobicity, sidechain entropy, electrostatic charge, and predicted backbone disorder. Decreasing hydrophobicity correlates with higher expression and solubility levels, but this correlation apparently derives solely from the beneficial effect of three charged amino acids, at least for bacterial proteins. In fact, the three most hydrophobic residues showed very different correlations with solubility level. Leu showed the strongest negative correlation among amino acids, while Ile showed a slightly positive correlation in most data segments. Several other amino acids also had unexpected effects. Notably, Arg correlated with decreased expression and, most surprisingly, solubility of bacterial proteins, an effect only partially attributable to rare codons. However, rare codons did significantly reduce expression despite use of a codon-enhanced strain. Additional analyses suggest that positively but not negatively charged amino acids may reduce translation efficiency in E. coli irrespective of codon usage. While some observed effects may reflect indirect evolutionary correlations, others may reflect basic physicochemical phenomena. We used these results to construct and validate predictors of expression and solubility levels and overall protein usability, and we propose new strategies to be explored for engineering improved protein expression and solubility.
format Online
Article
Text
id pubmed-3372292
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33722922012-06-12 Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli Price, W Nicholson Handelman, Samuel K Everett, John K Tong, Saichiu N Bracic, Ana Luff, Jon D Naumov, Victor Acton, Thomas Manor, Philip Xiao, Rong Rost, Burkhard Montelione, Gaetano T Hunt, John F Microb Inform Exp Research The biochemical and physical factors controlling protein expression level and solubility in vivo remain incompletely characterized. To gain insight into the primary sequence features influencing these outcomes, we performed statistical analyses of results from the high-throughput protein-production pipeline of the Northeast Structural Genomics Consortium. Proteins expressed in E. coli and consistently purified were scored independently for expression and solubility levels. These parameters nonetheless show a very strong positive correlation. We used logistic regressions to determine whether they are systematically influenced by fractional amino acid composition or several bulk sequence parameters including hydrophobicity, sidechain entropy, electrostatic charge, and predicted backbone disorder. Decreasing hydrophobicity correlates with higher expression and solubility levels, but this correlation apparently derives solely from the beneficial effect of three charged amino acids, at least for bacterial proteins. In fact, the three most hydrophobic residues showed very different correlations with solubility level. Leu showed the strongest negative correlation among amino acids, while Ile showed a slightly positive correlation in most data segments. Several other amino acids also had unexpected effects. Notably, Arg correlated with decreased expression and, most surprisingly, solubility of bacterial proteins, an effect only partially attributable to rare codons. However, rare codons did significantly reduce expression despite use of a codon-enhanced strain. Additional analyses suggest that positively but not negatively charged amino acids may reduce translation efficiency in E. coli irrespective of codon usage. While some observed effects may reflect indirect evolutionary correlations, others may reflect basic physicochemical phenomena. We used these results to construct and validate predictors of expression and solubility levels and overall protein usability, and we propose new strategies to be explored for engineering improved protein expression and solubility. BioMed Central 2011-06-27 /pmc/articles/PMC3372292/ /pubmed/22587847 http://dx.doi.org/10.1186/2042-5783-1-6 Text en Copyright ©2011 Price et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Price, W Nicholson
Handelman, Samuel K
Everett, John K
Tong, Saichiu N
Bracic, Ana
Luff, Jon D
Naumov, Victor
Acton, Thomas
Manor, Philip
Xiao, Rong
Rost, Burkhard
Montelione, Gaetano T
Hunt, John F
Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli
title Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli
title_full Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli
title_fullStr Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli
title_full_unstemmed Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli
title_short Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli
title_sort large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in e. coli
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3372292/
https://www.ncbi.nlm.nih.gov/pubmed/22587847
http://dx.doi.org/10.1186/2042-5783-1-6
work_keys_str_mv AT pricewnicholson largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT handelmansamuelk largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT everettjohnk largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT tongsaichiun largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT bracicana largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT luffjond largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT naumovvictor largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT actonthomas largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT manorphilip largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT xiaorong largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT rostburkhard largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT montelionegaetanot largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli
AT huntjohnf largescaleexperimentalstudiesshowunexpectedaminoacideffectsonproteinexpressionandsolubilityinvivoinecoli