Cargando…

Lysine and Arginine Content of Proteins: Computational Analysis Suggests a New Tool for Solubility Design

[Image: see text] Prediction and engineering of protein solubility is an important but imprecise area. While some features are routinely used, such as the avoidance of extensive non-polar surface area, scope remains for benchmarking of sequence and structural features with experimental data. We stud...

Descripción completa

Detalles Bibliográficos
Autores principales: Warwicker, Jim, Charonis, Spyros, Curtis, Robin A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2013
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3885198/
https://www.ncbi.nlm.nih.gov/pubmed/24283752
http://dx.doi.org/10.1021/mp4004749
_version_ 1782298723152297984
author Warwicker, Jim
Charonis, Spyros
Curtis, Robin A.
author_facet Warwicker, Jim
Charonis, Spyros
Curtis, Robin A.
author_sort Warwicker, Jim
collection PubMed
description [Image: see text] Prediction and engineering of protein solubility is an important but imprecise area. While some features are routinely used, such as the avoidance of extensive non-polar surface area, scope remains for benchmarking of sequence and structural features with experimental data. We study properties in the context of experimental solubilities, protein gene expression levels, and families of abundant proteins (serum albumin and myoglobin) and their less abundant paralogues. A common feature that emerges for proteins with elevated solubility and at higher expression and abundance levels is an increased ratio of lysine content to arginine content. We suggest that the same properties of arginine that give rise to its recorded propensity for specific interaction surfaces also lead to favorable interactions at nonspecific contacts, and thus lysine is favored for proteins at relatively high concentration. A survey of protein therapeutics shows that a significant subset possesses a relatively low lysine to arginine ratio, and therefore may not be favored for high protein concentration. We conclude that modulation of lysine and arginine content could prove a useful and relatively simple addition to the toolkit available for engineering protein solubility in biotechnological applications.
format Online
Article
Text
id pubmed-3885198
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-38851982014-01-10 Lysine and Arginine Content of Proteins: Computational Analysis Suggests a New Tool for Solubility Design Warwicker, Jim Charonis, Spyros Curtis, Robin A. Mol Pharm [Image: see text] Prediction and engineering of protein solubility is an important but imprecise area. While some features are routinely used, such as the avoidance of extensive non-polar surface area, scope remains for benchmarking of sequence and structural features with experimental data. We study properties in the context of experimental solubilities, protein gene expression levels, and families of abundant proteins (serum albumin and myoglobin) and their less abundant paralogues. A common feature that emerges for proteins with elevated solubility and at higher expression and abundance levels is an increased ratio of lysine content to arginine content. We suggest that the same properties of arginine that give rise to its recorded propensity for specific interaction surfaces also lead to favorable interactions at nonspecific contacts, and thus lysine is favored for proteins at relatively high concentration. A survey of protein therapeutics shows that a significant subset possesses a relatively low lysine to arginine ratio, and therefore may not be favored for high protein concentration. We conclude that modulation of lysine and arginine content could prove a useful and relatively simple addition to the toolkit available for engineering protein solubility in biotechnological applications. American Chemical Society 2013-11-20 2014-01-06 /pmc/articles/PMC3885198/ /pubmed/24283752 http://dx.doi.org/10.1021/mp4004749 Text en Copyright © 2013 American Chemical Society Terms of Use CC-BY (http://pubs.acs.org/page/policy/authorchoice_ccby_termsofuse.html)
spellingShingle Warwicker, Jim
Charonis, Spyros
Curtis, Robin A.
Lysine and Arginine Content of Proteins: Computational Analysis Suggests a New Tool for Solubility Design
title Lysine and Arginine Content of Proteins: Computational Analysis Suggests a New Tool for Solubility Design
title_full Lysine and Arginine Content of Proteins: Computational Analysis Suggests a New Tool for Solubility Design
title_fullStr Lysine and Arginine Content of Proteins: Computational Analysis Suggests a New Tool for Solubility Design
title_full_unstemmed Lysine and Arginine Content of Proteins: Computational Analysis Suggests a New Tool for Solubility Design
title_short Lysine and Arginine Content of Proteins: Computational Analysis Suggests a New Tool for Solubility Design
title_sort lysine and arginine content of proteins: computational analysis suggests a new tool for solubility design
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3885198/
https://www.ncbi.nlm.nih.gov/pubmed/24283752
http://dx.doi.org/10.1021/mp4004749
work_keys_str_mv AT warwickerjim lysineandargininecontentofproteinscomputationalanalysissuggestsanewtoolforsolubilitydesign
AT charonisspyros lysineandargininecontentofproteinscomputationalanalysissuggestsanewtoolforsolubilitydesign
AT curtisrobina lysineandargininecontentofproteinscomputationalanalysissuggestsanewtoolforsolubilitydesign