Solubility-Weighted Index: fast and accurate prediction of protein solubility
MOTIVATION: Recombinant protein production is a widely used technique in the biotechnology and biomedical industries, yet only a quarter of target proteins are soluble and can therefore be purified. RESULTS: We have discovered that global structural flexibility, which can be modeled by normalized B-...
Main authors: | Bhandari, Bikash K; Gardner, Paul P; Lim, Chun Shen |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2020 |
Subjects: | Original Papers |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7750957/ https://www.ncbi.nlm.nih.gov/pubmed/32559287 http://dx.doi.org/10.1093/bioinformatics/btaa578 |
_version_ | 1783625579445092352 |
---|---|
author | Bhandari, Bikash K Gardner, Paul P Lim, Chun Shen |
author_facet | Bhandari, Bikash K Gardner, Paul P Lim, Chun Shen |
author_sort | Bhandari, Bikash K |
collection | PubMed |
description | MOTIVATION: Recombinant protein production is a widely used technique in the biotechnology and biomedical industries, yet only a quarter of target proteins are soluble and can therefore be purified. RESULTS: We have discovered that global structural flexibility, which can be modeled by normalized B-factors, accurately predicts the solubility of 12 216 recombinant proteins expressed in Escherichia coli. We have optimized these B-factors, and derived a new set of values for solubility scoring that further improves prediction accuracy. We call this new predictor the ‘Solubility-Weighted Index’ (SWI). Importantly, SWI outperforms many existing protein solubility prediction tools. Furthermore, we have developed ‘SoDoPE’ (Soluble Domain for Protein Expression), a web interface that allows users to choose a protein region of interest for predicting and maximizing both protein expression and solubility. AVAILABILITY AND IMPLEMENTATION: The SoDoPE web server and source code are freely available at https://tisigner.com/sodope and https://github.com/Gardner-BinfLab/TISIGNER-ReactJS, respectively. The code and data for reproducing our analysis can be found at https://github.com/Gardner-BinfLab/SoDoPE_paper_2020. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7750957 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-7750957 2020-12-28 Solubility-Weighted Index: fast and accurate prediction of protein solubility Bhandari, Bikash K Gardner, Paul P Lim, Chun Shen Bioinformatics Original Papers Oxford University Press 2020-06-19 /pmc/articles/PMC7750957/ /pubmed/32559287 http://dx.doi.org/10.1093/bioinformatics/btaa578 Text en © The Author(s) 2020. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Bhandari, Bikash K Gardner, Paul P Lim, Chun Shen Solubility-Weighted Index: fast and accurate prediction of protein solubility |
title | Solubility-Weighted Index: fast and accurate prediction of protein solubility |
title_full | Solubility-Weighted Index: fast and accurate prediction of protein solubility |
title_fullStr | Solubility-Weighted Index: fast and accurate prediction of protein solubility |
title_full_unstemmed | Solubility-Weighted Index: fast and accurate prediction of protein solubility |
title_short | Solubility-Weighted Index: fast and accurate prediction of protein solubility |
title_sort | solubility-weighted index: fast and accurate prediction of protein solubility |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7750957/ https://www.ncbi.nlm.nih.gov/pubmed/32559287 http://dx.doi.org/10.1093/bioinformatics/btaa578 |
work_keys_str_mv | AT bhandaribikashk solubilityweightedindexfastandaccuratepredictionofproteinsolubility AT gardnerpaulp solubilityweightedindexfastandaccuratepredictionofproteinsolubility AT limchunshen solubilityweightedindexfastandaccuratepredictionofproteinsolubility |
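
The description field above speaks of scoring solubility from a set of per-residue values and of choosing a protein region of interest. The record does not give the scoring formula or the values, so the following is only a minimal sketch under stated assumptions: the score is taken to be the arithmetic mean of per-residue weights, the weights in `PLACEHOLDER_WEIGHTS` are invented for illustration and are not the published SWI values, and the function names `weighted_index` and `best_window` are hypothetical.

```python
from typing import Dict, Tuple

# Hypothetical per-residue weights, for illustration only.
# These are NOT the published SWI values.
PLACEHOLDER_WEIGHTS: Dict[str, float] = {
    "A": 0.8, "D": 0.9, "E": 1.0, "K": 0.9, "G": 0.8,
    "L": 0.6, "I": 0.6, "V": 0.7, "F": 0.5, "W": 0.4,
}
DEFAULT_WEIGHT = 0.7  # assumed fallback for residues missing from the table


def weighted_index(
    sequence: str,
    weights: Dict[str, float] = PLACEHOLDER_WEIGHTS,
    default: float = DEFAULT_WEIGHT,
) -> float:
    """Return the arithmetic mean of per-residue weights (assumed formula)."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    total = sum(weights.get(residue, default) for residue in sequence.upper())
    return total / len(sequence)


def best_window(sequence: str, width: int) -> Tuple[int, float]:
    """Return (start, score) of the highest-scoring window of the given width.

    Ties are resolved in favour of the earliest window.
    """
    if width <= 0 or width > len(sequence):
        raise ValueError("width must be between 1 and len(sequence)")
    best_start, best_score = 0, weighted_index(sequence[:width])
    for start in range(1, len(sequence) - width + 1):
        score = weighted_index(sequence[start:start + width])
        if score > best_score:
            best_start, best_score = start, score
    return best_start, best_score


if __name__ == "__main__":
    toy_sequence = "WWEEWW"  # toy input, not a real protein
    print(weighted_index(toy_sequence))
    print(best_window(toy_sequence, 2))
```

A per-residue lookup followed by a mean keeps the cost linear in sequence length, which fits the "fast" claim in the title, but whether the published predictor works exactly this way should be checked against the paper and the linked repositories.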