Cargando…

Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli

Periplasmic expression of soluble proteins in Escherichia coli not only offers a much-simplified downstream purification process, but also enhances the probability of obtaining correctly folded and biologically active proteins. Different combinations of signal peptides and target proteins lead to di...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Catherine Ching Han, Li, Chen, Webb, Geoffrey I., Tey, BengTi, Song, Jiangning, Ramanan, Ramakrishnan Nagasundara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4773868/
https://www.ncbi.nlm.nih.gov/pubmed/26931649
http://dx.doi.org/10.1038/srep21844
_version_ 1782418823515734016
author Chang, Catherine Ching Han
Li, Chen
Webb, Geoffrey I.
Tey, BengTi
Song, Jiangning
Ramanan, Ramakrishnan Nagasundara
author_facet Chang, Catherine Ching Han
Li, Chen
Webb, Geoffrey I.
Tey, BengTi
Song, Jiangning
Ramanan, Ramakrishnan Nagasundara
author_sort Chang, Catherine Ching Han
collection PubMed
description Periplasmic expression of soluble proteins in Escherichia coli not only offers a much-simplified downstream purification process, but also enhances the probability of obtaining correctly folded and biologically active proteins. Different combinations of signal peptides and target proteins lead to different soluble protein expression levels, ranging from negligible to several grams per litre. Accurate algorithms for rational selection of promising candidates can serve as a powerful tool to complement with current trial-and-error approaches. Accordingly, proteomics studies can be conducted with greater efficiency and cost-effectiveness. Here, we developed a predictor with a two-stage architecture, to predict the real-valued expression level of target protein in the periplasm. The output of the first-stage support vector machine (SVM) classifier determines which second-stage support vector regression (SVR) classifier to be used. When tested on an independent test dataset, the predictor achieved an overall prediction accuracy of 78% and a Pearson’s correlation coefficient (PCC) of 0.77. We further illustrate the relative importance of various features with respect to different models. The results indicate that the occurrence of dipeptide glutamine and aspartic acid is the most important feature for the classification model. Finally, we provide access to the implemented predictor through the Periscope webserver, freely accessible at http://lightning.med.monash.edu/periscope/.
format Online
Article
Text
id pubmed-4773868
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-47738682016-03-09 Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli Chang, Catherine Ching Han Li, Chen Webb, Geoffrey I. Tey, BengTi Song, Jiangning Ramanan, Ramakrishnan Nagasundara Sci Rep Article Periplasmic expression of soluble proteins in Escherichia coli not only offers a much-simplified downstream purification process, but also enhances the probability of obtaining correctly folded and biologically active proteins. Different combinations of signal peptides and target proteins lead to different soluble protein expression levels, ranging from negligible to several grams per litre. Accurate algorithms for rational selection of promising candidates can serve as a powerful tool to complement with current trial-and-error approaches. Accordingly, proteomics studies can be conducted with greater efficiency and cost-effectiveness. Here, we developed a predictor with a two-stage architecture, to predict the real-valued expression level of target protein in the periplasm. The output of the first-stage support vector machine (SVM) classifier determines which second-stage support vector regression (SVR) classifier to be used. When tested on an independent test dataset, the predictor achieved an overall prediction accuracy of 78% and a Pearson’s correlation coefficient (PCC) of 0.77. We further illustrate the relative importance of various features with respect to different models. The results indicate that the occurrence of dipeptide glutamine and aspartic acid is the most important feature for the classification model. Finally, we provide access to the implemented predictor through the Periscope webserver, freely accessible at http://lightning.med.monash.edu/periscope/. Nature Publishing Group 2016-03-02 /pmc/articles/PMC4773868/ /pubmed/26931649 http://dx.doi.org/10.1038/srep21844 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Chang, Catherine Ching Han
Li, Chen
Webb, Geoffrey I.
Tey, BengTi
Song, Jiangning
Ramanan, Ramakrishnan Nagasundara
Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli
title Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli
title_full Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli
title_fullStr Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli
title_full_unstemmed Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli
title_short Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli
title_sort periscope: quantitative prediction of soluble protein expression in the periplasm of escherichia coli
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4773868/
https://www.ncbi.nlm.nih.gov/pubmed/26931649
http://dx.doi.org/10.1038/srep21844
work_keys_str_mv AT changcatherinechinghan periscopequantitativepredictionofsolubleproteinexpressionintheperiplasmofescherichiacoli
AT lichen periscopequantitativepredictionofsolubleproteinexpressionintheperiplasmofescherichiacoli
AT webbgeoffreyi periscopequantitativepredictionofsolubleproteinexpressionintheperiplasmofescherichiacoli
AT teybengti periscopequantitativepredictionofsolubleproteinexpressionintheperiplasmofescherichiacoli
AT songjiangning periscopequantitativepredictionofsolubleproteinexpressionintheperiplasmofescherichiacoli
AT ramananramakrishnannagasundara periscopequantitativepredictionofsolubleproteinexpressionintheperiplasmofescherichiacoli