PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in Escherichia coli

Bibliographic Details
Main authors: Packiam, Kulandai Arockia Rajesh, Ooi, Chien Wei, Li, Fuyi, Mei, Shutao, Tey, Beng Ti, Ong, Huey Fang, Song, Jiangning, Ramanan, Ramakrishnan Nagasundara
Format: Online Article Text
Language: English
Published: Research Network of Computational and Structural Biotechnology 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9201004/
https://www.ncbi.nlm.nih.gov/pubmed/35765650
http://dx.doi.org/10.1016/j.csbj.2022.06.006
author Packiam, Kulandai Arockia Rajesh
Ooi, Chien Wei
Li, Fuyi
Mei, Shutao
Tey, Beng Ti
Ong, Huey Fang
Song, Jiangning
Ramanan, Ramakrishnan Nagasundara
collection PubMed
description Optimization of the fermentation process for recombinant protein production (RPP) is often resource-intensive. Machine learning (ML) approaches help reduce the amount of experimentation required and are widely applied in RPP. However, existing ML-based tools focus primarily on features derived from the amino acid sequence and ignore the influence of fermentation process conditions. The present study combines features derived from fermentation process conditions with those derived from the amino acid sequence to construct an ML-based model that predicts the maximal protein yield and the corresponding fermentation conditions for expressing a target recombinant protein in the Escherichia coli periplasm. In the first stage, two sets of XGBoost classifiers classify the expression level of the target protein as high (>50 mg/L), medium (between 0.5 and 50 mg/L), or low (<0.5 mg/L). The second stage consists of three regression models, based on support vector machines and random forest, that predict the expression yield for each expression-level class. Independent tests showed that the predictor achieved an overall average accuracy of 75% and a Pearson correlation coefficient of 0.91 for the correctly classified instances. The model therefore offers a reliable substitute for numerous trial-and-error experiments when identifying the optimal fermentation conditions and yield for RPP. It is also implemented as an open-access webserver, PERISCOPE-Opt (http://periscope-opt.erc.monash.edu).
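The two-stage scheme and the evaluation described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`yield_class`, `two_stage_predict`, `evaluate`), the treatment of the 0.5 and 50 mg/L boundaries as belonging to the medium class, and the use of plain accuracy in place of the reported "overall average accuracy" are all assumptions made for illustration. Only the thresholds and the idea of computing the Pearson correlation on correctly classified instances come from the abstract. The classifier and regressors are passed in as ordinary callables, so no particular ML library is implied.

```python
from math import sqrt

# Thresholds in mg/L, taken from the abstract.
HIGH_MIN = 50.0
LOW_MAX = 0.5


def yield_class(yield_mg_per_l):
    """Map a yield to an expression-level class.

    Assumption: the boundary values 0.5 and 50 count as "medium".
    """
    if yield_mg_per_l > HIGH_MIN:
        return "high"
    if yield_mg_per_l < LOW_MAX:
        return "low"
    return "medium"


def two_stage_predict(features, classify, regressors):
    """Stage 1 picks a class; stage 2 uses that class's regressor.

    `classify` and the values of `regressors` are hypothetical callables
    standing in for trained models.
    """
    label = classify(features)
    return label, regressors[label](features)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def evaluate(true_yields, predicted_classes, predicted_yields):
    """Return (accuracy, Pearson r over correctly classified instances)."""
    true_classes = [yield_class(y) for y in true_yields]
    hits = [i for i, (t, p) in enumerate(zip(true_classes, predicted_classes))
            if t == p]
    accuracy = len(hits) / len(true_yields)
    r = pearson([true_yields[i] for i in hits],
                [predicted_yields[i] for i in hits])
    return accuracy, r
```

For example, `evaluate([0.2, 10.0, 100.0, 60.0], ["low", "medium", "high", "medium"], [0.1, 20.0, 200.0, 30.0])` scores three of four classes as correct and computes the correlation over those three instances only.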
format Online
Article
Text
id pubmed-9201004
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-9201004 2022-06-27 Comput Struct Biotechnol J Research Article
Research Network of Computational and Structural Biotechnology 2022-06-03 /pmc/articles/PMC9201004/ /pubmed/35765650 http://dx.doi.org/10.1016/j.csbj.2022.06.006 Text en © 2022 The Authors. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
title PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in Escherichia coli
topic Research Article