Cargando…

Rational incorporation of any unnatural amino acid into proteins by machine learning on existing experimental proofs

The unnatural amino acid (UAA) incorporation technique through genetic code expansion has been extensively used in protein engineering for the last two decades. Mutations into UAAs offer more dimensions to tune protein structures and functions. However, the huge library of optional UAAs and various...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Haoran, Zheng, Zhetao, Dong, Liangzhen, Shi, Ningning, Yang, Yuelin, Chen, Hongmin, Shen, Yuxuan, Xia, Qing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9472073/
https://www.ncbi.nlm.nih.gov/pubmed/36147660
http://dx.doi.org/10.1016/j.csbj.2022.08.063
_version_ 1784789227470323712
author Zhang, Haoran
Zheng, Zhetao
Dong, Liangzhen
Shi, Ningning
Yang, Yuelin
Chen, Hongmin
Shen, Yuxuan
Xia, Qing
author_facet Zhang, Haoran
Zheng, Zhetao
Dong, Liangzhen
Shi, Ningning
Yang, Yuelin
Chen, Hongmin
Shen, Yuxuan
Xia, Qing
author_sort Zhang, Haoran
collection PubMed
description The unnatural amino acid (UAA) incorporation technique through genetic code expansion has been extensively used in protein engineering for the last two decades. Mutations into UAAs offer more dimensions to tune protein structures and functions. However, the huge library of optional UAAs and various circumstances of mutation sites on different proteins urge rational UAA incorporations guided by artificial intelligence. Here we collected existing experimental proofs of UAA-incorporated proteins in literature and established a database of known UAA substitution sites. By program designing and machine learning on the database, we showed that UAA incorporations into proteins are predictable by the observed evolutional, steric and physiochemical factors. Based on the predicted probability of successful UAA substitutions, we tested the model performance using literature-reported and freshly-designed experimental proofs, and demonstrated its potential in screening UAA-incorporated proteins. This work expands structure-based computational biology and virtual screening to UAA-incorporated proteins, and offers a useful tool to automate the rational design of proteins with any UAA.
format Online
Article
Text
id pubmed-9472073
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-94720732022-09-21 Rational incorporation of any unnatural amino acid into proteins by machine learning on existing experimental proofs Zhang, Haoran Zheng, Zhetao Dong, Liangzhen Shi, Ningning Yang, Yuelin Chen, Hongmin Shen, Yuxuan Xia, Qing Comput Struct Biotechnol J Research Article The unnatural amino acid (UAA) incorporation technique through genetic code expansion has been extensively used in protein engineering for the last two decades. Mutations into UAAs offer more dimensions to tune protein structures and functions. However, the huge library of optional UAAs and various circumstances of mutation sites on different proteins urge rational UAA incorporations guided by artificial intelligence. Here we collected existing experimental proofs of UAA-incorporated proteins in literature and established a database of known UAA substitution sites. By program designing and machine learning on the database, we showed that UAA incorporations into proteins are predictable by the observed evolutional, steric and physiochemical factors. Based on the predicted probability of successful UAA substitutions, we tested the model performance using literature-reported and freshly-designed experimental proofs, and demonstrated its potential in screening UAA-incorporated proteins. This work expands structure-based computational biology and virtual screening to UAA-incorporated proteins, and offers a useful tool to automate the rational design of proteins with any UAA. Research Network of Computational and Structural Biotechnology 2022-09-05 /pmc/articles/PMC9472073/ /pubmed/36147660 http://dx.doi.org/10.1016/j.csbj.2022.08.063 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Zhang, Haoran
Zheng, Zhetao
Dong, Liangzhen
Shi, Ningning
Yang, Yuelin
Chen, Hongmin
Shen, Yuxuan
Xia, Qing
Rational incorporation of any unnatural amino acid into proteins by machine learning on existing experimental proofs
title Rational incorporation of any unnatural amino acid into proteins by machine learning on existing experimental proofs
title_full Rational incorporation of any unnatural amino acid into proteins by machine learning on existing experimental proofs
title_fullStr Rational incorporation of any unnatural amino acid into proteins by machine learning on existing experimental proofs
title_full_unstemmed Rational incorporation of any unnatural amino acid into proteins by machine learning on existing experimental proofs
title_short Rational incorporation of any unnatural amino acid into proteins by machine learning on existing experimental proofs
title_sort rational incorporation of any unnatural amino acid into proteins by machine learning on existing experimental proofs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9472073/
https://www.ncbi.nlm.nih.gov/pubmed/36147660
http://dx.doi.org/10.1016/j.csbj.2022.08.063
work_keys_str_mv AT zhanghaoran rationalincorporationofanyunnaturalaminoacidintoproteinsbymachinelearningonexistingexperimentalproofs
AT zhengzhetao rationalincorporationofanyunnaturalaminoacidintoproteinsbymachinelearningonexistingexperimentalproofs
AT dongliangzhen rationalincorporationofanyunnaturalaminoacidintoproteinsbymachinelearningonexistingexperimentalproofs
AT shiningning rationalincorporationofanyunnaturalaminoacidintoproteinsbymachinelearningonexistingexperimentalproofs
AT yangyuelin rationalincorporationofanyunnaturalaminoacidintoproteinsbymachinelearningonexistingexperimentalproofs
AT chenhongmin rationalincorporationofanyunnaturalaminoacidintoproteinsbymachinelearningonexistingexperimentalproofs
AT shenyuxuan rationalincorporationofanyunnaturalaminoacidintoproteinsbymachinelearningonexistingexperimentalproofs
AT xiaqing rationalincorporationofanyunnaturalaminoacidintoproteinsbymachinelearningonexistingexperimentalproofs