Cargando…

ECNet is an evolutionary context-integrated deep learning framework for protein engineering

Machine learning has been increasingly used for protein engineering. However, because the general sequence contexts they capture are not specific to the protein being engineered, the accuracy of existing machine learning algorithms is rather limited. Here, we report ECNet (evolutionary context-integ...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Yunan, Jiang, Guangde, Yu, Tianhao, Liu, Yang, Vo, Lam, Ding, Hantian, Su, Yufeng, Qian, Wesley Wei, Zhao, Huimin, Peng, Jian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484459/
https://www.ncbi.nlm.nih.gov/pubmed/34593817
http://dx.doi.org/10.1038/s41467-021-25976-8
_version_ 1784577322641260544
author Luo, Yunan
Jiang, Guangde
Yu, Tianhao
Liu, Yang
Vo, Lam
Ding, Hantian
Su, Yufeng
Qian, Wesley Wei
Zhao, Huimin
Peng, Jian
author_facet Luo, Yunan
Jiang, Guangde
Yu, Tianhao
Liu, Yang
Vo, Lam
Ding, Hantian
Su, Yufeng
Qian, Wesley Wei
Zhao, Huimin
Peng, Jian
author_sort Luo, Yunan
collection PubMed
description Machine learning has been increasingly used for protein engineering. However, because the general sequence contexts they capture are not specific to the protein being engineered, the accuracy of existing machine learning algorithms is rather limited. Here, we report ECNet (evolutionary context-integrated neural network), a deep-learning algorithm that exploits evolutionary contexts to predict functional fitness for protein engineering. This algorithm integrates local evolutionary context from homologous sequences that explicitly model residue-residue epistasis for the protein of interest with the global evolutionary context that encodes rich semantic and structural features from the enormous protein sequence universe. As such, it enables accurate mapping from sequence to function and provides generalization from low-order mutants to higher-order mutants. We show that ECNet predicts the sequence-function relationship more accurately as compared to existing machine learning algorithms by using ~50 deep mutational scanning and random mutagenesis datasets. Moreover, we used ECNet to guide the engineering of TEM-1 β-lactamase and identified variants with improved ampicillin resistance with high success rates.
format Online
Article
Text
id pubmed-8484459
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-84844592021-10-22 ECNet is an evolutionary context-integrated deep learning framework for protein engineering Luo, Yunan Jiang, Guangde Yu, Tianhao Liu, Yang Vo, Lam Ding, Hantian Su, Yufeng Qian, Wesley Wei Zhao, Huimin Peng, Jian Nat Commun Article Machine learning has been increasingly used for protein engineering. However, because the general sequence contexts they capture are not specific to the protein being engineered, the accuracy of existing machine learning algorithms is rather limited. Here, we report ECNet (evolutionary context-integrated neural network), a deep-learning algorithm that exploits evolutionary contexts to predict functional fitness for protein engineering. This algorithm integrates local evolutionary context from homologous sequences that explicitly model residue-residue epistasis for the protein of interest with the global evolutionary context that encodes rich semantic and structural features from the enormous protein sequence universe. As such, it enables accurate mapping from sequence to function and provides generalization from low-order mutants to higher-order mutants. We show that ECNet predicts the sequence-function relationship more accurately as compared to existing machine learning algorithms by using ~50 deep mutational scanning and random mutagenesis datasets. Moreover, we used ECNet to guide the engineering of TEM-1 β-lactamase and identified variants with improved ampicillin resistance with high success rates. Nature Publishing Group UK 2021-09-30 /pmc/articles/PMC8484459/ /pubmed/34593817 http://dx.doi.org/10.1038/s41467-021-25976-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Luo, Yunan
Jiang, Guangde
Yu, Tianhao
Liu, Yang
Vo, Lam
Ding, Hantian
Su, Yufeng
Qian, Wesley Wei
Zhao, Huimin
Peng, Jian
ECNet is an evolutionary context-integrated deep learning framework for protein engineering
title ECNet is an evolutionary context-integrated deep learning framework for protein engineering
title_full ECNet is an evolutionary context-integrated deep learning framework for protein engineering
title_fullStr ECNet is an evolutionary context-integrated deep learning framework for protein engineering
title_full_unstemmed ECNet is an evolutionary context-integrated deep learning framework for protein engineering
title_short ECNet is an evolutionary context-integrated deep learning framework for protein engineering
title_sort ecnet is an evolutionary context-integrated deep learning framework for protein engineering
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484459/
https://www.ncbi.nlm.nih.gov/pubmed/34593817
http://dx.doi.org/10.1038/s41467-021-25976-8
work_keys_str_mv AT luoyunan ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering
AT jiangguangde ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering
AT yutianhao ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering
AT liuyang ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering
AT volam ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering
AT dinghantian ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering
AT suyufeng ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering
AT qianwesleywei ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering
AT zhaohuimin ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering
AT pengjian ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering