Cargando…
ECNet is an evolutionary context-integrated deep learning framework for protein engineering
Machine learning has been increasingly used for protein engineering. However, because the general sequence contexts they capture are not specific to the protein being engineered, the accuracy of existing machine learning algorithms is rather limited. Here, we report ECNet (evolutionary context-integ...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484459/ https://www.ncbi.nlm.nih.gov/pubmed/34593817 http://dx.doi.org/10.1038/s41467-021-25976-8 |
_version_ | 1784577322641260544 |
---|---|
author | Luo, Yunan Jiang, Guangde Yu, Tianhao Liu, Yang Vo, Lam Ding, Hantian Su, Yufeng Qian, Wesley Wei Zhao, Huimin Peng, Jian |
author_facet | Luo, Yunan Jiang, Guangde Yu, Tianhao Liu, Yang Vo, Lam Ding, Hantian Su, Yufeng Qian, Wesley Wei Zhao, Huimin Peng, Jian |
author_sort | Luo, Yunan |
collection | PubMed |
description | Machine learning has been increasingly used for protein engineering. However, because the general sequence contexts they capture are not specific to the protein being engineered, the accuracy of existing machine learning algorithms is rather limited. Here, we report ECNet (evolutionary context-integrated neural network), a deep-learning algorithm that exploits evolutionary contexts to predict functional fitness for protein engineering. This algorithm integrates local evolutionary context from homologous sequences that explicitly model residue-residue epistasis for the protein of interest with the global evolutionary context that encodes rich semantic and structural features from the enormous protein sequence universe. As such, it enables accurate mapping from sequence to function and provides generalization from low-order mutants to higher-order mutants. We show that ECNet predicts the sequence-function relationship more accurately as compared to existing machine learning algorithms by using ~50 deep mutational scanning and random mutagenesis datasets. Moreover, we used ECNet to guide the engineering of TEM-1 β-lactamase and identified variants with improved ampicillin resistance with high success rates. |
format | Online Article Text |
id | pubmed-8484459 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-84844592021-10-22 ECNet is an evolutionary context-integrated deep learning framework for protein engineering Luo, Yunan Jiang, Guangde Yu, Tianhao Liu, Yang Vo, Lam Ding, Hantian Su, Yufeng Qian, Wesley Wei Zhao, Huimin Peng, Jian Nat Commun Article Machine learning has been increasingly used for protein engineering. However, because the general sequence contexts they capture are not specific to the protein being engineered, the accuracy of existing machine learning algorithms is rather limited. Here, we report ECNet (evolutionary context-integrated neural network), a deep-learning algorithm that exploits evolutionary contexts to predict functional fitness for protein engineering. This algorithm integrates local evolutionary context from homologous sequences that explicitly model residue-residue epistasis for the protein of interest with the global evolutionary context that encodes rich semantic and structural features from the enormous protein sequence universe. As such, it enables accurate mapping from sequence to function and provides generalization from low-order mutants to higher-order mutants. We show that ECNet predicts the sequence-function relationship more accurately as compared to existing machine learning algorithms by using ~50 deep mutational scanning and random mutagenesis datasets. Moreover, we used ECNet to guide the engineering of TEM-1 β-lactamase and identified variants with improved ampicillin resistance with high success rates. Nature Publishing Group UK 2021-09-30 /pmc/articles/PMC8484459/ /pubmed/34593817 http://dx.doi.org/10.1038/s41467-021-25976-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Luo, Yunan Jiang, Guangde Yu, Tianhao Liu, Yang Vo, Lam Ding, Hantian Su, Yufeng Qian, Wesley Wei Zhao, Huimin Peng, Jian ECNet is an evolutionary context-integrated deep learning framework for protein engineering |
title | ECNet is an evolutionary context-integrated deep learning framework for protein engineering |
title_full | ECNet is an evolutionary context-integrated deep learning framework for protein engineering |
title_fullStr | ECNet is an evolutionary context-integrated deep learning framework for protein engineering |
title_full_unstemmed | ECNet is an evolutionary context-integrated deep learning framework for protein engineering |
title_short | ECNet is an evolutionary context-integrated deep learning framework for protein engineering |
title_sort | ecnet is an evolutionary context-integrated deep learning framework for protein engineering |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484459/ https://www.ncbi.nlm.nih.gov/pubmed/34593817 http://dx.doi.org/10.1038/s41467-021-25976-8 |
work_keys_str_mv | AT luoyunan ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering AT jiangguangde ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering AT yutianhao ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering AT liuyang ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering AT volam ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering AT dinghantian ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering AT suyufeng ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering AT qianwesleywei ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering AT zhaohuimin ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering AT pengjian ecnetisanevolutionarycontextintegrateddeeplearningframeworkforproteinengineering |