Cargando…
Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models
Protein engineering is an emerging field in biotechnology that has the potential to revolutionize various areas, such as antibody design, drug discovery, food security, ecology, and more. However, the mutational space involved is too vast to be handled through experimental means alone. Leveraging ac...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cornell University
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10402185/ https://www.ncbi.nlm.nih.gov/pubmed/37547662 |
_version_ | 1785084816223371264 |
---|---|
author | Qiu, Yuchi Wei, Guo-Wei |
author_facet | Qiu, Yuchi Wei, Guo-Wei |
author_sort | Qiu, Yuchi |
collection | PubMed |
description | Protein engineering is an emerging field in biotechnology that has the potential to revolutionize various areas, such as antibody design, drug discovery, food security, ecology, and more. However, the mutational space involved is too vast to be handled through experimental means alone. Leveraging accumulative protein databases, machine learning (ML) models, particularly those based on natural language processing (NLP), have considerably expedited protein engineering. Moreover, advances in topological data analysis (TDA) and artificial intelligence-based protein structure prediction, such as AlphaFold2, have made more powerful structure-based ML-assisted protein engineering strategies possible. This review aims to offer a comprehensive, systematic, and indispensable set of methodological components, including TDA and NLP, for protein engineering and to facilitate their future development. |
format | Online Article Text |
id | pubmed-10402185 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cornell University |
record_format | MEDLINE/PubMed |
spelling | pubmed-104021852023-08-05 Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models Qiu, Yuchi Wei, Guo-Wei ArXiv Article Protein engineering is an emerging field in biotechnology that has the potential to revolutionize various areas, such as antibody design, drug discovery, food security, ecology, and more. However, the mutational space involved is too vast to be handled through experimental means alone. Leveraging accumulative protein databases, machine learning (ML) models, particularly those based on natural language processing (NLP), have considerably expedited protein engineering. Moreover, advances in topological data analysis (TDA) and artificial intelligence-based protein structure prediction, such as AlphaFold2, have made more powerful structure-based ML-assisted protein engineering strategies possible. This review aims to offer a comprehensive, systematic, and indispensable set of methodological components, including TDA and NLP, for protein engineering and to facilitate their future development. Cornell University 2023-07-27 /pmc/articles/PMC10402185/ /pubmed/37547662 Text en For permissions, please e-mail: journals.permissions@oup.com https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Qiu, Yuchi Wei, Guo-Wei Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models |
title | Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models |
title_full | Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models |
title_fullStr | Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models |
title_full_unstemmed | Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models |
title_short | Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models |
title_sort | artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10402185/ https://www.ncbi.nlm.nih.gov/pubmed/37547662 |
work_keys_str_mv | AT qiuyuchi artificialintelligenceaidedproteinengineeringfromtopologicaldataanalysistodeepproteinlanguagemodels AT weiguowei artificialintelligenceaidedproteinengineeringfromtopologicaldataanalysistodeepproteinlanguagemodels |