Cargando…

Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models

Protein engineering is an emerging field in biotechnology that has the potential to revolutionize various areas, such as antibody design, drug discovery, food security, ecology, and more. However, the mutational space involved is too vast to be handled through experimental means alone. Leveraging ac...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiu, Yuchi, Wei, Guo-Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cornell University 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10402185/
https://www.ncbi.nlm.nih.gov/pubmed/37547662
_version_ 1785084816223371264
author Qiu, Yuchi
Wei, Guo-Wei
author_facet Qiu, Yuchi
Wei, Guo-Wei
author_sort Qiu, Yuchi
collection PubMed
description Protein engineering is an emerging field in biotechnology that has the potential to revolutionize various areas, such as antibody design, drug discovery, food security, ecology, and more. However, the mutational space involved is too vast to be handled through experimental means alone. Leveraging accumulative protein databases, machine learning (ML) models, particularly those based on natural language processing (NLP), have considerably expedited protein engineering. Moreover, advances in topological data analysis (TDA) and artificial intelligence-based protein structure prediction, such as AlphaFold2, have made more powerful structure-based ML-assisted protein engineering strategies possible. This review aims to offer a comprehensive, systematic, and indispensable set of methodological components, including TDA and NLP, for protein engineering and to facilitate their future development.
format Online
Article
Text
id pubmed-10402185
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cornell University
record_format MEDLINE/PubMed
spelling pubmed-104021852023-08-05 Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models Qiu, Yuchi Wei, Guo-Wei ArXiv Article Protein engineering is an emerging field in biotechnology that has the potential to revolutionize various areas, such as antibody design, drug discovery, food security, ecology, and more. However, the mutational space involved is too vast to be handled through experimental means alone. Leveraging accumulative protein databases, machine learning (ML) models, particularly those based on natural language processing (NLP), have considerably expedited protein engineering. Moreover, advances in topological data analysis (TDA) and artificial intelligence-based protein structure prediction, such as AlphaFold2, have made more powerful structure-based ML-assisted protein engineering strategies possible. This review aims to offer a comprehensive, systematic, and indispensable set of methodological components, including TDA and NLP, for protein engineering and to facilitate their future development. Cornell University 2023-07-27 /pmc/articles/PMC10402185/ /pubmed/37547662 Text en For permissions, please e-mail: journals.permissions@oup.com https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Qiu, Yuchi
Wei, Guo-Wei
Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models
title Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models
title_full Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models
title_fullStr Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models
title_full_unstemmed Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models
title_short Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models
title_sort artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10402185/
https://www.ncbi.nlm.nih.gov/pubmed/37547662
work_keys_str_mv AT qiuyuchi artificialintelligenceaidedproteinengineeringfromtopologicaldataanalysistodeepproteinlanguagemodels
AT weiguowei artificialintelligenceaidedproteinengineeringfromtopologicaldataanalysistodeepproteinlanguagemodels