Cargando…

From sequence to function through structure: Deep learning for protein design

The process of designing biomolecules, in particular proteins, is witnessing a rapid change in available tooling and approaches, moving from design through physicochemical force fields, to producing plausible, complex sequences fast via end-to-end differentiable statistical models. To achieve condit...

Descripción completa

Detalles Bibliográficos
Autores principales: Ferruz, Noelia, Heinzinger, Michael, Akdel, Mehmet, Goncearenco, Alexander, Naef, Luca, Dallago, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9755234/
https://www.ncbi.nlm.nih.gov/pubmed/36544476
http://dx.doi.org/10.1016/j.csbj.2022.11.014
_version_ 1784851385259393024
author Ferruz, Noelia
Heinzinger, Michael
Akdel, Mehmet
Goncearenco, Alexander
Naef, Luca
Dallago, Christian
author_facet Ferruz, Noelia
Heinzinger, Michael
Akdel, Mehmet
Goncearenco, Alexander
Naef, Luca
Dallago, Christian
author_sort Ferruz, Noelia
collection PubMed
description The process of designing biomolecules, in particular proteins, is witnessing a rapid change in available tooling and approaches, moving from design through physicochemical force fields, to producing plausible, complex sequences fast via end-to-end differentiable statistical models. To achieve conditional and controllable protein design, researchers at the interface of artificial intelligence and biology leverage advances in natural language processing (NLP) and computer vision techniques, coupled with advances in computing hardware to learn patterns from growing biological databases, curated annotations thereof, or both. Once learned, these patterns can be used to provide novel insights into mechanistic biology and the design of biomolecules. However, navigating and understanding the practical applications for the many recent protein design tools is complex. To facilitate this, we 1) document recent advances in deep learning (DL) assisted protein design from the last three years, 2) present a practical pipeline that allows to go from de novo-generated sequences to their predicted properties and web-powered visualization within minutes, and 3) leverage it to suggest a generated protein sequence which might be used to engineer a biosynthetic gene cluster to produce a molecular glue-like compound. Lastly, we discuss challenges and highlight opportunities for the protein design field.
format Online
Article
Text
id pubmed-9755234
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-97552342022-12-20 From sequence to function through structure: Deep learning for protein design Ferruz, Noelia Heinzinger, Michael Akdel, Mehmet Goncearenco, Alexander Naef, Luca Dallago, Christian Comput Struct Biotechnol J Mini Review The process of designing biomolecules, in particular proteins, is witnessing a rapid change in available tooling and approaches, moving from design through physicochemical force fields, to producing plausible, complex sequences fast via end-to-end differentiable statistical models. To achieve conditional and controllable protein design, researchers at the interface of artificial intelligence and biology leverage advances in natural language processing (NLP) and computer vision techniques, coupled with advances in computing hardware to learn patterns from growing biological databases, curated annotations thereof, or both. Once learned, these patterns can be used to provide novel insights into mechanistic biology and the design of biomolecules. However, navigating and understanding the practical applications for the many recent protein design tools is complex. To facilitate this, we 1) document recent advances in deep learning (DL) assisted protein design from the last three years, 2) present a practical pipeline that allows to go from de novo-generated sequences to their predicted properties and web-powered visualization within minutes, and 3) leverage it to suggest a generated protein sequence which might be used to engineer a biosynthetic gene cluster to produce a molecular glue-like compound. Lastly, we discuss challenges and highlight opportunities for the protein design field. Research Network of Computational and Structural Biotechnology 2022-11-19 /pmc/articles/PMC9755234/ /pubmed/36544476 http://dx.doi.org/10.1016/j.csbj.2022.11.014 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Mini Review
Ferruz, Noelia
Heinzinger, Michael
Akdel, Mehmet
Goncearenco, Alexander
Naef, Luca
Dallago, Christian
From sequence to function through structure: Deep learning for protein design
title From sequence to function through structure: Deep learning for protein design
title_full From sequence to function through structure: Deep learning for protein design
title_fullStr From sequence to function through structure: Deep learning for protein design
title_full_unstemmed From sequence to function through structure: Deep learning for protein design
title_short From sequence to function through structure: Deep learning for protein design
title_sort from sequence to function through structure: deep learning for protein design
topic Mini Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9755234/
https://www.ncbi.nlm.nih.gov/pubmed/36544476
http://dx.doi.org/10.1016/j.csbj.2022.11.014
work_keys_str_mv AT ferruznoelia fromsequencetofunctionthroughstructuredeeplearningforproteindesign
AT heinzingermichael fromsequencetofunctionthroughstructuredeeplearningforproteindesign
AT akdelmehmet fromsequencetofunctionthroughstructuredeeplearningforproteindesign
AT goncearencoalexander fromsequencetofunctionthroughstructuredeeplearningforproteindesign
AT naefluca fromsequencetofunctionthroughstructuredeeplearningforproteindesign
AT dallagochristian fromsequencetofunctionthroughstructuredeeplearningforproteindesign