Cargando…

Learning deep representations of enzyme thermal adaptation

Temperature is a fundamental environmental factor that shapes the evolution of organisms. Learning thermal determinants of protein sequences in evolution thus has profound significance for basic biology, drug discovery, and protein engineering. Here, we use a data set of over 3 million BRENDA enzyme...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Gang, Buric, Filip, Zrimec, Jan, Viknander, Sandra, Nielsen, Jens, Zelezniak, Aleksej, Engqvist, Martin K. M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679980/
https://www.ncbi.nlm.nih.gov/pubmed/36261883
http://dx.doi.org/10.1002/pro.4480
_version_ 1784834310733299712
author Li, Gang
Buric, Filip
Zrimec, Jan
Viknander, Sandra
Nielsen, Jens
Zelezniak, Aleksej
Engqvist, Martin K. M.
author_facet Li, Gang
Buric, Filip
Zrimec, Jan
Viknander, Sandra
Nielsen, Jens
Zelezniak, Aleksej
Engqvist, Martin K. M.
author_sort Li, Gang
collection PubMed
description Temperature is a fundamental environmental factor that shapes the evolution of organisms. Learning thermal determinants of protein sequences in evolution thus has profound significance for basic biology, drug discovery, and protein engineering. Here, we use a data set of over 3 million BRENDA enzymes labeled with optimal growth temperatures (OGTs) of their source organisms to train a deep neural network model (DeepET). The protein‐temperature representations learned by DeepET provide a temperature‐related statistical summary of protein sequences and capture structural properties that affect thermal stability. For prediction of enzyme optimal catalytic temperatures and protein melting temperatures via a transfer learning approach, our DeepET model outperforms classical regression models trained on rationally designed features and other deep‐learning‐based representations. DeepET thus holds promise for understanding enzyme thermal adaptation and guiding the engineering of thermostable enzymes.
format Online
Article
Text
id pubmed-9679980
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-96799802022-12-01 Learning deep representations of enzyme thermal adaptation Li, Gang Buric, Filip Zrimec, Jan Viknander, Sandra Nielsen, Jens Zelezniak, Aleksej Engqvist, Martin K. M. Protein Sci Methods and Applications Temperature is a fundamental environmental factor that shapes the evolution of organisms. Learning thermal determinants of protein sequences in evolution thus has profound significance for basic biology, drug discovery, and protein engineering. Here, we use a data set of over 3 million BRENDA enzymes labeled with optimal growth temperatures (OGTs) of their source organisms to train a deep neural network model (DeepET). The protein‐temperature representations learned by DeepET provide a temperature‐related statistical summary of protein sequences and capture structural properties that affect thermal stability. For prediction of enzyme optimal catalytic temperatures and protein melting temperatures via a transfer learning approach, our DeepET model outperforms classical regression models trained on rationally designed features and other deep‐learning‐based representations. DeepET thus holds promise for understanding enzyme thermal adaptation and guiding the engineering of thermostable enzymes. John Wiley & Sons, Inc. 2022-12 /pmc/articles/PMC9679980/ /pubmed/36261883 http://dx.doi.org/10.1002/pro.4480 Text en © 2022 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods and Applications
Li, Gang
Buric, Filip
Zrimec, Jan
Viknander, Sandra
Nielsen, Jens
Zelezniak, Aleksej
Engqvist, Martin K. M.
Learning deep representations of enzyme thermal adaptation
title Learning deep representations of enzyme thermal adaptation
title_full Learning deep representations of enzyme thermal adaptation
title_fullStr Learning deep representations of enzyme thermal adaptation
title_full_unstemmed Learning deep representations of enzyme thermal adaptation
title_short Learning deep representations of enzyme thermal adaptation
title_sort learning deep representations of enzyme thermal adaptation
topic Methods and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679980/
https://www.ncbi.nlm.nih.gov/pubmed/36261883
http://dx.doi.org/10.1002/pro.4480
work_keys_str_mv AT ligang learningdeeprepresentationsofenzymethermaladaptation
AT buricfilip learningdeeprepresentationsofenzymethermaladaptation
AT zrimecjan learningdeeprepresentationsofenzymethermaladaptation
AT viknandersandra learningdeeprepresentationsofenzymethermaladaptation
AT nielsenjens learningdeeprepresentationsofenzymethermaladaptation
AT zelezniakaleksej learningdeeprepresentationsofenzymethermaladaptation
AT engqvistmartinkm learningdeeprepresentationsofenzymethermaladaptation