Cargando…

LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction

Ranging from bacterial cell adhesion over viral cell entry to human innate immunity, glycan‐binding proteins or lectins are abound in nature. Widely used as staining and characterization reagents in cell biology and crucial for understanding the interactions in biological systems, lectins are a foca...

Descripción completa

Detalles Bibliográficos
Autores principales: Lundstrøm, Jon, Korhonen, Emma, Lisacek, Frédérique, Bojar, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728848/
https://www.ncbi.nlm.nih.gov/pubmed/34862760
http://dx.doi.org/10.1002/advs.202103807
_version_ 1784626816526319616
author Lundstrøm, Jon
Korhonen, Emma
Lisacek, Frédérique
Bojar, Daniel
author_facet Lundstrøm, Jon
Korhonen, Emma
Lisacek, Frédérique
Bojar, Daniel
author_sort Lundstrøm, Jon
collection PubMed
description Ranging from bacterial cell adhesion over viral cell entry to human innate immunity, glycan‐binding proteins or lectins are abound in nature. Widely used as staining and characterization reagents in cell biology and crucial for understanding the interactions in biological systems, lectins are a focal point of study in glycobiology. Yet the sheer breadth and depth of specificity for diverse oligosaccharide motifs has made studying lectins a largely piecemeal approach, with few options to generalize. Here, LectinOracle, a model combining transformer‐based representations for proteins and graph convolutional neural networks for glycans to predict their interaction, is presented. Using a curated data set of 564,647 unique protein–glycan interactions, it is shown that LectinOracle predictions agree with literature‐annotated specificities for a wide range of lectins. Using a range of specialized glycan arrays, it is shown that LectinOracle predictions generalize to new glycans and lectins, with qualitative and quantitative agreement with experimental data. It is further demonstrated that LectinOracle can be used to improve lectin classification, accelerate lectin directed evolution, predict epidemiological outcomes in the context of influenza virus, and analyze whole lectomes in host–microbe interactions. It is envisioned that the herein presented platform will advance both the study of lectins and their role in (glyco)biology.
format Online
Article
Text
id pubmed-8728848
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-87288482022-01-11 LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction Lundstrøm, Jon Korhonen, Emma Lisacek, Frédérique Bojar, Daniel Adv Sci (Weinh) Research Article Ranging from bacterial cell adhesion over viral cell entry to human innate immunity, glycan‐binding proteins or lectins are abound in nature. Widely used as staining and characterization reagents in cell biology and crucial for understanding the interactions in biological systems, lectins are a focal point of study in glycobiology. Yet the sheer breadth and depth of specificity for diverse oligosaccharide motifs has made studying lectins a largely piecemeal approach, with few options to generalize. Here, LectinOracle, a model combining transformer‐based representations for proteins and graph convolutional neural networks for glycans to predict their interaction, is presented. Using a curated data set of 564,647 unique protein–glycan interactions, it is shown that LectinOracle predictions agree with literature‐annotated specificities for a wide range of lectins. Using a range of specialized glycan arrays, it is shown that LectinOracle predictions generalize to new glycans and lectins, with qualitative and quantitative agreement with experimental data. It is further demonstrated that LectinOracle can be used to improve lectin classification, accelerate lectin directed evolution, predict epidemiological outcomes in the context of influenza virus, and analyze whole lectomes in host–microbe interactions. It is envisioned that the herein presented platform will advance both the study of lectins and their role in (glyco)biology. John Wiley and Sons Inc. 2021-12-04 /pmc/articles/PMC8728848/ /pubmed/34862760 http://dx.doi.org/10.1002/advs.202103807 Text en © 2021 The Authors. Advanced Science published by Wiley‐VCH GmbH https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lundstrøm, Jon
Korhonen, Emma
Lisacek, Frédérique
Bojar, Daniel
LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction
title LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction
title_full LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction
title_fullStr LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction
title_full_unstemmed LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction
title_short LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction
title_sort lectinoracle: a generalizable deep learning model for lectin–glycan binding prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728848/
https://www.ncbi.nlm.nih.gov/pubmed/34862760
http://dx.doi.org/10.1002/advs.202103807
work_keys_str_mv AT lundstrømjon lectinoracleageneralizabledeeplearningmodelforlectinglycanbindingprediction
AT korhonenemma lectinoracleageneralizabledeeplearningmodelforlectinglycanbindingprediction
AT lisacekfrederique lectinoracleageneralizabledeeplearningmodelforlectinglycanbindingprediction
AT bojardaniel lectinoracleageneralizabledeeplearningmodelforlectinglycanbindingprediction