Cargando…
LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction
Ranging from bacterial cell adhesion over viral cell entry to human innate immunity, glycan‐binding proteins or lectins are abound in nature. Widely used as staining and characterization reagents in cell biology and crucial for understanding the interactions in biological systems, lectins are a foca...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728848/ https://www.ncbi.nlm.nih.gov/pubmed/34862760 http://dx.doi.org/10.1002/advs.202103807 |
_version_ | 1784626816526319616 |
---|---|
author | Lundstrøm, Jon Korhonen, Emma Lisacek, Frédérique Bojar, Daniel |
author_facet | Lundstrøm, Jon Korhonen, Emma Lisacek, Frédérique Bojar, Daniel |
author_sort | Lundstrøm, Jon |
collection | PubMed |
description | Ranging from bacterial cell adhesion over viral cell entry to human innate immunity, glycan‐binding proteins or lectins are abound in nature. Widely used as staining and characterization reagents in cell biology and crucial for understanding the interactions in biological systems, lectins are a focal point of study in glycobiology. Yet the sheer breadth and depth of specificity for diverse oligosaccharide motifs has made studying lectins a largely piecemeal approach, with few options to generalize. Here, LectinOracle, a model combining transformer‐based representations for proteins and graph convolutional neural networks for glycans to predict their interaction, is presented. Using a curated data set of 564,647 unique protein–glycan interactions, it is shown that LectinOracle predictions agree with literature‐annotated specificities for a wide range of lectins. Using a range of specialized glycan arrays, it is shown that LectinOracle predictions generalize to new glycans and lectins, with qualitative and quantitative agreement with experimental data. It is further demonstrated that LectinOracle can be used to improve lectin classification, accelerate lectin directed evolution, predict epidemiological outcomes in the context of influenza virus, and analyze whole lectomes in host–microbe interactions. It is envisioned that the herein presented platform will advance both the study of lectins and their role in (glyco)biology. |
format | Online Article Text |
id | pubmed-8728848 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-87288482022-01-11 LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction Lundstrøm, Jon Korhonen, Emma Lisacek, Frédérique Bojar, Daniel Adv Sci (Weinh) Research Article Ranging from bacterial cell adhesion over viral cell entry to human innate immunity, glycan‐binding proteins or lectins are abound in nature. Widely used as staining and characterization reagents in cell biology and crucial for understanding the interactions in biological systems, lectins are a focal point of study in glycobiology. Yet the sheer breadth and depth of specificity for diverse oligosaccharide motifs has made studying lectins a largely piecemeal approach, with few options to generalize. Here, LectinOracle, a model combining transformer‐based representations for proteins and graph convolutional neural networks for glycans to predict their interaction, is presented. Using a curated data set of 564,647 unique protein–glycan interactions, it is shown that LectinOracle predictions agree with literature‐annotated specificities for a wide range of lectins. Using a range of specialized glycan arrays, it is shown that LectinOracle predictions generalize to new glycans and lectins, with qualitative and quantitative agreement with experimental data. It is further demonstrated that LectinOracle can be used to improve lectin classification, accelerate lectin directed evolution, predict epidemiological outcomes in the context of influenza virus, and analyze whole lectomes in host–microbe interactions. It is envisioned that the herein presented platform will advance both the study of lectins and their role in (glyco)biology. John Wiley and Sons Inc. 2021-12-04 /pmc/articles/PMC8728848/ /pubmed/34862760 http://dx.doi.org/10.1002/advs.202103807 Text en © 2021 The Authors. Advanced Science published by Wiley‐VCH GmbH https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Lundstrøm, Jon Korhonen, Emma Lisacek, Frédérique Bojar, Daniel LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction |
title | LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction |
title_full | LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction |
title_fullStr | LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction |
title_full_unstemmed | LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction |
title_short | LectinOracle: A Generalizable Deep Learning Model for Lectin–Glycan Binding Prediction |
title_sort | lectinoracle: a generalizable deep learning model for lectin–glycan binding prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728848/ https://www.ncbi.nlm.nih.gov/pubmed/34862760 http://dx.doi.org/10.1002/advs.202103807 |
work_keys_str_mv | AT lundstrømjon lectinoracleageneralizabledeeplearningmodelforlectinglycanbindingprediction AT korhonenemma lectinoracleageneralizabledeeplearningmodelforlectinglycanbindingprediction AT lisacekfrederique lectinoracleageneralizabledeeplearningmodelforlectinglycanbindingprediction AT bojardaniel lectinoracleageneralizabledeeplearningmodelforlectinglycanbindingprediction |