Cargando…

A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities

[Image: see text] Glycans are critical to every facet of biology and medicine, from viral infections to embryogenesis. Tools to study glycans are rapidly evolving; however, the majority of our knowledge is deeply dependent on binding by glycan binding proteins (e.g., lectins). The specificities of l...

Descripción completa

Detalles Bibliográficos
Autores principales: Bojar, Daniel, Meche, Lawrence, Meng, Guanmin, Eng, William, Smith, David F., Cummings, Richard D., Mahal, Lara K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679999/
https://www.ncbi.nlm.nih.gov/pubmed/35084820
http://dx.doi.org/10.1021/acschembio.1c00689
_version_ 1784834314674896896
author Bojar, Daniel
Meche, Lawrence
Meng, Guanmin
Eng, William
Smith, David F.
Cummings, Richard D.
Mahal, Lara K.
author_facet Bojar, Daniel
Meche, Lawrence
Meng, Guanmin
Eng, William
Smith, David F.
Cummings, Richard D.
Mahal, Lara K.
author_sort Bojar, Daniel
collection PubMed
description [Image: see text] Glycans are critical to every facet of biology and medicine, from viral infections to embryogenesis. Tools to study glycans are rapidly evolving; however, the majority of our knowledge is deeply dependent on binding by glycan binding proteins (e.g., lectins). The specificities of lectins, which are often naturally isolated proteins, have not been well-defined, making it difficult to leverage their full potential for glycan analysis. Herein, we use a combination of machine learning algorithms and expert annotation to define lectin specificity for this important probe set. Our analysis uses comprehensive glycan microarray analysis of commercially available lectins we obtained using version 5.0 of the Consortium for Functional Glycomics glycan microarray (CFGv5). This data set was made public in 2011. We report the creation of this data set and its use in large-scale evaluation of lectin–glycan binding behaviors. Our motif analysis was performed by integrating 68 manually defined glycan features with systematic probing of computational rules for significant binding motifs using mono- and disaccharides and linkages. Combining machine learning with manual annotation, we create a detailed interpretation of glycan-binding specificity for 57 unique lectins, categorized by their major binding motifs: mannose, complex-type N-glycan, O-glycan, fucose, sialic acid and sulfate, GlcNAc and chitin, Gal and LacNAc, and GalNAc. Our work provides fresh insights into the complex binding features of commercially available lectins in current use, providing a critical guide to these important reagents.
format Online
Article
Text
id pubmed-9679999
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-96799992022-11-23 A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities Bojar, Daniel Meche, Lawrence Meng, Guanmin Eng, William Smith, David F. Cummings, Richard D. Mahal, Lara K. ACS Chem Biol [Image: see text] Glycans are critical to every facet of biology and medicine, from viral infections to embryogenesis. Tools to study glycans are rapidly evolving; however, the majority of our knowledge is deeply dependent on binding by glycan binding proteins (e.g., lectins). The specificities of lectins, which are often naturally isolated proteins, have not been well-defined, making it difficult to leverage their full potential for glycan analysis. Herein, we use a combination of machine learning algorithms and expert annotation to define lectin specificity for this important probe set. Our analysis uses comprehensive glycan microarray analysis of commercially available lectins we obtained using version 5.0 of the Consortium for Functional Glycomics glycan microarray (CFGv5). This data set was made public in 2011. We report the creation of this data set and its use in large-scale evaluation of lectin–glycan binding behaviors. Our motif analysis was performed by integrating 68 manually defined glycan features with systematic probing of computational rules for significant binding motifs using mono- and disaccharides and linkages. Combining machine learning with manual annotation, we create a detailed interpretation of glycan-binding specificity for 57 unique lectins, categorized by their major binding motifs: mannose, complex-type N-glycan, O-glycan, fucose, sialic acid and sulfate, GlcNAc and chitin, Gal and LacNAc, and GalNAc. Our work provides fresh insights into the complex binding features of commercially available lectins in current use, providing a critical guide to these important reagents. American Chemical Society 2022-01-27 2022-11-18 /pmc/articles/PMC9679999/ /pubmed/35084820 http://dx.doi.org/10.1021/acschembio.1c00689 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Bojar, Daniel
Meche, Lawrence
Meng, Guanmin
Eng, William
Smith, David F.
Cummings, Richard D.
Mahal, Lara K.
A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities
title A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities
title_full A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities
title_fullStr A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities
title_full_unstemmed A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities
title_short A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities
title_sort useful guide to lectin binding: machine-learning directed annotation of 57 unique lectin specificities
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679999/
https://www.ncbi.nlm.nih.gov/pubmed/35084820
http://dx.doi.org/10.1021/acschembio.1c00689
work_keys_str_mv AT bojardaniel ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT mechelawrence ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT mengguanmin ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT engwilliam ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT smithdavidf ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT cummingsrichardd ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT mahallarak ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT bojardaniel usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT mechelawrence usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT mengguanmin usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT engwilliam usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT smithdavidf usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT cummingsrichardd usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities
AT mahallarak usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities