Cargando…
A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities
[Image: see text] Glycans are critical to every facet of biology and medicine, from viral infections to embryogenesis. Tools to study glycans are rapidly evolving; however, the majority of our knowledge is deeply dependent on binding by glycan binding proteins (e.g., lectins). The specificities of l...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679999/ https://www.ncbi.nlm.nih.gov/pubmed/35084820 http://dx.doi.org/10.1021/acschembio.1c00689 |
_version_ | 1784834314674896896 |
---|---|
author | Bojar, Daniel Meche, Lawrence Meng, Guanmin Eng, William Smith, David F. Cummings, Richard D. Mahal, Lara K. |
author_facet | Bojar, Daniel Meche, Lawrence Meng, Guanmin Eng, William Smith, David F. Cummings, Richard D. Mahal, Lara K. |
author_sort | Bojar, Daniel |
collection | PubMed |
description | [Image: see text] Glycans are critical to every facet of biology and medicine, from viral infections to embryogenesis. Tools to study glycans are rapidly evolving; however, the majority of our knowledge is deeply dependent on binding by glycan binding proteins (e.g., lectins). The specificities of lectins, which are often naturally isolated proteins, have not been well-defined, making it difficult to leverage their full potential for glycan analysis. Herein, we use a combination of machine learning algorithms and expert annotation to define lectin specificity for this important probe set. Our analysis uses comprehensive glycan microarray analysis of commercially available lectins we obtained using version 5.0 of the Consortium for Functional Glycomics glycan microarray (CFGv5). This data set was made public in 2011. We report the creation of this data set and its use in large-scale evaluation of lectin–glycan binding behaviors. Our motif analysis was performed by integrating 68 manually defined glycan features with systematic probing of computational rules for significant binding motifs using mono- and disaccharides and linkages. Combining machine learning with manual annotation, we create a detailed interpretation of glycan-binding specificity for 57 unique lectins, categorized by their major binding motifs: mannose, complex-type N-glycan, O-glycan, fucose, sialic acid and sulfate, GlcNAc and chitin, Gal and LacNAc, and GalNAc. Our work provides fresh insights into the complex binding features of commercially available lectins in current use, providing a critical guide to these important reagents. |
format | Online Article Text |
id | pubmed-9679999 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-96799992022-11-23 A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities Bojar, Daniel Meche, Lawrence Meng, Guanmin Eng, William Smith, David F. Cummings, Richard D. Mahal, Lara K. ACS Chem Biol [Image: see text] Glycans are critical to every facet of biology and medicine, from viral infections to embryogenesis. Tools to study glycans are rapidly evolving; however, the majority of our knowledge is deeply dependent on binding by glycan binding proteins (e.g., lectins). The specificities of lectins, which are often naturally isolated proteins, have not been well-defined, making it difficult to leverage their full potential for glycan analysis. Herein, we use a combination of machine learning algorithms and expert annotation to define lectin specificity for this important probe set. Our analysis uses comprehensive glycan microarray analysis of commercially available lectins we obtained using version 5.0 of the Consortium for Functional Glycomics glycan microarray (CFGv5). This data set was made public in 2011. We report the creation of this data set and its use in large-scale evaluation of lectin–glycan binding behaviors. Our motif analysis was performed by integrating 68 manually defined glycan features with systematic probing of computational rules for significant binding motifs using mono- and disaccharides and linkages. Combining machine learning with manual annotation, we create a detailed interpretation of glycan-binding specificity for 57 unique lectins, categorized by their major binding motifs: mannose, complex-type N-glycan, O-glycan, fucose, sialic acid and sulfate, GlcNAc and chitin, Gal and LacNAc, and GalNAc. Our work provides fresh insights into the complex binding features of commercially available lectins in current use, providing a critical guide to these important reagents. American Chemical Society 2022-01-27 2022-11-18 /pmc/articles/PMC9679999/ /pubmed/35084820 http://dx.doi.org/10.1021/acschembio.1c00689 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Bojar, Daniel Meche, Lawrence Meng, Guanmin Eng, William Smith, David F. Cummings, Richard D. Mahal, Lara K. A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities |
title | A Useful Guide
to Lectin Binding: Machine-Learning
Directed Annotation of 57 Unique Lectin Specificities |
title_full | A Useful Guide
to Lectin Binding: Machine-Learning
Directed Annotation of 57 Unique Lectin Specificities |
title_fullStr | A Useful Guide
to Lectin Binding: Machine-Learning
Directed Annotation of 57 Unique Lectin Specificities |
title_full_unstemmed | A Useful Guide
to Lectin Binding: Machine-Learning
Directed Annotation of 57 Unique Lectin Specificities |
title_short | A Useful Guide
to Lectin Binding: Machine-Learning
Directed Annotation of 57 Unique Lectin Specificities |
title_sort | useful guide
to lectin binding: machine-learning
directed annotation of 57 unique lectin specificities |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679999/ https://www.ncbi.nlm.nih.gov/pubmed/35084820 http://dx.doi.org/10.1021/acschembio.1c00689 |
work_keys_str_mv | AT bojardaniel ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT mechelawrence ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT mengguanmin ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT engwilliam ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT smithdavidf ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT cummingsrichardd ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT mahallarak ausefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT bojardaniel usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT mechelawrence usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT mengguanmin usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT engwilliam usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT smithdavidf usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT cummingsrichardd usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities AT mahallarak usefulguidetolectinbindingmachinelearningdirectedannotationof57uniquelectinspecificities |