Cargando…

Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest

[Image: see text] In recent years, glycosaminoglycans (GAGs) have emerged into the focus of biochemical and biomedical research due to their importance in a variety of physiological processes. These molecules show great diversity, which makes their analysis highly challenging. A promising tool for i...

Descripción completa

Detalles Bibliográficos
Autores principales: Riedel, Jerome, Lettow, Maike, Grabarics, Márkó, Götze, Michael, Miller, Rebecca L., Boons, Geert-Jan, Meijer, Gerard, von Helden, Gert, Szekeres, Gergo Peter, Pagel, Kevin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10103134/
https://www.ncbi.nlm.nih.gov/pubmed/37000483
http://dx.doi.org/10.1021/jacs.2c12762
_version_ 1785025818095779840
author Riedel, Jerome
Lettow, Maike
Grabarics, Márkó
Götze, Michael
Miller, Rebecca L.
Boons, Geert-Jan
Meijer, Gerard
von Helden, Gert
Szekeres, Gergo Peter
Pagel, Kevin
author_facet Riedel, Jerome
Lettow, Maike
Grabarics, Márkó
Götze, Michael
Miller, Rebecca L.
Boons, Geert-Jan
Meijer, Gerard
von Helden, Gert
Szekeres, Gergo Peter
Pagel, Kevin
author_sort Riedel, Jerome
collection PubMed
description [Image: see text] In recent years, glycosaminoglycans (GAGs) have emerged into the focus of biochemical and biomedical research due to their importance in a variety of physiological processes. These molecules show great diversity, which makes their analysis highly challenging. A promising tool for identifying the structural motifs and conformation of shorter GAG chains is cryogenic gas-phase infrared (IR) spectroscopy. In this work, the cryogenic gas-phase IR spectra of mass-selected heparan sulfate (HS) di-, tetra-, and hexasaccharide ions were recorded to extract vibrational features that are characteristic to structural motifs. The data were augmented with chondroitin sulfate (CS) disaccharide spectra to assemble a training library for random forest (RF) classifiers. These were used to discriminate between GAG classes (CS or HS) and different sulfate positions (2-O-, 4-O-, 6-O-, and N-sulfation). With optimized data preprocessing and RF modeling, a prediction accuracy of >97% was achieved for HS tetra- and hexasaccharides based on a training set of only 21 spectra. These results exemplify the importance of combining gas-phase cryogenic IR ion spectroscopy with machine learning to improve the future analytical workflow for GAG sequencing and that of other biomolecules, such as metabolites.
format Online
Article
Text
id pubmed-10103134
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-101031342023-04-15 Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest Riedel, Jerome Lettow, Maike Grabarics, Márkó Götze, Michael Miller, Rebecca L. Boons, Geert-Jan Meijer, Gerard von Helden, Gert Szekeres, Gergo Peter Pagel, Kevin J Am Chem Soc [Image: see text] In recent years, glycosaminoglycans (GAGs) have emerged into the focus of biochemical and biomedical research due to their importance in a variety of physiological processes. These molecules show great diversity, which makes their analysis highly challenging. A promising tool for identifying the structural motifs and conformation of shorter GAG chains is cryogenic gas-phase infrared (IR) spectroscopy. In this work, the cryogenic gas-phase IR spectra of mass-selected heparan sulfate (HS) di-, tetra-, and hexasaccharide ions were recorded to extract vibrational features that are characteristic to structural motifs. The data were augmented with chondroitin sulfate (CS) disaccharide spectra to assemble a training library for random forest (RF) classifiers. These were used to discriminate between GAG classes (CS or HS) and different sulfate positions (2-O-, 4-O-, 6-O-, and N-sulfation). With optimized data preprocessing and RF modeling, a prediction accuracy of >97% was achieved for HS tetra- and hexasaccharides based on a training set of only 21 spectra. These results exemplify the importance of combining gas-phase cryogenic IR ion spectroscopy with machine learning to improve the future analytical workflow for GAG sequencing and that of other biomolecules, such as metabolites. American Chemical Society 2023-03-31 /pmc/articles/PMC10103134/ /pubmed/37000483 http://dx.doi.org/10.1021/jacs.2c12762 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Riedel, Jerome
Lettow, Maike
Grabarics, Márkó
Götze, Michael
Miller, Rebecca L.
Boons, Geert-Jan
Meijer, Gerard
von Helden, Gert
Szekeres, Gergo Peter
Pagel, Kevin
Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest
title Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest
title_full Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest
title_fullStr Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest
title_full_unstemmed Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest
title_short Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest
title_sort predicting structural motifs of glycosaminoglycans using cryogenic infrared spectroscopy and random forest
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10103134/
https://www.ncbi.nlm.nih.gov/pubmed/37000483
http://dx.doi.org/10.1021/jacs.2c12762
work_keys_str_mv AT riedeljerome predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest
AT lettowmaike predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest
AT grabaricsmarko predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest
AT gotzemichael predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest
AT millerrebeccal predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest
AT boonsgeertjan predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest
AT meijergerard predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest
AT vonheldengert predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest
AT szekeresgergopeter predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest
AT pagelkevin predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest