Cargando…
Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest
[Image: see text] In recent years, glycosaminoglycans (GAGs) have emerged into the focus of biochemical and biomedical research due to their importance in a variety of physiological processes. These molecules show great diversity, which makes their analysis highly challenging. A promising tool for i...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10103134/ https://www.ncbi.nlm.nih.gov/pubmed/37000483 http://dx.doi.org/10.1021/jacs.2c12762 |
_version_ | 1785025818095779840 |
---|---|
author | Riedel, Jerome Lettow, Maike Grabarics, Márkó Götze, Michael Miller, Rebecca L. Boons, Geert-Jan Meijer, Gerard von Helden, Gert Szekeres, Gergo Peter Pagel, Kevin |
author_facet | Riedel, Jerome Lettow, Maike Grabarics, Márkó Götze, Michael Miller, Rebecca L. Boons, Geert-Jan Meijer, Gerard von Helden, Gert Szekeres, Gergo Peter Pagel, Kevin |
author_sort | Riedel, Jerome |
collection | PubMed |
description | [Image: see text] In recent years, glycosaminoglycans (GAGs) have emerged into the focus of biochemical and biomedical research due to their importance in a variety of physiological processes. These molecules show great diversity, which makes their analysis highly challenging. A promising tool for identifying the structural motifs and conformation of shorter GAG chains is cryogenic gas-phase infrared (IR) spectroscopy. In this work, the cryogenic gas-phase IR spectra of mass-selected heparan sulfate (HS) di-, tetra-, and hexasaccharide ions were recorded to extract vibrational features that are characteristic to structural motifs. The data were augmented with chondroitin sulfate (CS) disaccharide spectra to assemble a training library for random forest (RF) classifiers. These were used to discriminate between GAG classes (CS or HS) and different sulfate positions (2-O-, 4-O-, 6-O-, and N-sulfation). With optimized data preprocessing and RF modeling, a prediction accuracy of >97% was achieved for HS tetra- and hexasaccharides based on a training set of only 21 spectra. These results exemplify the importance of combining gas-phase cryogenic IR ion spectroscopy with machine learning to improve the future analytical workflow for GAG sequencing and that of other biomolecules, such as metabolites. |
format | Online Article Text |
id | pubmed-10103134 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-101031342023-04-15 Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest Riedel, Jerome Lettow, Maike Grabarics, Márkó Götze, Michael Miller, Rebecca L. Boons, Geert-Jan Meijer, Gerard von Helden, Gert Szekeres, Gergo Peter Pagel, Kevin J Am Chem Soc [Image: see text] In recent years, glycosaminoglycans (GAGs) have emerged into the focus of biochemical and biomedical research due to their importance in a variety of physiological processes. These molecules show great diversity, which makes their analysis highly challenging. A promising tool for identifying the structural motifs and conformation of shorter GAG chains is cryogenic gas-phase infrared (IR) spectroscopy. In this work, the cryogenic gas-phase IR spectra of mass-selected heparan sulfate (HS) di-, tetra-, and hexasaccharide ions were recorded to extract vibrational features that are characteristic to structural motifs. The data were augmented with chondroitin sulfate (CS) disaccharide spectra to assemble a training library for random forest (RF) classifiers. These were used to discriminate between GAG classes (CS or HS) and different sulfate positions (2-O-, 4-O-, 6-O-, and N-sulfation). With optimized data preprocessing and RF modeling, a prediction accuracy of >97% was achieved for HS tetra- and hexasaccharides based on a training set of only 21 spectra. These results exemplify the importance of combining gas-phase cryogenic IR ion spectroscopy with machine learning to improve the future analytical workflow for GAG sequencing and that of other biomolecules, such as metabolites. American Chemical Society 2023-03-31 /pmc/articles/PMC10103134/ /pubmed/37000483 http://dx.doi.org/10.1021/jacs.2c12762 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Riedel, Jerome Lettow, Maike Grabarics, Márkó Götze, Michael Miller, Rebecca L. Boons, Geert-Jan Meijer, Gerard von Helden, Gert Szekeres, Gergo Peter Pagel, Kevin Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest |
title | Predicting Structural
Motifs of Glycosaminoglycans
using Cryogenic Infrared Spectroscopy and Random Forest |
title_full | Predicting Structural
Motifs of Glycosaminoglycans
using Cryogenic Infrared Spectroscopy and Random Forest |
title_fullStr | Predicting Structural
Motifs of Glycosaminoglycans
using Cryogenic Infrared Spectroscopy and Random Forest |
title_full_unstemmed | Predicting Structural
Motifs of Glycosaminoglycans
using Cryogenic Infrared Spectroscopy and Random Forest |
title_short | Predicting Structural
Motifs of Glycosaminoglycans
using Cryogenic Infrared Spectroscopy and Random Forest |
title_sort | predicting structural
motifs of glycosaminoglycans
using cryogenic infrared spectroscopy and random forest |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10103134/ https://www.ncbi.nlm.nih.gov/pubmed/37000483 http://dx.doi.org/10.1021/jacs.2c12762 |
work_keys_str_mv | AT riedeljerome predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest AT lettowmaike predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest AT grabaricsmarko predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest AT gotzemichael predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest AT millerrebeccal predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest AT boonsgeertjan predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest AT meijergerard predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest AT vonheldengert predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest AT szekeresgergopeter predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest AT pagelkevin predictingstructuralmotifsofglycosaminoglycansusingcryogenicinfraredspectroscopyandrandomforest |