Cargando…

Deep learning explains the biology of branched glycans from single-cell sequencing data

Glycosylation is ubiquitous and often dysregulated in disease. However, the regulation and functional significance of various types of glycosylation at cellular levels is hard to unravel experimentally. Multi-omics, single-cell measurements such as SUGAR-seq, which quantifies transcriptomes and cell...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Rui, Mahal, Lara K., Bojar, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9547197/
https://www.ncbi.nlm.nih.gov/pubmed/36217547
http://dx.doi.org/10.1016/j.isci.2022.105163
_version_ 1784805210716110848
author Qin, Rui
Mahal, Lara K.
Bojar, Daniel
author_facet Qin, Rui
Mahal, Lara K.
Bojar, Daniel
author_sort Qin, Rui
collection PubMed
description Glycosylation is ubiquitous and often dysregulated in disease. However, the regulation and functional significance of various types of glycosylation at cellular levels is hard to unravel experimentally. Multi-omics, single-cell measurements such as SUGAR-seq, which quantifies transcriptomes and cell surface glycans, facilitate addressing this issue. Using SUGAR-seq data, we pioneered a deep learning model to predict the glycan phenotypes of cells (mouse T lymphocytes) from transcripts, with the example of predicting β1,6GlcNAc-branching across T cell subtypes (test set F1 score: 0.9351). Model interpretation via SHAP (SHapley Additive exPlanations) identified highly predictive genes, in part known to impact (i) branched glycan levels and (ii) the biology of branched glycans. These genes included physiologically relevant low-abundance genes that were not captured by conventional differential expression analysis. Our work shows that interpretable deep learning models are promising for uncovering novel functions and regulatory mechanisms of glycans from integrated transcriptomic and glycomic datasets.
format Online
Article
Text
id pubmed-9547197
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-95471972022-10-09 Deep learning explains the biology of branched glycans from single-cell sequencing data Qin, Rui Mahal, Lara K. Bojar, Daniel iScience Article Glycosylation is ubiquitous and often dysregulated in disease. However, the regulation and functional significance of various types of glycosylation at cellular levels is hard to unravel experimentally. Multi-omics, single-cell measurements such as SUGAR-seq, which quantifies transcriptomes and cell surface glycans, facilitate addressing this issue. Using SUGAR-seq data, we pioneered a deep learning model to predict the glycan phenotypes of cells (mouse T lymphocytes) from transcripts, with the example of predicting β1,6GlcNAc-branching across T cell subtypes (test set F1 score: 0.9351). Model interpretation via SHAP (SHapley Additive exPlanations) identified highly predictive genes, in part known to impact (i) branched glycan levels and (ii) the biology of branched glycans. These genes included physiologically relevant low-abundance genes that were not captured by conventional differential expression analysis. Our work shows that interpretable deep learning models are promising for uncovering novel functions and regulatory mechanisms of glycans from integrated transcriptomic and glycomic datasets. Elsevier 2022-09-19 /pmc/articles/PMC9547197/ /pubmed/36217547 http://dx.doi.org/10.1016/j.isci.2022.105163 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Qin, Rui
Mahal, Lara K.
Bojar, Daniel
Deep learning explains the biology of branched glycans from single-cell sequencing data
title Deep learning explains the biology of branched glycans from single-cell sequencing data
title_full Deep learning explains the biology of branched glycans from single-cell sequencing data
title_fullStr Deep learning explains the biology of branched glycans from single-cell sequencing data
title_full_unstemmed Deep learning explains the biology of branched glycans from single-cell sequencing data
title_short Deep learning explains the biology of branched glycans from single-cell sequencing data
title_sort deep learning explains the biology of branched glycans from single-cell sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9547197/
https://www.ncbi.nlm.nih.gov/pubmed/36217547
http://dx.doi.org/10.1016/j.isci.2022.105163
work_keys_str_mv AT qinrui deeplearningexplainsthebiologyofbranchedglycansfromsinglecellsequencingdata
AT mahallarak deeplearningexplainsthebiologyofbranchedglycansfromsinglecellsequencingdata
AT bojardaniel deeplearningexplainsthebiologyofbranchedglycansfromsinglecellsequencingdata