Cargando…

SD(2): spatially resolved transcriptomics deconvolution through integration of dropout and spatial information

MOTIVATION: Unveiling the heterogeneity in the tissues is crucial to explore cell–cell interactions and cellular targets of human diseases. Spatial transcriptomics (ST) supplies spatial gene expression profile which has revolutionized our biological understanding, but variations in cell-type proport...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Haoyang, Li, Hanmin, Zhou, Juexiao, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9789790/
https://www.ncbi.nlm.nih.gov/pubmed/36063455
http://dx.doi.org/10.1093/bioinformatics/btac605
_version_ 1784859031826857984
author Li, Haoyang
Li, Hanmin
Zhou, Juexiao
Gao, Xin
author_facet Li, Haoyang
Li, Hanmin
Zhou, Juexiao
Gao, Xin
author_sort Li, Haoyang
collection PubMed
description MOTIVATION: Unveiling the heterogeneity in the tissues is crucial to explore cell–cell interactions and cellular targets of human diseases. Spatial transcriptomics (ST) supplies spatial gene expression profile which has revolutionized our biological understanding, but variations in cell-type proportions of each spot with dozens of cells would confound downstream analysis. Therefore, deconvolution of ST has been an indispensable step and a technical challenge toward the higher-resolution panorama of tissues. RESULTS: Here, we propose a novel ST deconvolution method called SD(2) integrating spatial information of ST data and embracing an important characteristic, dropout, which is traditionally considered as an obstruction in single-cell RNA sequencing data (scRNA-seq) analysis. First, we extract the dropout-based genes as informative features from ST and scRNA-seq data by fitting a Michaelis–Menten function. After synthesizing pseudo-ST spots by randomly composing cells from scRNA-seq data, auto-encoder is applied to discover low-dimensional and non-linear representation of the real- and pseudo-ST spots. Next, we create a graph containing embedded profiles as nodes, and edges determined by transcriptional similarity and spatial relationship. Given the graph, a graph convolutional neural network is used to predict the cell-type compositions for real-ST spots. We benchmark the performance of SD(2) on the simulated seqFISH+ dataset with different resolutions and measurements which show superior performance compared with the state-of-the-art methods. SD(2) is further validated on three real-world datasets with different ST technologies and demonstrates the capability to localize cell-type composition accurately with quantitative evidence. Finally, ablation study is conducted to verify the contribution of different modules proposed in SD(2). AVAILABILITY AND IMPLEMENTATION: The SD(2) is freely available in github (https://github.com/leihouyeung/SD2) and Zenodo (https://doi.org/10.5281/zenodo.7024684). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9789790
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97897902022-12-30 SD(2): spatially resolved transcriptomics deconvolution through integration of dropout and spatial information Li, Haoyang Li, Hanmin Zhou, Juexiao Gao, Xin Bioinformatics Original Papers MOTIVATION: Unveiling the heterogeneity in the tissues is crucial to explore cell–cell interactions and cellular targets of human diseases. Spatial transcriptomics (ST) supplies spatial gene expression profile which has revolutionized our biological understanding, but variations in cell-type proportions of each spot with dozens of cells would confound downstream analysis. Therefore, deconvolution of ST has been an indispensable step and a technical challenge toward the higher-resolution panorama of tissues. RESULTS: Here, we propose a novel ST deconvolution method called SD(2) integrating spatial information of ST data and embracing an important characteristic, dropout, which is traditionally considered as an obstruction in single-cell RNA sequencing data (scRNA-seq) analysis. First, we extract the dropout-based genes as informative features from ST and scRNA-seq data by fitting a Michaelis–Menten function. After synthesizing pseudo-ST spots by randomly composing cells from scRNA-seq data, auto-encoder is applied to discover low-dimensional and non-linear representation of the real- and pseudo-ST spots. Next, we create a graph containing embedded profiles as nodes, and edges determined by transcriptional similarity and spatial relationship. Given the graph, a graph convolutional neural network is used to predict the cell-type compositions for real-ST spots. We benchmark the performance of SD(2) on the simulated seqFISH+ dataset with different resolutions and measurements which show superior performance compared with the state-of-the-art methods. SD(2) is further validated on three real-world datasets with different ST technologies and demonstrates the capability to localize cell-type composition accurately with quantitative evidence. Finally, ablation study is conducted to verify the contribution of different modules proposed in SD(2). AVAILABILITY AND IMPLEMENTATION: The SD(2) is freely available in github (https://github.com/leihouyeung/SD2) and Zenodo (https://doi.org/10.5281/zenodo.7024684). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-09-05 /pmc/articles/PMC9789790/ /pubmed/36063455 http://dx.doi.org/10.1093/bioinformatics/btac605 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Li, Haoyang
Li, Hanmin
Zhou, Juexiao
Gao, Xin
SD(2): spatially resolved transcriptomics deconvolution through integration of dropout and spatial information
title SD(2): spatially resolved transcriptomics deconvolution through integration of dropout and spatial information
title_full SD(2): spatially resolved transcriptomics deconvolution through integration of dropout and spatial information
title_fullStr SD(2): spatially resolved transcriptomics deconvolution through integration of dropout and spatial information
title_full_unstemmed SD(2): spatially resolved transcriptomics deconvolution through integration of dropout and spatial information
title_short SD(2): spatially resolved transcriptomics deconvolution through integration of dropout and spatial information
title_sort sd(2): spatially resolved transcriptomics deconvolution through integration of dropout and spatial information
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9789790/
https://www.ncbi.nlm.nih.gov/pubmed/36063455
http://dx.doi.org/10.1093/bioinformatics/btac605
work_keys_str_mv AT lihaoyang sd2spatiallyresolvedtranscriptomicsdeconvolutionthroughintegrationofdropoutandspatialinformation
AT lihanmin sd2spatiallyresolvedtranscriptomicsdeconvolutionthroughintegrationofdropoutandspatialinformation
AT zhoujuexiao sd2spatiallyresolvedtranscriptomicsdeconvolutionthroughintegrationofdropoutandspatialinformation
AT gaoxin sd2spatiallyresolvedtranscriptomicsdeconvolutionthroughintegrationofdropoutandspatialinformation