Cargando…

A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data

MOTIVATION: Single-cell multimodal assays allow us to simultaneously measure two different molecular features of the same cell, enabling new insights into cellular heterogeneity, cell development and diseases. However, most existing methods suffer from inaccurate dimensionality reduction for the joi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yuwei, Lian, Bin, Zhang, Haohui, Zhong, Yuanke, He, Jie, Wu, Fashuai, Reinert, Knut, Shang, Xuequn, Yang, Hui, Hu, Jialu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857983/
https://www.ncbi.nlm.nih.gov/pubmed/36622018
http://dx.doi.org/10.1093/bioinformatics/btad005
_version_ 1784873984985137152
author Wang, Yuwei
Lian, Bin
Zhang, Haohui
Zhong, Yuanke
He, Jie
Wu, Fashuai
Reinert, Knut
Shang, Xuequn
Yang, Hui
Hu, Jialu
author_facet Wang, Yuwei
Lian, Bin
Zhang, Haohui
Zhong, Yuanke
He, Jie
Wu, Fashuai
Reinert, Knut
Shang, Xuequn
Yang, Hui
Hu, Jialu
author_sort Wang, Yuwei
collection PubMed
description MOTIVATION: Single-cell multimodal assays allow us to simultaneously measure two different molecular features of the same cell, enabling new insights into cellular heterogeneity, cell development and diseases. However, most existing methods suffer from inaccurate dimensionality reduction for the joint-modality data, hindering their discovery of novel or rare cell subpopulations. RESULTS: Here, we present VIMCCA, a computational framework based on variational-assisted multi-view canonical correlation analysis to integrate paired multimodal single-cell data. Our statistical model uses a common latent variable to interpret the common source of variances in two different data modalities. Our approach jointly learns an inference model and two modality-specific non-linear models by leveraging variational inference and deep learning. We perform VIMCCA and compare it with 10 existing state-of-the-art algorithms on four paired multi-modal datasets sequenced by different protocols. Results demonstrate that VIMCCA facilitates integrating various types of joint-modality data, thus leading to more reliable and accurate downstream analysis. VIMCCA improves our ability to identify novel or rare cell subtypes compared to existing widely used methods. Besides, it can also facilitate inferring cell lineage based on joint-modality profiles. AVAILABILITY AND IMPLEMENTATION: The VIMCCA algorithm has been implemented in our toolkit package scbean ([Formula: see text] 0.5.0), and its code has been archived at https://github.com/jhu99/scbean under MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9857983
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98579832023-01-23 A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data Wang, Yuwei Lian, Bin Zhang, Haohui Zhong, Yuanke He, Jie Wu, Fashuai Reinert, Knut Shang, Xuequn Yang, Hui Hu, Jialu Bioinformatics Original Paper MOTIVATION: Single-cell multimodal assays allow us to simultaneously measure two different molecular features of the same cell, enabling new insights into cellular heterogeneity, cell development and diseases. However, most existing methods suffer from inaccurate dimensionality reduction for the joint-modality data, hindering their discovery of novel or rare cell subpopulations. RESULTS: Here, we present VIMCCA, a computational framework based on variational-assisted multi-view canonical correlation analysis to integrate paired multimodal single-cell data. Our statistical model uses a common latent variable to interpret the common source of variances in two different data modalities. Our approach jointly learns an inference model and two modality-specific non-linear models by leveraging variational inference and deep learning. We perform VIMCCA and compare it with 10 existing state-of-the-art algorithms on four paired multi-modal datasets sequenced by different protocols. Results demonstrate that VIMCCA facilitates integrating various types of joint-modality data, thus leading to more reliable and accurate downstream analysis. VIMCCA improves our ability to identify novel or rare cell subtypes compared to existing widely used methods. Besides, it can also facilitate inferring cell lineage based on joint-modality profiles. AVAILABILITY AND IMPLEMENTATION: The VIMCCA algorithm has been implemented in our toolkit package scbean ([Formula: see text] 0.5.0), and its code has been archived at https://github.com/jhu99/scbean under MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-01-09 /pmc/articles/PMC9857983/ /pubmed/36622018 http://dx.doi.org/10.1093/bioinformatics/btad005 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Wang, Yuwei
Lian, Bin
Zhang, Haohui
Zhong, Yuanke
He, Jie
Wu, Fashuai
Reinert, Knut
Shang, Xuequn
Yang, Hui
Hu, Jialu
A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data
title A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data
title_full A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data
title_fullStr A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data
title_full_unstemmed A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data
title_short A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data
title_sort multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857983/
https://www.ncbi.nlm.nih.gov/pubmed/36622018
http://dx.doi.org/10.1093/bioinformatics/btad005
work_keys_str_mv AT wangyuwei amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT lianbin amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT zhanghaohui amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT zhongyuanke amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT hejie amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT wufashuai amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT reinertknut amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT shangxuequn amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT yanghui amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT hujialu amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT wangyuwei multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT lianbin multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT zhanghaohui multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT zhongyuanke multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT hejie multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT wufashuai multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT reinertknut multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT shangxuequn multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT yanghui multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata
AT hujialu multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata