Cargando…
A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data
MOTIVATION: Single-cell multimodal assays allow us to simultaneously measure two different molecular features of the same cell, enabling new insights into cellular heterogeneity, cell development and diseases. However, most existing methods suffer from inaccurate dimensionality reduction for the joi...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857983/ https://www.ncbi.nlm.nih.gov/pubmed/36622018 http://dx.doi.org/10.1093/bioinformatics/btad005 |
_version_ | 1784873984985137152 |
---|---|
author | Wang, Yuwei Lian, Bin Zhang, Haohui Zhong, Yuanke He, Jie Wu, Fashuai Reinert, Knut Shang, Xuequn Yang, Hui Hu, Jialu |
author_facet | Wang, Yuwei Lian, Bin Zhang, Haohui Zhong, Yuanke He, Jie Wu, Fashuai Reinert, Knut Shang, Xuequn Yang, Hui Hu, Jialu |
author_sort | Wang, Yuwei |
collection | PubMed |
description | MOTIVATION: Single-cell multimodal assays allow us to simultaneously measure two different molecular features of the same cell, enabling new insights into cellular heterogeneity, cell development and diseases. However, most existing methods suffer from inaccurate dimensionality reduction for the joint-modality data, hindering their discovery of novel or rare cell subpopulations. RESULTS: Here, we present VIMCCA, a computational framework based on variational-assisted multi-view canonical correlation analysis to integrate paired multimodal single-cell data. Our statistical model uses a common latent variable to interpret the common source of variances in two different data modalities. Our approach jointly learns an inference model and two modality-specific non-linear models by leveraging variational inference and deep learning. We perform VIMCCA and compare it with 10 existing state-of-the-art algorithms on four paired multi-modal datasets sequenced by different protocols. Results demonstrate that VIMCCA facilitates integrating various types of joint-modality data, thus leading to more reliable and accurate downstream analysis. VIMCCA improves our ability to identify novel or rare cell subtypes compared to existing widely used methods. Besides, it can also facilitate inferring cell lineage based on joint-modality profiles. AVAILABILITY AND IMPLEMENTATION: The VIMCCA algorithm has been implemented in our toolkit package scbean ([Formula: see text] 0.5.0), and its code has been archived at https://github.com/jhu99/scbean under MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9857983 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98579832023-01-23 A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data Wang, Yuwei Lian, Bin Zhang, Haohui Zhong, Yuanke He, Jie Wu, Fashuai Reinert, Knut Shang, Xuequn Yang, Hui Hu, Jialu Bioinformatics Original Paper MOTIVATION: Single-cell multimodal assays allow us to simultaneously measure two different molecular features of the same cell, enabling new insights into cellular heterogeneity, cell development and diseases. However, most existing methods suffer from inaccurate dimensionality reduction for the joint-modality data, hindering their discovery of novel or rare cell subpopulations. RESULTS: Here, we present VIMCCA, a computational framework based on variational-assisted multi-view canonical correlation analysis to integrate paired multimodal single-cell data. Our statistical model uses a common latent variable to interpret the common source of variances in two different data modalities. Our approach jointly learns an inference model and two modality-specific non-linear models by leveraging variational inference and deep learning. We perform VIMCCA and compare it with 10 existing state-of-the-art algorithms on four paired multi-modal datasets sequenced by different protocols. Results demonstrate that VIMCCA facilitates integrating various types of joint-modality data, thus leading to more reliable and accurate downstream analysis. VIMCCA improves our ability to identify novel or rare cell subtypes compared to existing widely used methods. Besides, it can also facilitate inferring cell lineage based on joint-modality profiles. AVAILABILITY AND IMPLEMENTATION: The VIMCCA algorithm has been implemented in our toolkit package scbean ([Formula: see text] 0.5.0), and its code has been archived at https://github.com/jhu99/scbean under MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-01-09 /pmc/articles/PMC9857983/ /pubmed/36622018 http://dx.doi.org/10.1093/bioinformatics/btad005 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Wang, Yuwei Lian, Bin Zhang, Haohui Zhong, Yuanke He, Jie Wu, Fashuai Reinert, Knut Shang, Xuequn Yang, Hui Hu, Jialu A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data |
title | A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data |
title_full | A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data |
title_fullStr | A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data |
title_full_unstemmed | A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data |
title_short | A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data |
title_sort | multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857983/ https://www.ncbi.nlm.nih.gov/pubmed/36622018 http://dx.doi.org/10.1093/bioinformatics/btad005 |
work_keys_str_mv | AT wangyuwei amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT lianbin amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT zhanghaohui amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT zhongyuanke amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT hejie amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT wufashuai amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT reinertknut amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT shangxuequn amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT yanghui amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT hujialu amultiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT wangyuwei multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT lianbin multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT zhanghaohui multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT zhongyuanke multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT hejie multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT wufashuai multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT reinertknut multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT shangxuequn multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT yanghui multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata AT hujialu multiviewlatentvariablemodelrevealscellularheterogeneityincomplextissuesforpairedmultimodalsinglecelldata |