Cargando…

Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona

MOTIVATION: Single-cell multi-omics sequencing data can provide a comprehensive molecular view of cells. However, effective approaches for the integrative analysis of such data are challenging. Existing manifold alignment methods demonstrated the state-of-the-art performance on single-cell multi-omi...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Kai, Hong, Yiguang, Wan, Lin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8696097/
https://www.ncbi.nlm.nih.gov/pubmed/34398192
http://dx.doi.org/10.1093/bioinformatics/btab594
_version_ 1784619730771902464
author Cao, Kai
Hong, Yiguang
Wan, Lin
author_facet Cao, Kai
Hong, Yiguang
Wan, Lin
author_sort Cao, Kai
collection PubMed
description MOTIVATION: Single-cell multi-omics sequencing data can provide a comprehensive molecular view of cells. However, effective approaches for the integrative analysis of such data are challenging. Existing manifold alignment methods demonstrated the state-of-the-art performance on single-cell multi-omics data integration, but they are often limited by requiring that single-cell datasets be derived from the same underlying cellular structure. RESULTS: In this study, we present Pamona, a partial Gromov-Wasserstein distance-based manifold alignment framework that integrates heterogeneous single-cell multi-omics datasets with the aim of delineating and representing the shared and dataset-specific cellular structures across modalities. We formulate this task as a partial manifold alignment problem and develop a partial Gromov-Wasserstein optimal transport framework to solve it. Pamona identifies both shared and dataset-specific cells based on the computed probabilistic couplings of cells across datasets, and it aligns cellular modalities in a common low-dimensional space, while simultaneously preserving both shared and dataset-specific structures. Our framework can easily incorporate prior information, such as cell type annotations or cell-cell correspondence, to further improve alignment quality. We evaluated Pamona on a comprehensive set of publicly available benchmark datasets. We demonstrated that Pamona can accurately identify shared and dataset-specific cells, as well as faithfully recover and align cellular structures of heterogeneous single-cell modalities in a common space, outperforming the comparable existing methods. AVAILABILITYAND IMPLEMENTATION: Pamona software is available at https://github.com/caokai1073/Pamona. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8696097
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86960972022-01-04 Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona Cao, Kai Hong, Yiguang Wan, Lin Bioinformatics Original Papers MOTIVATION: Single-cell multi-omics sequencing data can provide a comprehensive molecular view of cells. However, effective approaches for the integrative analysis of such data are challenging. Existing manifold alignment methods demonstrated the state-of-the-art performance on single-cell multi-omics data integration, but they are often limited by requiring that single-cell datasets be derived from the same underlying cellular structure. RESULTS: In this study, we present Pamona, a partial Gromov-Wasserstein distance-based manifold alignment framework that integrates heterogeneous single-cell multi-omics datasets with the aim of delineating and representing the shared and dataset-specific cellular structures across modalities. We formulate this task as a partial manifold alignment problem and develop a partial Gromov-Wasserstein optimal transport framework to solve it. Pamona identifies both shared and dataset-specific cells based on the computed probabilistic couplings of cells across datasets, and it aligns cellular modalities in a common low-dimensional space, while simultaneously preserving both shared and dataset-specific structures. Our framework can easily incorporate prior information, such as cell type annotations or cell-cell correspondence, to further improve alignment quality. We evaluated Pamona on a comprehensive set of publicly available benchmark datasets. We demonstrated that Pamona can accurately identify shared and dataset-specific cells, as well as faithfully recover and align cellular structures of heterogeneous single-cell modalities in a common space, outperforming the comparable existing methods. AVAILABILITYAND IMPLEMENTATION: Pamona software is available at https://github.com/caokai1073/Pamona. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-08-16 /pmc/articles/PMC8696097/ /pubmed/34398192 http://dx.doi.org/10.1093/bioinformatics/btab594 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Cao, Kai
Hong, Yiguang
Wan, Lin
Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona
title Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona
title_full Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona
title_fullStr Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona
title_full_unstemmed Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona
title_short Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona
title_sort manifold alignment for heterogeneous single-cell multi-omics data integration using pamona
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8696097/
https://www.ncbi.nlm.nih.gov/pubmed/34398192
http://dx.doi.org/10.1093/bioinformatics/btab594
work_keys_str_mv AT caokai manifoldalignmentforheterogeneoussinglecellmultiomicsdataintegrationusingpamona
AT hongyiguang manifoldalignmentforheterogeneoussinglecellmultiomicsdataintegrationusingpamona
AT wanlin manifoldalignmentforheterogeneoussinglecellmultiomicsdataintegrationusingpamona