Cargando…
Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona
MOTIVATION: Single-cell multi-omics sequencing data can provide a comprehensive molecular view of cells. However, effective approaches for the integrative analysis of such data are challenging. Existing manifold alignment methods demonstrated the state-of-the-art performance on single-cell multi-omi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8696097/ https://www.ncbi.nlm.nih.gov/pubmed/34398192 http://dx.doi.org/10.1093/bioinformatics/btab594 |
_version_ | 1784619730771902464 |
---|---|
author | Cao, Kai Hong, Yiguang Wan, Lin |
author_facet | Cao, Kai Hong, Yiguang Wan, Lin |
author_sort | Cao, Kai |
collection | PubMed |
description | MOTIVATION: Single-cell multi-omics sequencing data can provide a comprehensive molecular view of cells. However, effective approaches for the integrative analysis of such data are challenging. Existing manifold alignment methods demonstrated the state-of-the-art performance on single-cell multi-omics data integration, but they are often limited by requiring that single-cell datasets be derived from the same underlying cellular structure. RESULTS: In this study, we present Pamona, a partial Gromov-Wasserstein distance-based manifold alignment framework that integrates heterogeneous single-cell multi-omics datasets with the aim of delineating and representing the shared and dataset-specific cellular structures across modalities. We formulate this task as a partial manifold alignment problem and develop a partial Gromov-Wasserstein optimal transport framework to solve it. Pamona identifies both shared and dataset-specific cells based on the computed probabilistic couplings of cells across datasets, and it aligns cellular modalities in a common low-dimensional space, while simultaneously preserving both shared and dataset-specific structures. Our framework can easily incorporate prior information, such as cell type annotations or cell-cell correspondence, to further improve alignment quality. We evaluated Pamona on a comprehensive set of publicly available benchmark datasets. We demonstrated that Pamona can accurately identify shared and dataset-specific cells, as well as faithfully recover and align cellular structures of heterogeneous single-cell modalities in a common space, outperforming the comparable existing methods. AVAILABILITYAND IMPLEMENTATION: Pamona software is available at https://github.com/caokai1073/Pamona. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8696097 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-86960972022-01-04 Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona Cao, Kai Hong, Yiguang Wan, Lin Bioinformatics Original Papers MOTIVATION: Single-cell multi-omics sequencing data can provide a comprehensive molecular view of cells. However, effective approaches for the integrative analysis of such data are challenging. Existing manifold alignment methods demonstrated the state-of-the-art performance on single-cell multi-omics data integration, but they are often limited by requiring that single-cell datasets be derived from the same underlying cellular structure. RESULTS: In this study, we present Pamona, a partial Gromov-Wasserstein distance-based manifold alignment framework that integrates heterogeneous single-cell multi-omics datasets with the aim of delineating and representing the shared and dataset-specific cellular structures across modalities. We formulate this task as a partial manifold alignment problem and develop a partial Gromov-Wasserstein optimal transport framework to solve it. Pamona identifies both shared and dataset-specific cells based on the computed probabilistic couplings of cells across datasets, and it aligns cellular modalities in a common low-dimensional space, while simultaneously preserving both shared and dataset-specific structures. Our framework can easily incorporate prior information, such as cell type annotations or cell-cell correspondence, to further improve alignment quality. We evaluated Pamona on a comprehensive set of publicly available benchmark datasets. We demonstrated that Pamona can accurately identify shared and dataset-specific cells, as well as faithfully recover and align cellular structures of heterogeneous single-cell modalities in a common space, outperforming the comparable existing methods. AVAILABILITYAND IMPLEMENTATION: Pamona software is available at https://github.com/caokai1073/Pamona. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-08-16 /pmc/articles/PMC8696097/ /pubmed/34398192 http://dx.doi.org/10.1093/bioinformatics/btab594 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Cao, Kai Hong, Yiguang Wan, Lin Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona |
title | Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona |
title_full | Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona |
title_fullStr | Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona |
title_full_unstemmed | Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona |
title_short | Manifold alignment for heterogeneous single-cell multi-omics data integration using Pamona |
title_sort | manifold alignment for heterogeneous single-cell multi-omics data integration using pamona |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8696097/ https://www.ncbi.nlm.nih.gov/pubmed/34398192 http://dx.doi.org/10.1093/bioinformatics/btab594 |
work_keys_str_mv | AT caokai manifoldalignmentforheterogeneoussinglecellmultiomicsdataintegrationusingpamona AT hongyiguang manifoldalignmentforheterogeneoussinglecellmultiomicsdataintegrationusingpamona AT wanlin manifoldalignmentforheterogeneoussinglecellmultiomicsdataintegrationusingpamona |