Cargando…
Ensemble deep learning of embeddings for clustering multimodal single-cell omics data
MOTIVATION: Recent advances in multimodal single-cell omics technologies enable multiple modalities of molecular attributes, such as gene expression, chromatin accessibility, and protein abundance, to be profiled simultaneously at a global level in individual cells. While the increasing availability...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10287920/ https://www.ncbi.nlm.nih.gov/pubmed/37314966 http://dx.doi.org/10.1093/bioinformatics/btad382 |
_version_ | 1785061970303516672 |
---|---|
author | Yu, Lijia Liu, Chunlei Yang, Jean Yee Hwa Yang, Pengyi |
author_facet | Yu, Lijia Liu, Chunlei Yang, Jean Yee Hwa Yang, Pengyi |
author_sort | Yu, Lijia |
collection | PubMed |
description | MOTIVATION: Recent advances in multimodal single-cell omics technologies enable multiple modalities of molecular attributes, such as gene expression, chromatin accessibility, and protein abundance, to be profiled simultaneously at a global level in individual cells. While the increasing availability of multiple data modalities is expected to provide a more accurate clustering and characterization of cells, the development of computational methods that are capable of extracting information embedded across data modalities is still in its infancy. RESULTS: We propose SnapCCESS for clustering cells by integrating data modalities in multimodal single-cell omics data using an unsupervised ensemble deep learning framework. By creating snapshots of embeddings of multimodality using variational autoencoders, SnapCCESS can be coupled with various clustering algorithms for generating consensus clustering of cells. We applied SnapCCESS with several clustering algorithms to various datasets generated from popular multimodal single-cell omics technologies. Our results demonstrate that SnapCCESS is effective and more efficient than conventional ensemble deep learning-based clustering methods and outperforms other state-of-the-art multimodal embedding generation methods in integrating data modalities for clustering cells. The improved clustering of cells from SnapCCESS will pave the way for more accurate characterization of cell identity and types, an essential step for various downstream analyses of multimodal single-cell omics data. AVAILABILITY AND IMPLEMENTATION: SnapCCESS is implemented as a Python package and is freely available from https://github.com/PYangLab/SnapCCESS under the open-source license of GPL-3. The data used in this study are publicly available (see section ‘Data availability’). |
format | Online Article Text |
id | pubmed-10287920 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-102879202023-06-24 Ensemble deep learning of embeddings for clustering multimodal single-cell omics data Yu, Lijia Liu, Chunlei Yang, Jean Yee Hwa Yang, Pengyi Bioinformatics Original Paper MOTIVATION: Recent advances in multimodal single-cell omics technologies enable multiple modalities of molecular attributes, such as gene expression, chromatin accessibility, and protein abundance, to be profiled simultaneously at a global level in individual cells. While the increasing availability of multiple data modalities is expected to provide a more accurate clustering and characterization of cells, the development of computational methods that are capable of extracting information embedded across data modalities is still in its infancy. RESULTS: We propose SnapCCESS for clustering cells by integrating data modalities in multimodal single-cell omics data using an unsupervised ensemble deep learning framework. By creating snapshots of embeddings of multimodality using variational autoencoders, SnapCCESS can be coupled with various clustering algorithms for generating consensus clustering of cells. We applied SnapCCESS with several clustering algorithms to various datasets generated from popular multimodal single-cell omics technologies. Our results demonstrate that SnapCCESS is effective and more efficient than conventional ensemble deep learning-based clustering methods and outperforms other state-of-the-art multimodal embedding generation methods in integrating data modalities for clustering cells. The improved clustering of cells from SnapCCESS will pave the way for more accurate characterization of cell identity and types, an essential step for various downstream analyses of multimodal single-cell omics data. AVAILABILITY AND IMPLEMENTATION: SnapCCESS is implemented as a Python package and is freely available from https://github.com/PYangLab/SnapCCESS under the open-source license of GPL-3. The data used in this study are publicly available (see section ‘Data availability’). Oxford University Press 2023-06-14 /pmc/articles/PMC10287920/ /pubmed/37314966 http://dx.doi.org/10.1093/bioinformatics/btad382 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Yu, Lijia Liu, Chunlei Yang, Jean Yee Hwa Yang, Pengyi Ensemble deep learning of embeddings for clustering multimodal single-cell omics data |
title | Ensemble deep learning of embeddings for clustering multimodal single-cell omics data |
title_full | Ensemble deep learning of embeddings for clustering multimodal single-cell omics data |
title_fullStr | Ensemble deep learning of embeddings for clustering multimodal single-cell omics data |
title_full_unstemmed | Ensemble deep learning of embeddings for clustering multimodal single-cell omics data |
title_short | Ensemble deep learning of embeddings for clustering multimodal single-cell omics data |
title_sort | ensemble deep learning of embeddings for clustering multimodal single-cell omics data |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10287920/ https://www.ncbi.nlm.nih.gov/pubmed/37314966 http://dx.doi.org/10.1093/bioinformatics/btad382 |
work_keys_str_mv | AT yulijia ensembledeeplearningofembeddingsforclusteringmultimodalsinglecellomicsdata AT liuchunlei ensembledeeplearningofembeddingsforclusteringmultimodalsinglecellomicsdata AT yangjeanyeehwa ensembledeeplearningofembeddingsforclusteringmultimodalsinglecellomicsdata AT yangpengyi ensembledeeplearningofembeddingsforclusteringmultimodalsinglecellomicsdata |