Cargando…

Ensemble deep learning of embeddings for clustering multimodal single-cell omics data

MOTIVATION: Recent advances in multimodal single-cell omics technologies enable multiple modalities of molecular attributes, such as gene expression, chromatin accessibility, and protein abundance, to be profiled simultaneously at a global level in individual cells. While the increasing availability...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Lijia, Liu, Chunlei, Yang, Jean Yee Hwa, Yang, Pengyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10287920/
https://www.ncbi.nlm.nih.gov/pubmed/37314966
http://dx.doi.org/10.1093/bioinformatics/btad382
_version_ 1785061970303516672
author Yu, Lijia
Liu, Chunlei
Yang, Jean Yee Hwa
Yang, Pengyi
author_facet Yu, Lijia
Liu, Chunlei
Yang, Jean Yee Hwa
Yang, Pengyi
author_sort Yu, Lijia
collection PubMed
description MOTIVATION: Recent advances in multimodal single-cell omics technologies enable multiple modalities of molecular attributes, such as gene expression, chromatin accessibility, and protein abundance, to be profiled simultaneously at a global level in individual cells. While the increasing availability of multiple data modalities is expected to provide a more accurate clustering and characterization of cells, the development of computational methods that are capable of extracting information embedded across data modalities is still in its infancy. RESULTS: We propose SnapCCESS for clustering cells by integrating data modalities in multimodal single-cell omics data using an unsupervised ensemble deep learning framework. By creating snapshots of embeddings of multimodality using variational autoencoders, SnapCCESS can be coupled with various clustering algorithms for generating consensus clustering of cells. We applied SnapCCESS with several clustering algorithms to various datasets generated from popular multimodal single-cell omics technologies. Our results demonstrate that SnapCCESS is effective and more efficient than conventional ensemble deep learning-based clustering methods and outperforms other state-of-the-art multimodal embedding generation methods in integrating data modalities for clustering cells. The improved clustering of cells from SnapCCESS will pave the way for more accurate characterization of cell identity and types, an essential step for various downstream analyses of multimodal single-cell omics data. AVAILABILITY AND IMPLEMENTATION: SnapCCESS is implemented as a Python package and is freely available from https://github.com/PYangLab/SnapCCESS under the open-source license of GPL-3. The data used in this study are publicly available (see section ‘Data availability’).
format Online
Article
Text
id pubmed-10287920
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102879202023-06-24 Ensemble deep learning of embeddings for clustering multimodal single-cell omics data Yu, Lijia Liu, Chunlei Yang, Jean Yee Hwa Yang, Pengyi Bioinformatics Original Paper MOTIVATION: Recent advances in multimodal single-cell omics technologies enable multiple modalities of molecular attributes, such as gene expression, chromatin accessibility, and protein abundance, to be profiled simultaneously at a global level in individual cells. While the increasing availability of multiple data modalities is expected to provide a more accurate clustering and characterization of cells, the development of computational methods that are capable of extracting information embedded across data modalities is still in its infancy. RESULTS: We propose SnapCCESS for clustering cells by integrating data modalities in multimodal single-cell omics data using an unsupervised ensemble deep learning framework. By creating snapshots of embeddings of multimodality using variational autoencoders, SnapCCESS can be coupled with various clustering algorithms for generating consensus clustering of cells. We applied SnapCCESS with several clustering algorithms to various datasets generated from popular multimodal single-cell omics technologies. Our results demonstrate that SnapCCESS is effective and more efficient than conventional ensemble deep learning-based clustering methods and outperforms other state-of-the-art multimodal embedding generation methods in integrating data modalities for clustering cells. The improved clustering of cells from SnapCCESS will pave the way for more accurate characterization of cell identity and types, an essential step for various downstream analyses of multimodal single-cell omics data. AVAILABILITY AND IMPLEMENTATION: SnapCCESS is implemented as a Python package and is freely available from https://github.com/PYangLab/SnapCCESS under the open-source license of GPL-3. The data used in this study are publicly available (see section ‘Data availability’). Oxford University Press 2023-06-14 /pmc/articles/PMC10287920/ /pubmed/37314966 http://dx.doi.org/10.1093/bioinformatics/btad382 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Yu, Lijia
Liu, Chunlei
Yang, Jean Yee Hwa
Yang, Pengyi
Ensemble deep learning of embeddings for clustering multimodal single-cell omics data
title Ensemble deep learning of embeddings for clustering multimodal single-cell omics data
title_full Ensemble deep learning of embeddings for clustering multimodal single-cell omics data
title_fullStr Ensemble deep learning of embeddings for clustering multimodal single-cell omics data
title_full_unstemmed Ensemble deep learning of embeddings for clustering multimodal single-cell omics data
title_short Ensemble deep learning of embeddings for clustering multimodal single-cell omics data
title_sort ensemble deep learning of embeddings for clustering multimodal single-cell omics data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10287920/
https://www.ncbi.nlm.nih.gov/pubmed/37314966
http://dx.doi.org/10.1093/bioinformatics/btad382
work_keys_str_mv AT yulijia ensembledeeplearningofembeddingsforclusteringmultimodalsinglecellomicsdata
AT liuchunlei ensembledeeplearningofembeddingsforclusteringmultimodalsinglecellomicsdata
AT yangjeanyeehwa ensembledeeplearningofembeddingsforclusteringmultimodalsinglecellomicsdata
AT yangpengyi ensembledeeplearningofembeddingsforclusteringmultimodalsinglecellomicsdata