Cargando…

Clustering single-cell multimodal omics data with jrSiCKLSNMF

Introduction: The development of multimodal single-cell omics methods has enabled the collection of data across different omics modalities from the same set of single cells. Each omics modality provides unique information about cell type and function, so the ability to integrate data from different...

Descripción completa

Detalles Bibliográficos
Autores principales: Ellis, Dorothy, Roy, Arkaprava, Datta, Susmita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10288154/
https://www.ncbi.nlm.nih.gov/pubmed/37359367
http://dx.doi.org/10.3389/fgene.2023.1179439
_version_ 1785062020246142976
author Ellis, Dorothy
Roy, Arkaprava
Datta, Susmita
author_facet Ellis, Dorothy
Roy, Arkaprava
Datta, Susmita
author_sort Ellis, Dorothy
collection PubMed
description Introduction: The development of multimodal single-cell omics methods has enabled the collection of data across different omics modalities from the same set of single cells. Each omics modality provides unique information about cell type and function, so the ability to integrate data from different modalities can provide deeper insights into cellular functions. Often, single-cell omics data can prove challenging to model because of high dimensionality, sparsity, and technical noise. Methods: We propose a novel multimodal data analysis method called joint graph-regularized Single-Cell Kullback-Leibler Sparse Non-negative Matrix Factorization (jrSiCKLSNMF, pronounced “junior sickles NMF”) that extracts latent factors shared across omics modalities within the same set of single cells. Results: We compare our clustering algorithm to several existing methods on four sets of data simulated from third party software. We also apply our algorithm to a real set of cell line data. Discussion: We show overwhelmingly better clustering performance than several existing methods on the simulated data. On a real multimodal omics dataset, we also find our method to produce scientifically accurate clustering results.
format Online
Article
Text
id pubmed-10288154
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-102881542023-06-24 Clustering single-cell multimodal omics data with jrSiCKLSNMF Ellis, Dorothy Roy, Arkaprava Datta, Susmita Front Genet Genetics Introduction: The development of multimodal single-cell omics methods has enabled the collection of data across different omics modalities from the same set of single cells. Each omics modality provides unique information about cell type and function, so the ability to integrate data from different modalities can provide deeper insights into cellular functions. Often, single-cell omics data can prove challenging to model because of high dimensionality, sparsity, and technical noise. Methods: We propose a novel multimodal data analysis method called joint graph-regularized Single-Cell Kullback-Leibler Sparse Non-negative Matrix Factorization (jrSiCKLSNMF, pronounced “junior sickles NMF”) that extracts latent factors shared across omics modalities within the same set of single cells. Results: We compare our clustering algorithm to several existing methods on four sets of data simulated from third party software. We also apply our algorithm to a real set of cell line data. Discussion: We show overwhelmingly better clustering performance than several existing methods on the simulated data. On a real multimodal omics dataset, we also find our method to produce scientifically accurate clustering results. Frontiers Media S.A. 2023-06-09 /pmc/articles/PMC10288154/ /pubmed/37359367 http://dx.doi.org/10.3389/fgene.2023.1179439 Text en Copyright © 2023 Ellis, Roy and Datta. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Ellis, Dorothy
Roy, Arkaprava
Datta, Susmita
Clustering single-cell multimodal omics data with jrSiCKLSNMF
title Clustering single-cell multimodal omics data with jrSiCKLSNMF
title_full Clustering single-cell multimodal omics data with jrSiCKLSNMF
title_fullStr Clustering single-cell multimodal omics data with jrSiCKLSNMF
title_full_unstemmed Clustering single-cell multimodal omics data with jrSiCKLSNMF
title_short Clustering single-cell multimodal omics data with jrSiCKLSNMF
title_sort clustering single-cell multimodal omics data with jrsicklsnmf
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10288154/
https://www.ncbi.nlm.nih.gov/pubmed/37359367
http://dx.doi.org/10.3389/fgene.2023.1179439
work_keys_str_mv AT ellisdorothy clusteringsinglecellmultimodalomicsdatawithjrsicklsnmf
AT royarkaprava clusteringsinglecellmultimodalomicsdatawithjrsicklsnmf
AT dattasusmita clusteringsinglecellmultimodalomicsdatawithjrsicklsnmf