Cargando…
Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means
Advances in single cell transcriptomics have allowed us to study the identity of single cells. This has led to the discovery of new cell types and high resolution tissue maps of them. Technologies that measure multiple modalities of such data add more detail, but they also complicate data integratio...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8606648/ https://www.ncbi.nlm.nih.gov/pubmed/34819947 http://dx.doi.org/10.3389/fgene.2021.763263 |
_version_ | 1784602377053011968 |
---|---|
author | Oh, Sooyoun Park, Haesun Zhang, Xiuwei |
author_facet | Oh, Sooyoun Park, Haesun Zhang, Xiuwei |
author_sort | Oh, Sooyoun |
collection | PubMed |
description | Advances in single cell transcriptomics have allowed us to study the identity of single cells. This has led to the discovery of new cell types and high resolution tissue maps of them. Technologies that measure multiple modalities of such data add more detail, but they also complicate data integration. We offer an integrated analysis of the spatial location and gene expression profiles of cells to determine their identity. We propose scHybridNMF (single-cell Hybrid Nonnegative Matrix Factorization), which performs cell type identification by combining sparse nonnegative matrix factorization (sparse NMF) with k-means clustering to cluster high-dimensional gene expression and low-dimensional location data. We show that, under multiple scenarios, including the cases where there is a small number of genes profiled and the location data is noisy, scHybridNMF outperforms sparse NMF, k-means, and an existing method that uses a hidden Markov random field to encode cell location and gene expression data for cell type identification. |
format | Online Article Text |
id | pubmed-8606648 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-86066482021-11-23 Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means Oh, Sooyoun Park, Haesun Zhang, Xiuwei Front Genet Genetics Advances in single cell transcriptomics have allowed us to study the identity of single cells. This has led to the discovery of new cell types and high resolution tissue maps of them. Technologies that measure multiple modalities of such data add more detail, but they also complicate data integration. We offer an integrated analysis of the spatial location and gene expression profiles of cells to determine their identity. We propose scHybridNMF (single-cell Hybrid Nonnegative Matrix Factorization), which performs cell type identification by combining sparse nonnegative matrix factorization (sparse NMF) with k-means clustering to cluster high-dimensional gene expression and low-dimensional location data. We show that, under multiple scenarios, including the cases where there is a small number of genes profiled and the location data is noisy, scHybridNMF outperforms sparse NMF, k-means, and an existing method that uses a hidden Markov random field to encode cell location and gene expression data for cell type identification. Frontiers Media S.A. 2021-11-08 /pmc/articles/PMC8606648/ /pubmed/34819947 http://dx.doi.org/10.3389/fgene.2021.763263 Text en Copyright © 2021 Oh, Park and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Oh, Sooyoun Park, Haesun Zhang, Xiuwei Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means |
title | Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means |
title_full | Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means |
title_fullStr | Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means |
title_full_unstemmed | Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means |
title_short | Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means |
title_sort | hybrid clustering of single-cell gene expression and spatial information via integrated nmf and k-means |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8606648/ https://www.ncbi.nlm.nih.gov/pubmed/34819947 http://dx.doi.org/10.3389/fgene.2021.763263 |
work_keys_str_mv | AT ohsooyoun hybridclusteringofsinglecellgeneexpressionandspatialinformationviaintegratednmfandkmeans AT parkhaesun hybridclusteringofsinglecellgeneexpressionandspatialinformationviaintegratednmfandkmeans AT zhangxiuwei hybridclusteringofsinglecellgeneexpressionandspatialinformationviaintegratednmfandkmeans |