Cargando…
A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data
Annotation of cells in single-cell clustering requires a homogeneous grouping of cell populations. There are various issues in single cell sequencing that effect homogeneous grouping (clustering) of cells, such as small amount of starting RNA, limited per-cell sequenced reads, cell-to-cell variabili...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8979455/ https://www.ncbi.nlm.nih.gov/pubmed/35271564 http://dx.doi.org/10.1371/journal.pcbi.1009600 |
_version_ | 1784681179249639424 |
---|---|
author | Lall, Snehalika Ray, Sumanta Bandyopadhyay, Sanghamitra |
author_facet | Lall, Snehalika Ray, Sumanta Bandyopadhyay, Sanghamitra |
author_sort | Lall, Snehalika |
collection | PubMed |
description | Annotation of cells in single-cell clustering requires a homogeneous grouping of cell populations. There are various issues in single cell sequencing that effect homogeneous grouping (clustering) of cells, such as small amount of starting RNA, limited per-cell sequenced reads, cell-to-cell variability due to cell-cycle, cellular morphology, and variable reagent concentrations. Moreover, single cell data is susceptible to technical noise, which affects the quality of genes (or features) selected/extracted prior to clustering. Here we introduce sc-CGconv (copula based graph convolution network for single clustering), a stepwise robust unsupervised feature extraction and clustering approach that formulates and aggregates cell–cell relationships using copula correlation (Ccor), followed by a graph convolution network based clustering approach. sc-CGconv formulates a cell-cell graph using Ccor that is learned by a graph-based artificial intelligence model, graph convolution network. The learned representation (low dimensional embedding) is utilized for cell clustering. sc-CGconv features the following advantages. a. sc-CGconv works with substantially smaller sample sizes to identify homogeneous clusters. b. sc-CGconv can model the expression co-variability of a large number of genes, thereby outperforming state-of-the-art gene selection/extraction methods for clustering. c. sc-CGconv preserves the cell-to-cell variability within the selected gene set by constructing a cell-cell graph through copula correlation measure. d. sc-CGconv provides a topology-preserving embedding of cells in low dimensional space. |
format | Online Article Text |
id | pubmed-8979455 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-89794552022-04-05 A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data Lall, Snehalika Ray, Sumanta Bandyopadhyay, Sanghamitra PLoS Comput Biol Research Article Annotation of cells in single-cell clustering requires a homogeneous grouping of cell populations. There are various issues in single cell sequencing that effect homogeneous grouping (clustering) of cells, such as small amount of starting RNA, limited per-cell sequenced reads, cell-to-cell variability due to cell-cycle, cellular morphology, and variable reagent concentrations. Moreover, single cell data is susceptible to technical noise, which affects the quality of genes (or features) selected/extracted prior to clustering. Here we introduce sc-CGconv (copula based graph convolution network for single clustering), a stepwise robust unsupervised feature extraction and clustering approach that formulates and aggregates cell–cell relationships using copula correlation (Ccor), followed by a graph convolution network based clustering approach. sc-CGconv formulates a cell-cell graph using Ccor that is learned by a graph-based artificial intelligence model, graph convolution network. The learned representation (low dimensional embedding) is utilized for cell clustering. sc-CGconv features the following advantages. a. sc-CGconv works with substantially smaller sample sizes to identify homogeneous clusters. b. sc-CGconv can model the expression co-variability of a large number of genes, thereby outperforming state-of-the-art gene selection/extraction methods for clustering. c. sc-CGconv preserves the cell-to-cell variability within the selected gene set by constructing a cell-cell graph through copula correlation measure. d. sc-CGconv provides a topology-preserving embedding of cells in low dimensional space. Public Library of Science 2022-03-10 /pmc/articles/PMC8979455/ /pubmed/35271564 http://dx.doi.org/10.1371/journal.pcbi.1009600 Text en © 2022 Lall et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lall, Snehalika Ray, Sumanta Bandyopadhyay, Sanghamitra A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data |
title | A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data |
title_full | A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data |
title_fullStr | A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data |
title_full_unstemmed | A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data |
title_short | A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data |
title_sort | copula based topology preserving graph convolution network for clustering of single-cell rna-seq data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8979455/ https://www.ncbi.nlm.nih.gov/pubmed/35271564 http://dx.doi.org/10.1371/journal.pcbi.1009600 |
work_keys_str_mv | AT lallsnehalika acopulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata AT raysumanta acopulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata AT bandyopadhyaysanghamitra acopulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata AT lallsnehalika copulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata AT raysumanta copulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata AT bandyopadhyaysanghamitra copulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata |