Cargando…

A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data

Annotation of cells in single-cell clustering requires a homogeneous grouping of cell populations. There are various issues in single cell sequencing that effect homogeneous grouping (clustering) of cells, such as small amount of starting RNA, limited per-cell sequenced reads, cell-to-cell variabili...

Descripción completa

Detalles Bibliográficos
Autores principales: Lall, Snehalika, Ray, Sumanta, Bandyopadhyay, Sanghamitra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8979455/
https://www.ncbi.nlm.nih.gov/pubmed/35271564
http://dx.doi.org/10.1371/journal.pcbi.1009600
_version_ 1784681179249639424
author Lall, Snehalika
Ray, Sumanta
Bandyopadhyay, Sanghamitra
author_facet Lall, Snehalika
Ray, Sumanta
Bandyopadhyay, Sanghamitra
author_sort Lall, Snehalika
collection PubMed
description Annotation of cells in single-cell clustering requires a homogeneous grouping of cell populations. There are various issues in single cell sequencing that effect homogeneous grouping (clustering) of cells, such as small amount of starting RNA, limited per-cell sequenced reads, cell-to-cell variability due to cell-cycle, cellular morphology, and variable reagent concentrations. Moreover, single cell data is susceptible to technical noise, which affects the quality of genes (or features) selected/extracted prior to clustering. Here we introduce sc-CGconv (copula based graph convolution network for single clustering), a stepwise robust unsupervised feature extraction and clustering approach that formulates and aggregates cell–cell relationships using copula correlation (Ccor), followed by a graph convolution network based clustering approach. sc-CGconv formulates a cell-cell graph using Ccor that is learned by a graph-based artificial intelligence model, graph convolution network. The learned representation (low dimensional embedding) is utilized for cell clustering. sc-CGconv features the following advantages. a. sc-CGconv works with substantially smaller sample sizes to identify homogeneous clusters. b. sc-CGconv can model the expression co-variability of a large number of genes, thereby outperforming state-of-the-art gene selection/extraction methods for clustering. c. sc-CGconv preserves the cell-to-cell variability within the selected gene set by constructing a cell-cell graph through copula correlation measure. d. sc-CGconv provides a topology-preserving embedding of cells in low dimensional space.
format Online
Article
Text
id pubmed-8979455
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-89794552022-04-05 A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data Lall, Snehalika Ray, Sumanta Bandyopadhyay, Sanghamitra PLoS Comput Biol Research Article Annotation of cells in single-cell clustering requires a homogeneous grouping of cell populations. There are various issues in single cell sequencing that effect homogeneous grouping (clustering) of cells, such as small amount of starting RNA, limited per-cell sequenced reads, cell-to-cell variability due to cell-cycle, cellular morphology, and variable reagent concentrations. Moreover, single cell data is susceptible to technical noise, which affects the quality of genes (or features) selected/extracted prior to clustering. Here we introduce sc-CGconv (copula based graph convolution network for single clustering), a stepwise robust unsupervised feature extraction and clustering approach that formulates and aggregates cell–cell relationships using copula correlation (Ccor), followed by a graph convolution network based clustering approach. sc-CGconv formulates a cell-cell graph using Ccor that is learned by a graph-based artificial intelligence model, graph convolution network. The learned representation (low dimensional embedding) is utilized for cell clustering. sc-CGconv features the following advantages. a. sc-CGconv works with substantially smaller sample sizes to identify homogeneous clusters. b. sc-CGconv can model the expression co-variability of a large number of genes, thereby outperforming state-of-the-art gene selection/extraction methods for clustering. c. sc-CGconv preserves the cell-to-cell variability within the selected gene set by constructing a cell-cell graph through copula correlation measure. d. sc-CGconv provides a topology-preserving embedding of cells in low dimensional space. Public Library of Science 2022-03-10 /pmc/articles/PMC8979455/ /pubmed/35271564 http://dx.doi.org/10.1371/journal.pcbi.1009600 Text en © 2022 Lall et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lall, Snehalika
Ray, Sumanta
Bandyopadhyay, Sanghamitra
A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data
title A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data
title_full A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data
title_fullStr A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data
title_full_unstemmed A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data
title_short A copula based topology preserving graph convolution network for clustering of single-cell RNA-seq data
title_sort copula based topology preserving graph convolution network for clustering of single-cell rna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8979455/
https://www.ncbi.nlm.nih.gov/pubmed/35271564
http://dx.doi.org/10.1371/journal.pcbi.1009600
work_keys_str_mv AT lallsnehalika acopulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata
AT raysumanta acopulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata
AT bandyopadhyaysanghamitra acopulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata
AT lallsnehalika copulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata
AT raysumanta copulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata
AT bandyopadhyaysanghamitra copulabasedtopologypreservinggraphconvolutionnetworkforclusteringofsinglecellrnaseqdata