Cargando…

Consensus clustering of single-cell RNA-seq data by enhancing network affinity

Elucidation of cell subpopulations at high resolution is a key and challenging goal of single-cell ribonucleic acid (RNA) sequencing (scRNA-seq) data analysis. Although unsupervised clustering methods have been proposed for de novo identification of cell populations, their performance and robustness...

Descripción completa

Detalles Bibliográficos
Autores principales: Cui, Yaxuan, Zhang, Shaoqiang, Liang, Ying, Wang, Xiangyun, Ferraro, Thomas N, Chen, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574980/
https://www.ncbi.nlm.nih.gov/pubmed/34160582
http://dx.doi.org/10.1093/bib/bbab236
_version_ 1784595598512488448
author Cui, Yaxuan
Zhang, Shaoqiang
Liang, Ying
Wang, Xiangyun
Ferraro, Thomas N
Chen, Yong
author_facet Cui, Yaxuan
Zhang, Shaoqiang
Liang, Ying
Wang, Xiangyun
Ferraro, Thomas N
Chen, Yong
author_sort Cui, Yaxuan
collection PubMed
description Elucidation of cell subpopulations at high resolution is a key and challenging goal of single-cell ribonucleic acid (RNA) sequencing (scRNA-seq) data analysis. Although unsupervised clustering methods have been proposed for de novo identification of cell populations, their performance and robustness suffer from the high variability, low capture efficiency and high dropout rates which are characteristic of scRNA-seq experiments. Here, we present a novel unsupervised method for Single-cell Clustering by Enhancing Network Affinity (SCENA), which mainly employed three strategies: selecting multiple gene sets, enhancing local affinity among cells and clustering of consensus matrices. Large-scale validations on 13 real scRNA-seq datasets show that SCENA has high accuracy in detecting cell populations and is robust against dropout noise. When we applied SCENA to large-scale scRNA-seq data of mouse brain cells, known cell types were successfully detected, and novel cell types of interneurons were identified with differential expression of gamma-aminobutyric acid receptor subunits and transporters. SCENA is equipped with CPU + GPU (Central Processing Units + Graphics Processing Units) heterogeneous parallel computing to achieve high running speed. The high performance and running speed of SCENA combine into a new and efficient platform for biological discoveries in clustering analysis of large and diverse scRNA-seq datasets.
format Online
Article
Text
id pubmed-8574980
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85749802021-11-09 Consensus clustering of single-cell RNA-seq data by enhancing network affinity Cui, Yaxuan Zhang, Shaoqiang Liang, Ying Wang, Xiangyun Ferraro, Thomas N Chen, Yong Brief Bioinform Problem Solving Protocol Elucidation of cell subpopulations at high resolution is a key and challenging goal of single-cell ribonucleic acid (RNA) sequencing (scRNA-seq) data analysis. Although unsupervised clustering methods have been proposed for de novo identification of cell populations, their performance and robustness suffer from the high variability, low capture efficiency and high dropout rates which are characteristic of scRNA-seq experiments. Here, we present a novel unsupervised method for Single-cell Clustering by Enhancing Network Affinity (SCENA), which mainly employed three strategies: selecting multiple gene sets, enhancing local affinity among cells and clustering of consensus matrices. Large-scale validations on 13 real scRNA-seq datasets show that SCENA has high accuracy in detecting cell populations and is robust against dropout noise. When we applied SCENA to large-scale scRNA-seq data of mouse brain cells, known cell types were successfully detected, and novel cell types of interneurons were identified with differential expression of gamma-aminobutyric acid receptor subunits and transporters. SCENA is equipped with CPU + GPU (Central Processing Units + Graphics Processing Units) heterogeneous parallel computing to achieve high running speed. The high performance and running speed of SCENA combine into a new and efficient platform for biological discoveries in clustering analysis of large and diverse scRNA-seq datasets. Oxford University Press 2021-06-23 /pmc/articles/PMC8574980/ /pubmed/34160582 http://dx.doi.org/10.1093/bib/bbab236 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Cui, Yaxuan
Zhang, Shaoqiang
Liang, Ying
Wang, Xiangyun
Ferraro, Thomas N
Chen, Yong
Consensus clustering of single-cell RNA-seq data by enhancing network affinity
title Consensus clustering of single-cell RNA-seq data by enhancing network affinity
title_full Consensus clustering of single-cell RNA-seq data by enhancing network affinity
title_fullStr Consensus clustering of single-cell RNA-seq data by enhancing network affinity
title_full_unstemmed Consensus clustering of single-cell RNA-seq data by enhancing network affinity
title_short Consensus clustering of single-cell RNA-seq data by enhancing network affinity
title_sort consensus clustering of single-cell rna-seq data by enhancing network affinity
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574980/
https://www.ncbi.nlm.nih.gov/pubmed/34160582
http://dx.doi.org/10.1093/bib/bbab236
work_keys_str_mv AT cuiyaxuan consensusclusteringofsinglecellrnaseqdatabyenhancingnetworkaffinity
AT zhangshaoqiang consensusclusteringofsinglecellrnaseqdatabyenhancingnetworkaffinity
AT liangying consensusclusteringofsinglecellrnaseqdatabyenhancingnetworkaffinity
AT wangxiangyun consensusclusteringofsinglecellrnaseqdatabyenhancingnetworkaffinity
AT ferrarothomasn consensusclusteringofsinglecellrnaseqdatabyenhancingnetworkaffinity
AT chenyong consensusclusteringofsinglecellrnaseqdatabyenhancingnetworkaffinity