Cargando…

Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data

Single-cell RNA sequencing enables us to characterize the cellular heterogeneity in single cell resolution with the help of cell type identification algorithms. However, the noise inherent in single-cell RNA-sequencing data severely disturbs the accuracy of cell clustering, marker identification and...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Jia, Liu, Yao, Zhang, Xuebing, Wu, Qiuyue, Gao, Juan, Wang, Wei, Li, Jin, Song, Yanling, Yang, Chaoyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7897472/
https://www.ncbi.nlm.nih.gov/pubmed/33305325
http://dx.doi.org/10.1093/nar/gkaa1157
_version_ 1783653675614339072
author Song, Jia
Liu, Yao
Zhang, Xuebing
Wu, Qiuyue
Gao, Juan
Wang, Wei
Li, Jin
Song, Yanling
Yang, Chaoyong
author_facet Song, Jia
Liu, Yao
Zhang, Xuebing
Wu, Qiuyue
Gao, Juan
Wang, Wei
Li, Jin
Song, Yanling
Yang, Chaoyong
author_sort Song, Jia
collection PubMed
description Single-cell RNA sequencing enables us to characterize the cellular heterogeneity in single cell resolution with the help of cell type identification algorithms. However, the noise inherent in single-cell RNA-sequencing data severely disturbs the accuracy of cell clustering, marker identification and visualization. We propose that clustering based on feature density profiles can distinguish informative features from noise. We named such strategy as ‘entropy subspace’ separation and designed a cell clustering algorithm called ENtropy subspace separation-based Clustering for nOise REduction (ENCORE) by integrating the ‘entropy subspace’ separation strategy with a consensus clustering method. We demonstrate that ENCORE performs superiorly on cell clustering and generates high-resolution visualization across 12 standard datasets. More importantly, ENCORE enables identification of group markers with biological significance from a hard-to-separate dataset. With the advantages of effective feature selection, improved clustering, accurate marker identification and high-resolution visualization, we present ENCORE to the community as an important tool for scRNA-seq data analysis to study cellular heterogeneity and discover group markers.
format Online
Article
Text
id pubmed-7897472
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-78974722021-02-25 Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data Song, Jia Liu, Yao Zhang, Xuebing Wu, Qiuyue Gao, Juan Wang, Wei Li, Jin Song, Yanling Yang, Chaoyong Nucleic Acids Res Methods Online Single-cell RNA sequencing enables us to characterize the cellular heterogeneity in single cell resolution with the help of cell type identification algorithms. However, the noise inherent in single-cell RNA-sequencing data severely disturbs the accuracy of cell clustering, marker identification and visualization. We propose that clustering based on feature density profiles can distinguish informative features from noise. We named such strategy as ‘entropy subspace’ separation and designed a cell clustering algorithm called ENtropy subspace separation-based Clustering for nOise REduction (ENCORE) by integrating the ‘entropy subspace’ separation strategy with a consensus clustering method. We demonstrate that ENCORE performs superiorly on cell clustering and generates high-resolution visualization across 12 standard datasets. More importantly, ENCORE enables identification of group markers with biological significance from a hard-to-separate dataset. With the advantages of effective feature selection, improved clustering, accurate marker identification and high-resolution visualization, we present ENCORE to the community as an important tool for scRNA-seq data analysis to study cellular heterogeneity and discover group markers. Oxford University Press 2020-12-10 /pmc/articles/PMC7897472/ /pubmed/33305325 http://dx.doi.org/10.1093/nar/gkaa1157 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Song, Jia
Liu, Yao
Zhang, Xuebing
Wu, Qiuyue
Gao, Juan
Wang, Wei
Li, Jin
Song, Yanling
Yang, Chaoyong
Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data
title Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data
title_full Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data
title_fullStr Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data
title_full_unstemmed Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data
title_short Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data
title_sort entropy subspace separation-based clustering for noise reduction (encore) of scrna-seq data
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7897472/
https://www.ncbi.nlm.nih.gov/pubmed/33305325
http://dx.doi.org/10.1093/nar/gkaa1157
work_keys_str_mv AT songjia entropysubspaceseparationbasedclusteringfornoisereductionencoreofscrnaseqdata
AT liuyao entropysubspaceseparationbasedclusteringfornoisereductionencoreofscrnaseqdata
AT zhangxuebing entropysubspaceseparationbasedclusteringfornoisereductionencoreofscrnaseqdata
AT wuqiuyue entropysubspaceseparationbasedclusteringfornoisereductionencoreofscrnaseqdata
AT gaojuan entropysubspaceseparationbasedclusteringfornoisereductionencoreofscrnaseqdata
AT wangwei entropysubspaceseparationbasedclusteringfornoisereductionencoreofscrnaseqdata
AT lijin entropysubspaceseparationbasedclusteringfornoisereductionencoreofscrnaseqdata
AT songyanling entropysubspaceseparationbasedclusteringfornoisereductionencoreofscrnaseqdata
AT yangchaoyong entropysubspaceseparationbasedclusteringfornoisereductionencoreofscrnaseqdata