Cargando…

Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell

Clustering is an efficient way to analyze single-cell RNA sequencing data. It is commonly used to identify cell types, which can help in understanding cell differentiation processes. However, different clustering results can be obtained from different single-cell clustering methods, sometimes includ...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Xiaoshu, Li, Jian, Li, Hong-Dong, Xie, Miao, Wang, Jianxin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7770236/
https://www.ncbi.nlm.nih.gov/pubmed/33384718
http://dx.doi.org/10.3389/fgene.2020.604790
_version_ 1783629465605111808
author Zhu, Xiaoshu
Li, Jian
Li, Hong-Dong
Xie, Miao
Wang, Jianxin
author_facet Zhu, Xiaoshu
Li, Jian
Li, Hong-Dong
Xie, Miao
Wang, Jianxin
author_sort Zhu, Xiaoshu
collection PubMed
description Clustering is an efficient way to analyze single-cell RNA sequencing data. It is commonly used to identify cell types, which can help in understanding cell differentiation processes. However, different clustering results can be obtained from different single-cell clustering methods, sometimes including conflicting conclusions, and biologists will often fail to get the right clustering results and interpret the biological significance. The cluster ensemble strategy can be an effective solution for the problem. As the graph partitioning-based clustering methods are good at clustering single-cell, we developed Sc-GPE, a novel cluster ensemble method combining five single-cell graph partitioning-based clustering methods. The five methods are SNN-cliq, PhenoGraph, SC3, SSNN-Louvain, and MPGS-Louvain. In Sc-GPE, a consensus matrix is constructed based on the five clustering solutions by calculating the probability that the cell pairs are divided into the same cluster. It solved the problem in the hypergraph-based ensemble approach, including the different cluster labels that were assigned in the individual clustering method, and it was difficult to find the corresponding cluster labels across all methods. Then, to distinguish the different importance of each method in a clustering ensemble, a weighted consensus matrix was constructed by designing an importance score strategy. Finally, hierarchical clustering was performed on the weighted consensus matrix to cluster cells. To evaluate the performance, we compared Sc-GPE with the individual clustering methods and the state-of-the-art SAME-clustering on 12 single-cell RNA-seq datasets. The results show that Sc-GPE obtained the best average performance, and achieved the highest NMI and ARI value in five datasets.
format Online
Article
Text
id pubmed-7770236
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-77702362020-12-30 Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell Zhu, Xiaoshu Li, Jian Li, Hong-Dong Xie, Miao Wang, Jianxin Front Genet Genetics Clustering is an efficient way to analyze single-cell RNA sequencing data. It is commonly used to identify cell types, which can help in understanding cell differentiation processes. However, different clustering results can be obtained from different single-cell clustering methods, sometimes including conflicting conclusions, and biologists will often fail to get the right clustering results and interpret the biological significance. The cluster ensemble strategy can be an effective solution for the problem. As the graph partitioning-based clustering methods are good at clustering single-cell, we developed Sc-GPE, a novel cluster ensemble method combining five single-cell graph partitioning-based clustering methods. The five methods are SNN-cliq, PhenoGraph, SC3, SSNN-Louvain, and MPGS-Louvain. In Sc-GPE, a consensus matrix is constructed based on the five clustering solutions by calculating the probability that the cell pairs are divided into the same cluster. It solved the problem in the hypergraph-based ensemble approach, including the different cluster labels that were assigned in the individual clustering method, and it was difficult to find the corresponding cluster labels across all methods. Then, to distinguish the different importance of each method in a clustering ensemble, a weighted consensus matrix was constructed by designing an importance score strategy. Finally, hierarchical clustering was performed on the weighted consensus matrix to cluster cells. To evaluate the performance, we compared Sc-GPE with the individual clustering methods and the state-of-the-art SAME-clustering on 12 single-cell RNA-seq datasets. The results show that Sc-GPE obtained the best average performance, and achieved the highest NMI and ARI value in five datasets. Frontiers Media S.A. 2020-12-15 /pmc/articles/PMC7770236/ /pubmed/33384718 http://dx.doi.org/10.3389/fgene.2020.604790 Text en Copyright © 2020 Zhu, Li, Li, Xie and Wang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zhu, Xiaoshu
Li, Jian
Li, Hong-Dong
Xie, Miao
Wang, Jianxin
Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell
title Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell
title_full Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell
title_fullStr Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell
title_full_unstemmed Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell
title_short Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell
title_sort sc-gpe: a graph partitioning-based cluster ensemble method for single-cell
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7770236/
https://www.ncbi.nlm.nih.gov/pubmed/33384718
http://dx.doi.org/10.3389/fgene.2020.604790
work_keys_str_mv AT zhuxiaoshu scgpeagraphpartitioningbasedclusterensemblemethodforsinglecell
AT lijian scgpeagraphpartitioningbasedclusterensemblemethodforsinglecell
AT lihongdong scgpeagraphpartitioningbasedclusterensemblemethodforsinglecell
AT xiemiao scgpeagraphpartitioningbasedclusterensemblemethodforsinglecell
AT wangjianxin scgpeagraphpartitioningbasedclusterensemblemethodforsinglecell