Cargando…

ProgClust: A progressive clustering method to identify cell populations

Identifying different types of cells in scRNA-seq data is a critical task in single-cell data analysis. In this paper, we propose a method called ProgClust for the decomposition of cell populations and detection of rare cells. ProgClust represents the single-cell data with clustering trees where a p...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Han, Wang, Ying, Lai, Yongxuan, Zeng, Feng, Yang, Fan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10115987/
https://www.ncbi.nlm.nih.gov/pubmed/37091787
http://dx.doi.org/10.3389/fgene.2023.1183099
_version_ 1785028325445468160
author Li, Han
Wang, Ying
Lai, Yongxuan
Zeng, Feng
Yang, Fan
author_facet Li, Han
Wang, Ying
Lai, Yongxuan
Zeng, Feng
Yang, Fan
author_sort Li, Han
collection PubMed
description Identifying different types of cells in scRNA-seq data is a critical task in single-cell data analysis. In this paper, we propose a method called ProgClust for the decomposition of cell populations and detection of rare cells. ProgClust represents the single-cell data with clustering trees where a progressive searching method is designed to select cell population-specific genes and cluster cells. The obtained trees reveal the structure of both abundant cell populations and rare cell populations. Additionally, it can automatically determine the number of clusters. Experimental results show that ProgClust outperforms the baseline method and is capable of accurately identifying both common and rare cells. Moreover, when applied to real unlabeled data, it reveals potential cell subpopulations which provides clues for further exploration. In summary, ProgClust shows potential in identifying subpopulations of complex single-cell data.
format Online
Article
Text
id pubmed-10115987
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-101159872023-04-21 ProgClust: A progressive clustering method to identify cell populations Li, Han Wang, Ying Lai, Yongxuan Zeng, Feng Yang, Fan Front Genet Genetics Identifying different types of cells in scRNA-seq data is a critical task in single-cell data analysis. In this paper, we propose a method called ProgClust for the decomposition of cell populations and detection of rare cells. ProgClust represents the single-cell data with clustering trees where a progressive searching method is designed to select cell population-specific genes and cluster cells. The obtained trees reveal the structure of both abundant cell populations and rare cell populations. Additionally, it can automatically determine the number of clusters. Experimental results show that ProgClust outperforms the baseline method and is capable of accurately identifying both common and rare cells. Moreover, when applied to real unlabeled data, it reveals potential cell subpopulations which provides clues for further exploration. In summary, ProgClust shows potential in identifying subpopulations of complex single-cell data. Frontiers Media S.A. 2023-04-06 /pmc/articles/PMC10115987/ /pubmed/37091787 http://dx.doi.org/10.3389/fgene.2023.1183099 Text en Copyright © 2023 Li, Wang, Lai, Zeng and Yang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Li, Han
Wang, Ying
Lai, Yongxuan
Zeng, Feng
Yang, Fan
ProgClust: A progressive clustering method to identify cell populations
title ProgClust: A progressive clustering method to identify cell populations
title_full ProgClust: A progressive clustering method to identify cell populations
title_fullStr ProgClust: A progressive clustering method to identify cell populations
title_full_unstemmed ProgClust: A progressive clustering method to identify cell populations
title_short ProgClust: A progressive clustering method to identify cell populations
title_sort progclust: a progressive clustering method to identify cell populations
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10115987/
https://www.ncbi.nlm.nih.gov/pubmed/37091787
http://dx.doi.org/10.3389/fgene.2023.1183099
work_keys_str_mv AT lihan progclustaprogressiveclusteringmethodtoidentifycellpopulations
AT wangying progclustaprogressiveclusteringmethodtoidentifycellpopulations
AT laiyongxuan progclustaprogressiveclusteringmethodtoidentifycellpopulations
AT zengfeng progclustaprogressiveclusteringmethodtoidentifycellpopulations
AT yangfan progclustaprogressiveclusteringmethodtoidentifycellpopulations