Cargando…

scMelody: An Enhanced Consensus-Based Clustering Model for Single-Cell Methylation Data by Reconstructing Cell-to-Cell Similarity

Single-cell DNA methylation sequencing technology has brought new perspectives to investigate epigenetic heterogeneity, supporting a need for computational methods to cluster cells based on single-cell methylation profiles. Although several methods have been developed, most of them cluster cells bas...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Qi, Zou, Jianxiao, Tang, Jianxiong, Liang, Liang, Cao, Xiaohong, Fan, Shicai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8905497/
https://www.ncbi.nlm.nih.gov/pubmed/35284424
http://dx.doi.org/10.3389/fbioe.2022.842019
_version_ 1784665200696229888
author Tian, Qi
Zou, Jianxiao
Tang, Jianxiong
Liang, Liang
Cao, Xiaohong
Fan, Shicai
author_facet Tian, Qi
Zou, Jianxiao
Tang, Jianxiong
Liang, Liang
Cao, Xiaohong
Fan, Shicai
author_sort Tian, Qi
collection PubMed
description Single-cell DNA methylation sequencing technology has brought new perspectives to investigate epigenetic heterogeneity, supporting a need for computational methods to cluster cells based on single-cell methylation profiles. Although several methods have been developed, most of them cluster cells based on single (dis)similarity measures, failing to capture complete cell heterogeneity and resulting in locally optimal solutions. Here, we present scMelody, which utilizes an enhanced consensus-based clustering model to reconstruct cell-to-cell methylation similarity patterns and identifies cell subpopulations with the leveraged information from multiple basic similarity measures. Besides, benefitted from the reconstructed cell-to-cell similarity measure, scMelody could conveniently leverage the clustering validation criteria to determine the optimal number of clusters. Assessments on distinct real datasets showed that scMelody accurately recapitulated methylation subpopulations and outperformed existing methods in terms of both cluster partitions and the number of clusters. Moreover, when benchmarking the clustering stability of scMelody on a variety of synthetic datasets, it achieved significant clustering performance gains over existing methods and robustly maintained its clustering accuracy over a wide range of number of cells, number of clusters and CpG dropout proportions. Finally, the real case studies demonstrated the capability of scMelody to assess known cell types and uncover novel cell clusters.
format Online
Article
Text
id pubmed-8905497
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89054972022-03-10 scMelody: An Enhanced Consensus-Based Clustering Model for Single-Cell Methylation Data by Reconstructing Cell-to-Cell Similarity Tian, Qi Zou, Jianxiao Tang, Jianxiong Liang, Liang Cao, Xiaohong Fan, Shicai Front Bioeng Biotechnol Bioengineering and Biotechnology Single-cell DNA methylation sequencing technology has brought new perspectives to investigate epigenetic heterogeneity, supporting a need for computational methods to cluster cells based on single-cell methylation profiles. Although several methods have been developed, most of them cluster cells based on single (dis)similarity measures, failing to capture complete cell heterogeneity and resulting in locally optimal solutions. Here, we present scMelody, which utilizes an enhanced consensus-based clustering model to reconstruct cell-to-cell methylation similarity patterns and identifies cell subpopulations with the leveraged information from multiple basic similarity measures. Besides, benefitted from the reconstructed cell-to-cell similarity measure, scMelody could conveniently leverage the clustering validation criteria to determine the optimal number of clusters. Assessments on distinct real datasets showed that scMelody accurately recapitulated methylation subpopulations and outperformed existing methods in terms of both cluster partitions and the number of clusters. Moreover, when benchmarking the clustering stability of scMelody on a variety of synthetic datasets, it achieved significant clustering performance gains over existing methods and robustly maintained its clustering accuracy over a wide range of number of cells, number of clusters and CpG dropout proportions. Finally, the real case studies demonstrated the capability of scMelody to assess known cell types and uncover novel cell clusters. Frontiers Media S.A. 2022-02-23 /pmc/articles/PMC8905497/ /pubmed/35284424 http://dx.doi.org/10.3389/fbioe.2022.842019 Text en Copyright © 2022 Tian, Zou, Tang, Liang, Cao and Fan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Tian, Qi
Zou, Jianxiao
Tang, Jianxiong
Liang, Liang
Cao, Xiaohong
Fan, Shicai
scMelody: An Enhanced Consensus-Based Clustering Model for Single-Cell Methylation Data by Reconstructing Cell-to-Cell Similarity
title scMelody: An Enhanced Consensus-Based Clustering Model for Single-Cell Methylation Data by Reconstructing Cell-to-Cell Similarity
title_full scMelody: An Enhanced Consensus-Based Clustering Model for Single-Cell Methylation Data by Reconstructing Cell-to-Cell Similarity
title_fullStr scMelody: An Enhanced Consensus-Based Clustering Model for Single-Cell Methylation Data by Reconstructing Cell-to-Cell Similarity
title_full_unstemmed scMelody: An Enhanced Consensus-Based Clustering Model for Single-Cell Methylation Data by Reconstructing Cell-to-Cell Similarity
title_short scMelody: An Enhanced Consensus-Based Clustering Model for Single-Cell Methylation Data by Reconstructing Cell-to-Cell Similarity
title_sort scmelody: an enhanced consensus-based clustering model for single-cell methylation data by reconstructing cell-to-cell similarity
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8905497/
https://www.ncbi.nlm.nih.gov/pubmed/35284424
http://dx.doi.org/10.3389/fbioe.2022.842019
work_keys_str_mv AT tianqi scmelodyanenhancedconsensusbasedclusteringmodelforsinglecellmethylationdatabyreconstructingcelltocellsimilarity
AT zoujianxiao scmelodyanenhancedconsensusbasedclusteringmodelforsinglecellmethylationdatabyreconstructingcelltocellsimilarity
AT tangjianxiong scmelodyanenhancedconsensusbasedclusteringmodelforsinglecellmethylationdatabyreconstructingcelltocellsimilarity
AT liangliang scmelodyanenhancedconsensusbasedclusteringmodelforsinglecellmethylationdatabyreconstructingcelltocellsimilarity
AT caoxiaohong scmelodyanenhancedconsensusbasedclusteringmodelforsinglecellmethylationdatabyreconstructingcelltocellsimilarity
AT fanshicai scmelodyanenhancedconsensusbasedclusteringmodelforsinglecellmethylationdatabyreconstructingcelltocellsimilarity