Cargando…
SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data
Single-cell DNA sequencing (scDNA-seq) enables high-resolution profiling of genetic diversity among single cells and is especially useful for deciphering the intra-tumor heterogeneity and evolutionary history of tumor. Specific technical issues such as allele dropout, false-positive errors, and doub...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8830741/ https://www.ncbi.nlm.nih.gov/pubmed/35154282 http://dx.doi.org/10.3389/fgene.2022.823941 |
_version_ | 1784648340010434560 |
---|---|
author | Yu , Zhenhua Du, Fang Song, Lijuan |
author_facet | Yu , Zhenhua Du, Fang Song, Lijuan |
author_sort | Yu , Zhenhua |
collection | PubMed |
description | Single-cell DNA sequencing (scDNA-seq) enables high-resolution profiling of genetic diversity among single cells and is especially useful for deciphering the intra-tumor heterogeneity and evolutionary history of tumor. Specific technical issues such as allele dropout, false-positive errors, and doublets make scDNA-seq data incomplete and error-prone, giving rise to a severe challenge of accurately inferring clonal architecture of tumor. To effectively address these issues, we introduce a new computational method called SCClone for reasoning subclones from single nucleotide variation (SNV) data of single cells. Specifically, SCClone leverages a probability mixture model for binary data to cluster single cells into distinct subclones. To accurately decipher underlying clonal composition, a novel model selection scheme based on inter-cluster variance is employed to find the optimal number of subclones. Extensive evaluations on various simulated datasets suggest SCClone has strong robustness against different technical noises in scDNA-seq data and achieves better performance than the state-of-the-art methods in reasoning clonal composition. Further evaluations of SCClone on three real scDNA-seq datasets show that it can effectively find the underlying subclones from severely disturbed data. The SCClone software is freely available at https://github.com/qasimyu/scclone. |
format | Online Article Text |
id | pubmed-8830741 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88307412022-02-11 SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data Yu , Zhenhua Du, Fang Song, Lijuan Front Genet Genetics Single-cell DNA sequencing (scDNA-seq) enables high-resolution profiling of genetic diversity among single cells and is especially useful for deciphering the intra-tumor heterogeneity and evolutionary history of tumor. Specific technical issues such as allele dropout, false-positive errors, and doublets make scDNA-seq data incomplete and error-prone, giving rise to a severe challenge of accurately inferring clonal architecture of tumor. To effectively address these issues, we introduce a new computational method called SCClone for reasoning subclones from single nucleotide variation (SNV) data of single cells. Specifically, SCClone leverages a probability mixture model for binary data to cluster single cells into distinct subclones. To accurately decipher underlying clonal composition, a novel model selection scheme based on inter-cluster variance is employed to find the optimal number of subclones. Extensive evaluations on various simulated datasets suggest SCClone has strong robustness against different technical noises in scDNA-seq data and achieves better performance than the state-of-the-art methods in reasoning clonal composition. Further evaluations of SCClone on three real scDNA-seq datasets show that it can effectively find the underlying subclones from severely disturbed data. The SCClone software is freely available at https://github.com/qasimyu/scclone. Frontiers Media S.A. 2022-01-27 /pmc/articles/PMC8830741/ /pubmed/35154282 http://dx.doi.org/10.3389/fgene.2022.823941 Text en Copyright © 2022 Yu , Du and Song. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Yu , Zhenhua Du, Fang Song, Lijuan SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data |
title | SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data |
title_full | SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data |
title_fullStr | SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data |
title_full_unstemmed | SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data |
title_short | SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data |
title_sort | scclone: accurate clustering of tumor single-cell dna sequencing data |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8830741/ https://www.ncbi.nlm.nih.gov/pubmed/35154282 http://dx.doi.org/10.3389/fgene.2022.823941 |
work_keys_str_mv | AT yuzhenhua sccloneaccurateclusteringoftumorsinglecelldnasequencingdata AT dufang sccloneaccurateclusteringoftumorsinglecelldnasequencingdata AT songlijuan sccloneaccurateclusteringoftumorsinglecelldnasequencingdata |