Cargando…

SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data

Single-cell DNA sequencing (scDNA-seq) enables high-resolution profiling of genetic diversity among single cells and is especially useful for deciphering the intra-tumor heterogeneity and evolutionary history of tumor. Specific technical issues such as allele dropout, false-positive errors, and doub...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu , Zhenhua, Du, Fang, Song, Lijuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8830741/
https://www.ncbi.nlm.nih.gov/pubmed/35154282
http://dx.doi.org/10.3389/fgene.2022.823941
_version_ 1784648340010434560
author Yu , Zhenhua
Du, Fang
Song, Lijuan
author_facet Yu , Zhenhua
Du, Fang
Song, Lijuan
author_sort Yu , Zhenhua
collection PubMed
description Single-cell DNA sequencing (scDNA-seq) enables high-resolution profiling of genetic diversity among single cells and is especially useful for deciphering the intra-tumor heterogeneity and evolutionary history of tumor. Specific technical issues such as allele dropout, false-positive errors, and doublets make scDNA-seq data incomplete and error-prone, giving rise to a severe challenge of accurately inferring clonal architecture of tumor. To effectively address these issues, we introduce a new computational method called SCClone for reasoning subclones from single nucleotide variation (SNV) data of single cells. Specifically, SCClone leverages a probability mixture model for binary data to cluster single cells into distinct subclones. To accurately decipher underlying clonal composition, a novel model selection scheme based on inter-cluster variance is employed to find the optimal number of subclones. Extensive evaluations on various simulated datasets suggest SCClone has strong robustness against different technical noises in scDNA-seq data and achieves better performance than the state-of-the-art methods in reasoning clonal composition. Further evaluations of SCClone on three real scDNA-seq datasets show that it can effectively find the underlying subclones from severely disturbed data. The SCClone software is freely available at https://github.com/qasimyu/scclone.
format Online
Article
Text
id pubmed-8830741
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88307412022-02-11 SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data Yu , Zhenhua Du, Fang Song, Lijuan Front Genet Genetics Single-cell DNA sequencing (scDNA-seq) enables high-resolution profiling of genetic diversity among single cells and is especially useful for deciphering the intra-tumor heterogeneity and evolutionary history of tumor. Specific technical issues such as allele dropout, false-positive errors, and doublets make scDNA-seq data incomplete and error-prone, giving rise to a severe challenge of accurately inferring clonal architecture of tumor. To effectively address these issues, we introduce a new computational method called SCClone for reasoning subclones from single nucleotide variation (SNV) data of single cells. Specifically, SCClone leverages a probability mixture model for binary data to cluster single cells into distinct subclones. To accurately decipher underlying clonal composition, a novel model selection scheme based on inter-cluster variance is employed to find the optimal number of subclones. Extensive evaluations on various simulated datasets suggest SCClone has strong robustness against different technical noises in scDNA-seq data and achieves better performance than the state-of-the-art methods in reasoning clonal composition. Further evaluations of SCClone on three real scDNA-seq datasets show that it can effectively find the underlying subclones from severely disturbed data. The SCClone software is freely available at https://github.com/qasimyu/scclone. Frontiers Media S.A. 2022-01-27 /pmc/articles/PMC8830741/ /pubmed/35154282 http://dx.doi.org/10.3389/fgene.2022.823941 Text en Copyright © 2022 Yu , Du and Song. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Yu , Zhenhua
Du, Fang
Song, Lijuan
SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data
title SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data
title_full SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data
title_fullStr SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data
title_full_unstemmed SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data
title_short SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data
title_sort scclone: accurate clustering of tumor single-cell dna sequencing data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8830741/
https://www.ncbi.nlm.nih.gov/pubmed/35154282
http://dx.doi.org/10.3389/fgene.2022.823941
work_keys_str_mv AT yuzhenhua sccloneaccurateclusteringoftumorsinglecelldnasequencingdata
AT dufang sccloneaccurateclusteringoftumorsinglecelldnasequencingdata
AT songlijuan sccloneaccurateclusteringoftumorsinglecelldnasequencingdata