Cargando…
SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data
Accumulation and selection of somatic mutations in a Darwinian framework result in intra-tumor heterogeneity (ITH) that poses significant challenges to the diagnosis and clinical therapy of cancer. Identification of the tumor cell populations (clones) and reconstruction of their evolutionary relatio...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6836738/ https://www.ncbi.nlm.nih.gov/pubmed/31628257 http://dx.doi.org/10.1101/gr.243121.118 |
_version_ | 1783466961753079808 |
---|---|
author | Zafar, Hamim Navin, Nicholas Chen, Ken Nakhleh, Luay |
author_facet | Zafar, Hamim Navin, Nicholas Chen, Ken Nakhleh, Luay |
author_sort | Zafar, Hamim |
collection | PubMed |
description | Accumulation and selection of somatic mutations in a Darwinian framework result in intra-tumor heterogeneity (ITH) that poses significant challenges to the diagnosis and clinical therapy of cancer. Identification of the tumor cell populations (clones) and reconstruction of their evolutionary relationship can elucidate this heterogeneity. Recently developed single-cell DNA sequencing (SCS) technologies promise to resolve ITH to a single-cell level. However, technical errors in SCS data sets, including false-positives (FP) and false-negatives (FN) due to allelic dropout, and cell doublets, significantly complicate these tasks. Here, we propose a nonparametric Bayesian method that reconstructs the clonal populations as clusters of single cells, genotypes of each clone, and the evolutionary relationship between the clones. It employs a tree-structured Chinese restaurant process as the prior on the number and composition of clonal populations. The evolution of the clonal populations is modeled by a clonal phylogeny and a finite-site model of evolution to account for potential mutation recurrence and losses. We probabilistically account for FP and FN errors, and cell doublets are modeled by employing a Beta-binomial distribution. We develop a Gibbs sampling algorithm comprising partial reversible-jump and partial Metropolis-Hastings updates to explore the joint posterior space of all parameters. The performance of our method on synthetic and experimental data sets suggests that joint reconstruction of tumor clones and clonal phylogeny under a finite-site model of evolution leads to more accurate inferences. Our method is the first to enable this joint reconstruction in a fully Bayesian framework, thus providing measures of support of the inferences it makes. |
format | Online Article Text |
id | pubmed-6836738 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-68367382020-05-01 SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data Zafar, Hamim Navin, Nicholas Chen, Ken Nakhleh, Luay Genome Res Method Accumulation and selection of somatic mutations in a Darwinian framework result in intra-tumor heterogeneity (ITH) that poses significant challenges to the diagnosis and clinical therapy of cancer. Identification of the tumor cell populations (clones) and reconstruction of their evolutionary relationship can elucidate this heterogeneity. Recently developed single-cell DNA sequencing (SCS) technologies promise to resolve ITH to a single-cell level. However, technical errors in SCS data sets, including false-positives (FP) and false-negatives (FN) due to allelic dropout, and cell doublets, significantly complicate these tasks. Here, we propose a nonparametric Bayesian method that reconstructs the clonal populations as clusters of single cells, genotypes of each clone, and the evolutionary relationship between the clones. It employs a tree-structured Chinese restaurant process as the prior on the number and composition of clonal populations. The evolution of the clonal populations is modeled by a clonal phylogeny and a finite-site model of evolution to account for potential mutation recurrence and losses. We probabilistically account for FP and FN errors, and cell doublets are modeled by employing a Beta-binomial distribution. We develop a Gibbs sampling algorithm comprising partial reversible-jump and partial Metropolis-Hastings updates to explore the joint posterior space of all parameters. The performance of our method on synthetic and experimental data sets suggests that joint reconstruction of tumor clones and clonal phylogeny under a finite-site model of evolution leads to more accurate inferences. Our method is the first to enable this joint reconstruction in a fully Bayesian framework, thus providing measures of support of the inferences it makes. Cold Spring Harbor Laboratory Press 2019-11 /pmc/articles/PMC6836738/ /pubmed/31628257 http://dx.doi.org/10.1101/gr.243121.118 Text en © 2019 Zafar et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Method Zafar, Hamim Navin, Nicholas Chen, Ken Nakhleh, Luay SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data |
title | SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data |
title_full | SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data |
title_fullStr | SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data |
title_full_unstemmed | SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data |
title_short | SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data |
title_sort | siclonefit: bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6836738/ https://www.ncbi.nlm.nih.gov/pubmed/31628257 http://dx.doi.org/10.1101/gr.243121.118 |
work_keys_str_mv | AT zafarhamim siclonefitbayesianinferenceofpopulationstructuregenotypeandphylogenyoftumorclonesfromsinglecellgenomesequencingdata AT navinnicholas siclonefitbayesianinferenceofpopulationstructuregenotypeandphylogenyoftumorclonesfromsinglecellgenomesequencingdata AT chenken siclonefitbayesianinferenceofpopulationstructuregenotypeandphylogenyoftumorclonesfromsinglecellgenomesequencingdata AT nakhlehluay siclonefitbayesianinferenceofpopulationstructuregenotypeandphylogenyoftumorclonesfromsinglecellgenomesequencingdata |