Cargando…

Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors

BACKGROUND: Tumors exhibit extensive intra-tumor heterogeneity, the presence of groups of cellular populations with distinct sets of somatic mutations. This heterogeneity is the result of an evolutionary process, described by a phylogenetic tree. In addition to enabling clinicians to devise patient-...

Descripción completa

Detalles Bibliográficos
Autores principales: Qi, Yuanyuan, Pradhan, Dikshant, El-Kebir, Mohammed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6719395/
https://www.ncbi.nlm.nih.gov/pubmed/31497065
http://dx.doi.org/10.1186/s13015-019-0155-6
_version_ 1783447924533886976
author Qi, Yuanyuan
Pradhan, Dikshant
El-Kebir, Mohammed
author_facet Qi, Yuanyuan
Pradhan, Dikshant
El-Kebir, Mohammed
author_sort Qi, Yuanyuan
collection PubMed
description BACKGROUND: Tumors exhibit extensive intra-tumor heterogeneity, the presence of groups of cellular populations with distinct sets of somatic mutations. This heterogeneity is the result of an evolutionary process, described by a phylogenetic tree. In addition to enabling clinicians to devise patient-specific treatment plans, phylogenetic trees of tumors enable researchers to decipher the mechanisms of tumorigenesis and metastasis. However, the problem of reconstructing a phylogenetic tree T given bulk sequencing data from a tumor is more complicated than the classic phylogeny inference problem. Rather than observing the leaves of T directly, we are given mutation frequencies that are the result of mixtures of the leaves of T. The majority of current tumor phylogeny inference methods employ the perfect phylogeny evolutionary model. The underlying Perfect Phylogeny Mixture (PPM) combinatorial problem typically has multiple solutions. RESULTS: We prove that determining the exact number of solutions to the PPM problem is #P-complete and hard to approximate within a constant factor. Moreover, we show that sampling solutions uniformly at random is hard as well. On the positive side, we provide a polynomial-time computable upper bound on the number of solutions and introduce a simple rejection-sampling based scheme that works well for small instances. Using simulated and real data, we identify factors that contribute to and counteract non-uniqueness of solutions. In addition, we study the sampling performance of current methods, identifying significant biases. CONCLUSIONS: Awareness of non-uniqueness of solutions to the PPM problem is key to drawing accurate conclusions in downstream analyses based on tumor phylogenies. This work provides the theoretical foundations for non-uniqueness of solutions in tumor phylogeny inference from bulk DNA samples.
format Online
Article
Text
id pubmed-6719395
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-67193952019-09-06 Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors Qi, Yuanyuan Pradhan, Dikshant El-Kebir, Mohammed Algorithms Mol Biol Research BACKGROUND: Tumors exhibit extensive intra-tumor heterogeneity, the presence of groups of cellular populations with distinct sets of somatic mutations. This heterogeneity is the result of an evolutionary process, described by a phylogenetic tree. In addition to enabling clinicians to devise patient-specific treatment plans, phylogenetic trees of tumors enable researchers to decipher the mechanisms of tumorigenesis and metastasis. However, the problem of reconstructing a phylogenetic tree T given bulk sequencing data from a tumor is more complicated than the classic phylogeny inference problem. Rather than observing the leaves of T directly, we are given mutation frequencies that are the result of mixtures of the leaves of T. The majority of current tumor phylogeny inference methods employ the perfect phylogeny evolutionary model. The underlying Perfect Phylogeny Mixture (PPM) combinatorial problem typically has multiple solutions. RESULTS: We prove that determining the exact number of solutions to the PPM problem is #P-complete and hard to approximate within a constant factor. Moreover, we show that sampling solutions uniformly at random is hard as well. On the positive side, we provide a polynomial-time computable upper bound on the number of solutions and introduce a simple rejection-sampling based scheme that works well for small instances. Using simulated and real data, we identify factors that contribute to and counteract non-uniqueness of solutions. In addition, we study the sampling performance of current methods, identifying significant biases. CONCLUSIONS: Awareness of non-uniqueness of solutions to the PPM problem is key to drawing accurate conclusions in downstream analyses based on tumor phylogenies. This work provides the theoretical foundations for non-uniqueness of solutions in tumor phylogeny inference from bulk DNA samples. BioMed Central 2019-09-03 /pmc/articles/PMC6719395/ /pubmed/31497065 http://dx.doi.org/10.1186/s13015-019-0155-6 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Qi, Yuanyuan
Pradhan, Dikshant
El-Kebir, Mohammed
Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors
title Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors
title_full Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors
title_fullStr Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors
title_full_unstemmed Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors
title_short Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors
title_sort implications of non-uniqueness in phylogenetic deconvolution of bulk dna samples of tumors
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6719395/
https://www.ncbi.nlm.nih.gov/pubmed/31497065
http://dx.doi.org/10.1186/s13015-019-0155-6
work_keys_str_mv AT qiyuanyuan implicationsofnonuniquenessinphylogeneticdeconvolutionofbulkdnasamplesoftumors
AT pradhandikshant implicationsofnonuniquenessinphylogeneticdeconvolutionofbulkdnasamplesoftumors
AT elkebirmohammed implicationsofnonuniquenessinphylogeneticdeconvolutionofbulkdnasamplesoftumors