Cargando…
Parameter, noise, and tree topology effects in tumor phylogeny inference
BACKGROUND: Accurate inference of the evolutionary history of a tumor has important implications for understanding and potentially treating the disease. While a number of methods have been proposed to reconstruct the evolutionary history of a tumor from DNA sequencing data, it is not clear how aspec...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927103/ https://www.ncbi.nlm.nih.gov/pubmed/31865909 http://dx.doi.org/10.1186/s12920-019-0626-0 |
_version_ | 1783482239997181952 |
---|---|
author | Tomlinson, Kiran Oesper, Layla |
author_facet | Tomlinson, Kiran Oesper, Layla |
author_sort | Tomlinson, Kiran |
collection | PubMed |
description | BACKGROUND: Accurate inference of the evolutionary history of a tumor has important implications for understanding and potentially treating the disease. While a number of methods have been proposed to reconstruct the evolutionary history of a tumor from DNA sequencing data, it is not clear how aspects of the sequencing data and tumor itself affect these reconstructions. METHODS: We investigate when and how well these histories can be reconstructed from multi-sample bulk sequencing data when considering only single nucleotide variants (SNVs). Specifically, we examine the space of all possible tumor phylogenies under the infinite sites assumption (ISA) using several approaches for enumerating phylogenies consistent with the sequencing data. RESULTS: On noisy simulated data, we find that the ISA is often violated and that low coverage and high noise make it more difficult to identify phylogenies. Additionally, we find that evolutionary trees with branching topologies are easier to reconstruct accurately. We also apply our reconstruction methods to both chronic lymphocytic leukemia and clear cell renal cell carcinoma datasets and confirm that ISA violations are common in practice, especially in lower-coverage sequencing data. Nonetheless, we show that an ISA-based approach can be relaxed to produce high-quality phylogenies. CONCLUSIONS: Consideration of practical aspects of sequencing data such as coverage or the model of tumor evolution (branching, linear, etc.) is essential to effectively using the output of tumor phylogeny inference methods. Additionally, these factors should be considered in the development of new inference methods. |
format | Online Article Text |
id | pubmed-6927103 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69271032019-12-30 Parameter, noise, and tree topology effects in tumor phylogeny inference Tomlinson, Kiran Oesper, Layla BMC Med Genomics Research BACKGROUND: Accurate inference of the evolutionary history of a tumor has important implications for understanding and potentially treating the disease. While a number of methods have been proposed to reconstruct the evolutionary history of a tumor from DNA sequencing data, it is not clear how aspects of the sequencing data and tumor itself affect these reconstructions. METHODS: We investigate when and how well these histories can be reconstructed from multi-sample bulk sequencing data when considering only single nucleotide variants (SNVs). Specifically, we examine the space of all possible tumor phylogenies under the infinite sites assumption (ISA) using several approaches for enumerating phylogenies consistent with the sequencing data. RESULTS: On noisy simulated data, we find that the ISA is often violated and that low coverage and high noise make it more difficult to identify phylogenies. Additionally, we find that evolutionary trees with branching topologies are easier to reconstruct accurately. We also apply our reconstruction methods to both chronic lymphocytic leukemia and clear cell renal cell carcinoma datasets and confirm that ISA violations are common in practice, especially in lower-coverage sequencing data. Nonetheless, we show that an ISA-based approach can be relaxed to produce high-quality phylogenies. CONCLUSIONS: Consideration of practical aspects of sequencing data such as coverage or the model of tumor evolution (branching, linear, etc.) is essential to effectively using the output of tumor phylogeny inference methods. Additionally, these factors should be considered in the development of new inference methods. BioMed Central 2019-12-23 /pmc/articles/PMC6927103/ /pubmed/31865909 http://dx.doi.org/10.1186/s12920-019-0626-0 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Tomlinson, Kiran Oesper, Layla Parameter, noise, and tree topology effects in tumor phylogeny inference |
title | Parameter, noise, and tree topology effects in tumor phylogeny inference |
title_full | Parameter, noise, and tree topology effects in tumor phylogeny inference |
title_fullStr | Parameter, noise, and tree topology effects in tumor phylogeny inference |
title_full_unstemmed | Parameter, noise, and tree topology effects in tumor phylogeny inference |
title_short | Parameter, noise, and tree topology effects in tumor phylogeny inference |
title_sort | parameter, noise, and tree topology effects in tumor phylogeny inference |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927103/ https://www.ncbi.nlm.nih.gov/pubmed/31865909 http://dx.doi.org/10.1186/s12920-019-0626-0 |
work_keys_str_mv | AT tomlinsonkiran parameternoiseandtreetopologyeffectsintumorphylogenyinference AT oesperlayla parameternoiseandtreetopologyeffectsintumorphylogenyinference |