Cargando…

Parameter, noise, and tree topology effects in tumor phylogeny inference

BACKGROUND: Accurate inference of the evolutionary history of a tumor has important implications for understanding and potentially treating the disease. While a number of methods have been proposed to reconstruct the evolutionary history of a tumor from DNA sequencing data, it is not clear how aspec...

Descripción completa

Detalles Bibliográficos
Autores principales: Tomlinson, Kiran, Oesper, Layla
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927103/
https://www.ncbi.nlm.nih.gov/pubmed/31865909
http://dx.doi.org/10.1186/s12920-019-0626-0
_version_ 1783482239997181952
author Tomlinson, Kiran
Oesper, Layla
author_facet Tomlinson, Kiran
Oesper, Layla
author_sort Tomlinson, Kiran
collection PubMed
description BACKGROUND: Accurate inference of the evolutionary history of a tumor has important implications for understanding and potentially treating the disease. While a number of methods have been proposed to reconstruct the evolutionary history of a tumor from DNA sequencing data, it is not clear how aspects of the sequencing data and tumor itself affect these reconstructions. METHODS: We investigate when and how well these histories can be reconstructed from multi-sample bulk sequencing data when considering only single nucleotide variants (SNVs). Specifically, we examine the space of all possible tumor phylogenies under the infinite sites assumption (ISA) using several approaches for enumerating phylogenies consistent with the sequencing data. RESULTS: On noisy simulated data, we find that the ISA is often violated and that low coverage and high noise make it more difficult to identify phylogenies. Additionally, we find that evolutionary trees with branching topologies are easier to reconstruct accurately. We also apply our reconstruction methods to both chronic lymphocytic leukemia and clear cell renal cell carcinoma datasets and confirm that ISA violations are common in practice, especially in lower-coverage sequencing data. Nonetheless, we show that an ISA-based approach can be relaxed to produce high-quality phylogenies. CONCLUSIONS: Consideration of practical aspects of sequencing data such as coverage or the model of tumor evolution (branching, linear, etc.) is essential to effectively using the output of tumor phylogeny inference methods. Additionally, these factors should be considered in the development of new inference methods.
format Online
Article
Text
id pubmed-6927103
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69271032019-12-30 Parameter, noise, and tree topology effects in tumor phylogeny inference Tomlinson, Kiran Oesper, Layla BMC Med Genomics Research BACKGROUND: Accurate inference of the evolutionary history of a tumor has important implications for understanding and potentially treating the disease. While a number of methods have been proposed to reconstruct the evolutionary history of a tumor from DNA sequencing data, it is not clear how aspects of the sequencing data and tumor itself affect these reconstructions. METHODS: We investigate when and how well these histories can be reconstructed from multi-sample bulk sequencing data when considering only single nucleotide variants (SNVs). Specifically, we examine the space of all possible tumor phylogenies under the infinite sites assumption (ISA) using several approaches for enumerating phylogenies consistent with the sequencing data. RESULTS: On noisy simulated data, we find that the ISA is often violated and that low coverage and high noise make it more difficult to identify phylogenies. Additionally, we find that evolutionary trees with branching topologies are easier to reconstruct accurately. We also apply our reconstruction methods to both chronic lymphocytic leukemia and clear cell renal cell carcinoma datasets and confirm that ISA violations are common in practice, especially in lower-coverage sequencing data. Nonetheless, we show that an ISA-based approach can be relaxed to produce high-quality phylogenies. CONCLUSIONS: Consideration of practical aspects of sequencing data such as coverage or the model of tumor evolution (branching, linear, etc.) is essential to effectively using the output of tumor phylogeny inference methods. Additionally, these factors should be considered in the development of new inference methods. BioMed Central 2019-12-23 /pmc/articles/PMC6927103/ /pubmed/31865909 http://dx.doi.org/10.1186/s12920-019-0626-0 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Tomlinson, Kiran
Oesper, Layla
Parameter, noise, and tree topology effects in tumor phylogeny inference
title Parameter, noise, and tree topology effects in tumor phylogeny inference
title_full Parameter, noise, and tree topology effects in tumor phylogeny inference
title_fullStr Parameter, noise, and tree topology effects in tumor phylogeny inference
title_full_unstemmed Parameter, noise, and tree topology effects in tumor phylogeny inference
title_short Parameter, noise, and tree topology effects in tumor phylogeny inference
title_sort parameter, noise, and tree topology effects in tumor phylogeny inference
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927103/
https://www.ncbi.nlm.nih.gov/pubmed/31865909
http://dx.doi.org/10.1186/s12920-019-0626-0
work_keys_str_mv AT tomlinsonkiran parameternoiseandtreetopologyeffectsintumorphylogenyinference
AT oesperlayla parameternoiseandtreetopologyeffectsintumorphylogenyinference