Cargando…
Reconstruction of clonal trees and tumor composition from multi-sample sequencing data
Motivation: DNA sequencing of multiple samples from the same tumor provides data to analyze the process of clonal evolution in the population of cells that give rise to a tumor. Results: We formalize the problem of reconstructing the clonal evolution of a tumor using single-nucleotide mutations as t...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542783/ https://www.ncbi.nlm.nih.gov/pubmed/26072510 http://dx.doi.org/10.1093/bioinformatics/btv261 |
_version_ | 1782386562275737600 |
---|---|
author | El-Kebir, Mohammed Oesper, Layla Acheson-Field, Hannah Raphael, Benjamin J. |
author_facet | El-Kebir, Mohammed Oesper, Layla Acheson-Field, Hannah Raphael, Benjamin J. |
author_sort | El-Kebir, Mohammed |
collection | PubMed |
description | Motivation: DNA sequencing of multiple samples from the same tumor provides data to analyze the process of clonal evolution in the population of cells that give rise to a tumor. Results: We formalize the problem of reconstructing the clonal evolution of a tumor using single-nucleotide mutations as the variant allele frequency (VAF) factorization problem. We derive a combinatorial characterization of the solutions to this problem and show that the problem is NP-complete. We derive an integer linear programming solution to the VAF factorization problem in the case of error-free data and extend this solution to real data with a probabilistic model for errors. The resulting AncesTree algorithm is better able to identify ancestral relationships between individual mutations than existing approaches, particularly in ultra-deep sequencing data when high read counts for mutations yield high confidence VAFs. Availability and implementation: An implementation of AncesTree is available at: http://compbio.cs.brown.edu/software. Contact: braphael@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4542783 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-45427832015-08-25 Reconstruction of clonal trees and tumor composition from multi-sample sequencing data El-Kebir, Mohammed Oesper, Layla Acheson-Field, Hannah Raphael, Benjamin J. Bioinformatics Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland Motivation: DNA sequencing of multiple samples from the same tumor provides data to analyze the process of clonal evolution in the population of cells that give rise to a tumor. Results: We formalize the problem of reconstructing the clonal evolution of a tumor using single-nucleotide mutations as the variant allele frequency (VAF) factorization problem. We derive a combinatorial characterization of the solutions to this problem and show that the problem is NP-complete. We derive an integer linear programming solution to the VAF factorization problem in the case of error-free data and extend this solution to real data with a probabilistic model for errors. The resulting AncesTree algorithm is better able to identify ancestral relationships between individual mutations than existing approaches, particularly in ultra-deep sequencing data when high read counts for mutations yield high confidence VAFs. Availability and implementation: An implementation of AncesTree is available at: http://compbio.cs.brown.edu/software. Contact: braphael@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-06-15 2015-06-10 /pmc/articles/PMC4542783/ /pubmed/26072510 http://dx.doi.org/10.1093/bioinformatics/btv261 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License(http://creativecommons.org/licenses/by-nc/3.0/),which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland El-Kebir, Mohammed Oesper, Layla Acheson-Field, Hannah Raphael, Benjamin J. Reconstruction of clonal trees and tumor composition from multi-sample sequencing data |
title | Reconstruction of clonal trees and tumor composition from multi-sample sequencing data |
title_full | Reconstruction of clonal trees and tumor composition from multi-sample sequencing data |
title_fullStr | Reconstruction of clonal trees and tumor composition from multi-sample sequencing data |
title_full_unstemmed | Reconstruction of clonal trees and tumor composition from multi-sample sequencing data |
title_short | Reconstruction of clonal trees and tumor composition from multi-sample sequencing data |
title_sort | reconstruction of clonal trees and tumor composition from multi-sample sequencing data |
topic | Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542783/ https://www.ncbi.nlm.nih.gov/pubmed/26072510 http://dx.doi.org/10.1093/bioinformatics/btv261 |
work_keys_str_mv | AT elkebirmohammed reconstructionofclonaltreesandtumorcompositionfrommultisamplesequencingdata AT oesperlayla reconstructionofclonaltreesandtumorcompositionfrommultisamplesequencingdata AT achesonfieldhannah reconstructionofclonaltreesandtumorcompositionfrommultisamplesequencingdata AT raphaelbenjaminj reconstructionofclonaltreesandtumorcompositionfrommultisamplesequencingdata |