Cargando…

Reconstruction of clonal trees and tumor composition from multi-sample sequencing data

Motivation: DNA sequencing of multiple samples from the same tumor provides data to analyze the process of clonal evolution in the population of cells that give rise to a tumor. Results: We formalize the problem of reconstructing the clonal evolution of a tumor using single-nucleotide mutations as t...

Descripción completa

Detalles Bibliográficos
Autores principales: El-Kebir, Mohammed, Oesper, Layla, Acheson-Field, Hannah, Raphael, Benjamin J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542783/
https://www.ncbi.nlm.nih.gov/pubmed/26072510
http://dx.doi.org/10.1093/bioinformatics/btv261
_version_ 1782386562275737600
author El-Kebir, Mohammed
Oesper, Layla
Acheson-Field, Hannah
Raphael, Benjamin J.
author_facet El-Kebir, Mohammed
Oesper, Layla
Acheson-Field, Hannah
Raphael, Benjamin J.
author_sort El-Kebir, Mohammed
collection PubMed
description Motivation: DNA sequencing of multiple samples from the same tumor provides data to analyze the process of clonal evolution in the population of cells that give rise to a tumor. Results: We formalize the problem of reconstructing the clonal evolution of a tumor using single-nucleotide mutations as the variant allele frequency (VAF) factorization problem. We derive a combinatorial characterization of the solutions to this problem and show that the problem is NP-complete. We derive an integer linear programming solution to the VAF factorization problem in the case of error-free data and extend this solution to real data with a probabilistic model for errors. The resulting AncesTree algorithm is better able to identify ancestral relationships between individual mutations than existing approaches, particularly in ultra-deep sequencing data when high read counts for mutations yield high confidence VAFs. Availability and implementation: An implementation of AncesTree is available at: http://compbio.cs.brown.edu/software. Contact: braphael@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4542783
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-45427832015-08-25 Reconstruction of clonal trees and tumor composition from multi-sample sequencing data El-Kebir, Mohammed Oesper, Layla Acheson-Field, Hannah Raphael, Benjamin J. Bioinformatics Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland Motivation: DNA sequencing of multiple samples from the same tumor provides data to analyze the process of clonal evolution in the population of cells that give rise to a tumor. Results: We formalize the problem of reconstructing the clonal evolution of a tumor using single-nucleotide mutations as the variant allele frequency (VAF) factorization problem. We derive a combinatorial characterization of the solutions to this problem and show that the problem is NP-complete. We derive an integer linear programming solution to the VAF factorization problem in the case of error-free data and extend this solution to real data with a probabilistic model for errors. The resulting AncesTree algorithm is better able to identify ancestral relationships between individual mutations than existing approaches, particularly in ultra-deep sequencing data when high read counts for mutations yield high confidence VAFs. Availability and implementation: An implementation of AncesTree is available at: http://compbio.cs.brown.edu/software. Contact: braphael@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-06-15 2015-06-10 /pmc/articles/PMC4542783/ /pubmed/26072510 http://dx.doi.org/10.1093/bioinformatics/btv261 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License(http://creativecommons.org/licenses/by-nc/3.0/),which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland
El-Kebir, Mohammed
Oesper, Layla
Acheson-Field, Hannah
Raphael, Benjamin J.
Reconstruction of clonal trees and tumor composition from multi-sample sequencing data
title Reconstruction of clonal trees and tumor composition from multi-sample sequencing data
title_full Reconstruction of clonal trees and tumor composition from multi-sample sequencing data
title_fullStr Reconstruction of clonal trees and tumor composition from multi-sample sequencing data
title_full_unstemmed Reconstruction of clonal trees and tumor composition from multi-sample sequencing data
title_short Reconstruction of clonal trees and tumor composition from multi-sample sequencing data
title_sort reconstruction of clonal trees and tumor composition from multi-sample sequencing data
topic Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542783/
https://www.ncbi.nlm.nih.gov/pubmed/26072510
http://dx.doi.org/10.1093/bioinformatics/btv261
work_keys_str_mv AT elkebirmohammed reconstructionofclonaltreesandtumorcompositionfrommultisamplesequencingdata
AT oesperlayla reconstructionofclonaltreesandtumorcompositionfrommultisamplesequencingdata
AT achesonfieldhannah reconstructionofclonaltreesandtumorcompositionfrommultisamplesequencingdata
AT raphaelbenjaminj reconstructionofclonaltreesandtumorcompositionfrommultisamplesequencingdata