Cargando…

ConDoR: Tumor phylogeny inference with a copy-number constrained mutation loss model

Tumors consist of subpopulations of cells that harbor distinct collections of somatic mutations. These mutations range in scale from single nucleotide variants (SNVs) to large-scale copy-number aberrations (CNAs). While many approaches infer tumor phylogenies using SNVs as phylogenetic markers, CNAs...

Descripción completa

Detalles Bibliográficos
Autores principales: Sashittal, Palash, Zhang, Haochen, Iacobuzio-Donahue, Christine A., Raphael, Benjamin J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882003/
https://www.ncbi.nlm.nih.gov/pubmed/36711528
http://dx.doi.org/10.1101/2023.01.05.522408
_version_ 1784879223010230272
author Sashittal, Palash
Zhang, Haochen
Iacobuzio-Donahue, Christine A.
Raphael, Benjamin J.
author_facet Sashittal, Palash
Zhang, Haochen
Iacobuzio-Donahue, Christine A.
Raphael, Benjamin J.
author_sort Sashittal, Palash
collection PubMed
description Tumors consist of subpopulations of cells that harbor distinct collections of somatic mutations. These mutations range in scale from single nucleotide variants (SNVs) to large-scale copy-number aberrations (CNAs). While many approaches infer tumor phylogenies using SNVs as phylogenetic markers, CNAs that overlap SNVs may lead to erroneous phylogenetic inference. Specifically, an SNV may be lost in a cell due to a deletion of the genomic segment containing the SNV. Unfortunately, no current single-cell DNA sequencing (scDNA-seq) technology produces accurate measurements of both SNVs and CNAs. For instance, recent targeted scDNA-seq technologies, such as Mission Bio Tapestri, measure SNVs with high fidelity in individual cells, but yield much less reliable measurements of CNAs. We introduce a new evolutionary model, the constrained k-Dollo model, that uses SNVs as phylogenetic markers and partial information about CNAs in the form of clustering of cells with similar copy-number profiles. This copy-number clustering constrains where loss of SNVs can occur in the phylogeny. We develop ConDoR (Constrained Dollo Reconstruction), an algorithm to infer tumor phylogenies from targeted scDNA-seq data using the constrained k-Dollo model. We show that ConDoR outperforms existing methods on simulated data. We use ConDoR to analyze a new multi-region targeted scDNA-seq dataset of 2153 cells from a pancreatic ductal adenocarcinoma (PDAC) tumor and produce a more plausible phylogeny compared to existing methods that conforms to histological results for the tumor from a previous study. We also analyze a metastatic colorectal cancer dataset, deriving a more parsimonious phylogeny than previously published analyses and with a simpler monoclonal origin of metastasis compared to the original study. CODE AVAILABILITY: Software is available at https://github.com/raphael-group/constrained-Dollo
format Online
Article
Text
id pubmed-9882003
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-98820032023-01-28 ConDoR: Tumor phylogeny inference with a copy-number constrained mutation loss model Sashittal, Palash Zhang, Haochen Iacobuzio-Donahue, Christine A. Raphael, Benjamin J. bioRxiv Article Tumors consist of subpopulations of cells that harbor distinct collections of somatic mutations. These mutations range in scale from single nucleotide variants (SNVs) to large-scale copy-number aberrations (CNAs). While many approaches infer tumor phylogenies using SNVs as phylogenetic markers, CNAs that overlap SNVs may lead to erroneous phylogenetic inference. Specifically, an SNV may be lost in a cell due to a deletion of the genomic segment containing the SNV. Unfortunately, no current single-cell DNA sequencing (scDNA-seq) technology produces accurate measurements of both SNVs and CNAs. For instance, recent targeted scDNA-seq technologies, such as Mission Bio Tapestri, measure SNVs with high fidelity in individual cells, but yield much less reliable measurements of CNAs. We introduce a new evolutionary model, the constrained k-Dollo model, that uses SNVs as phylogenetic markers and partial information about CNAs in the form of clustering of cells with similar copy-number profiles. This copy-number clustering constrains where loss of SNVs can occur in the phylogeny. We develop ConDoR (Constrained Dollo Reconstruction), an algorithm to infer tumor phylogenies from targeted scDNA-seq data using the constrained k-Dollo model. We show that ConDoR outperforms existing methods on simulated data. We use ConDoR to analyze a new multi-region targeted scDNA-seq dataset of 2153 cells from a pancreatic ductal adenocarcinoma (PDAC) tumor and produce a more plausible phylogeny compared to existing methods that conforms to histological results for the tumor from a previous study. We also analyze a metastatic colorectal cancer dataset, deriving a more parsimonious phylogeny than previously published analyses and with a simpler monoclonal origin of metastasis compared to the original study. CODE AVAILABILITY: Software is available at https://github.com/raphael-group/constrained-Dollo Cold Spring Harbor Laboratory 2023-01-06 /pmc/articles/PMC9882003/ /pubmed/36711528 http://dx.doi.org/10.1101/2023.01.05.522408 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Sashittal, Palash
Zhang, Haochen
Iacobuzio-Donahue, Christine A.
Raphael, Benjamin J.
ConDoR: Tumor phylogeny inference with a copy-number constrained mutation loss model
title ConDoR: Tumor phylogeny inference with a copy-number constrained mutation loss model
title_full ConDoR: Tumor phylogeny inference with a copy-number constrained mutation loss model
title_fullStr ConDoR: Tumor phylogeny inference with a copy-number constrained mutation loss model
title_full_unstemmed ConDoR: Tumor phylogeny inference with a copy-number constrained mutation loss model
title_short ConDoR: Tumor phylogeny inference with a copy-number constrained mutation loss model
title_sort condor: tumor phylogeny inference with a copy-number constrained mutation loss model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882003/
https://www.ncbi.nlm.nih.gov/pubmed/36711528
http://dx.doi.org/10.1101/2023.01.05.522408
work_keys_str_mv AT sashittalpalash condortumorphylogenyinferencewithacopynumberconstrainedmutationlossmodel
AT zhanghaochen condortumorphylogenyinferencewithacopynumberconstrainedmutationlossmodel
AT iacobuziodonahuechristinea condortumorphylogenyinferencewithacopynumberconstrainedmutationlossmodel
AT raphaelbenjaminj condortumorphylogenyinferencewithacopynumberconstrainedmutationlossmodel