Cargando…
A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data
MOTIVATION: Tumours are composed of distinct cancer cell populations (clones), which continuously adapt to their local micro-environment. Standard methods for clonal deconvolution seek to identify groups of mutations and estimate the prevalence of each group in the tumour, while considering its puri...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8055230/ https://www.ncbi.nlm.nih.gov/pubmed/32722772 http://dx.doi.org/10.1093/bioinformatics/btaa672 |
_version_ | 1783680413130031104 |
---|---|
author | Vavoulis, Dimitrios V Cutts, Anthony Taylor, Jenny C Schuh, Anna |
author_facet | Vavoulis, Dimitrios V Cutts, Anthony Taylor, Jenny C Schuh, Anna |
author_sort | Vavoulis, Dimitrios V |
collection | PubMed |
description | MOTIVATION: Tumours are composed of distinct cancer cell populations (clones), which continuously adapt to their local micro-environment. Standard methods for clonal deconvolution seek to identify groups of mutations and estimate the prevalence of each group in the tumour, while considering its purity and copy number profile. These methods have been applied on cross-sectional data and on longitudinal data after discarding information on the timing of sample collection. Two key questions are how can we incorporate such information in our analyses and is there any benefit in doing so? RESULTS: We developed a clonal deconvolution method, which incorporates explicitly the temporal spacing of longitudinally sampled tumours. By merging a Dirichlet Process Mixture Model with Gaussian Process priors and using as input a sequence of several sparsely collected samples, our method can reconstruct the temporal profile of the abundance of any mutation cluster supported by the data as a continuous function of time. We benchmarked our method on whole genome, whole exome and targeted sequencing data from patients with chronic lymphocytic leukaemia, on liquid biopsy data from a patient with melanoma and on synthetic data and we found that incorporating information on the timing of tissue collection improves model performance, as long as data of sufficient volume and complexity are available for estimating free model parameters. Thus, our approach is particularly useful when collecting a relatively long sequence of tumour samples is feasible, as in liquid cancers (e.g. leukaemia) and liquid biopsies. AVAILABILITY AND IMPLEMENTATION: The statistical methodology presented in this paper is freely available at github.com/dvav/clonosGP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8055230 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-80552302021-04-28 A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data Vavoulis, Dimitrios V Cutts, Anthony Taylor, Jenny C Schuh, Anna Bioinformatics Original Papers MOTIVATION: Tumours are composed of distinct cancer cell populations (clones), which continuously adapt to their local micro-environment. Standard methods for clonal deconvolution seek to identify groups of mutations and estimate the prevalence of each group in the tumour, while considering its purity and copy number profile. These methods have been applied on cross-sectional data and on longitudinal data after discarding information on the timing of sample collection. Two key questions are how can we incorporate such information in our analyses and is there any benefit in doing so? RESULTS: We developed a clonal deconvolution method, which incorporates explicitly the temporal spacing of longitudinally sampled tumours. By merging a Dirichlet Process Mixture Model with Gaussian Process priors and using as input a sequence of several sparsely collected samples, our method can reconstruct the temporal profile of the abundance of any mutation cluster supported by the data as a continuous function of time. We benchmarked our method on whole genome, whole exome and targeted sequencing data from patients with chronic lymphocytic leukaemia, on liquid biopsy data from a patient with melanoma and on synthetic data and we found that incorporating information on the timing of tissue collection improves model performance, as long as data of sufficient volume and complexity are available for estimating free model parameters. Thus, our approach is particularly useful when collecting a relatively long sequence of tumour samples is feasible, as in liquid cancers (e.g. leukaemia) and liquid biopsies. AVAILABILITY AND IMPLEMENTATION: The statistical methodology presented in this paper is freely available at github.com/dvav/clonosGP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07-28 /pmc/articles/PMC8055230/ /pubmed/32722772 http://dx.doi.org/10.1093/bioinformatics/btaa672 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Vavoulis, Dimitrios V Cutts, Anthony Taylor, Jenny C Schuh, Anna A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data |
title | A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data |
title_full | A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data |
title_fullStr | A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data |
title_full_unstemmed | A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data |
title_short | A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data |
title_sort | statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8055230/ https://www.ncbi.nlm.nih.gov/pubmed/32722772 http://dx.doi.org/10.1093/bioinformatics/btaa672 |
work_keys_str_mv | AT vavoulisdimitriosv astatisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata AT cuttsanthony astatisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata AT taylorjennyc astatisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata AT schuhanna astatisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata AT vavoulisdimitriosv statisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata AT cuttsanthony statisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata AT taylorjennyc statisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata AT schuhanna statisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata |