Cargando…

A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data

MOTIVATION: Tumours are composed of distinct cancer cell populations (clones), which continuously adapt to their local micro-environment. Standard methods for clonal deconvolution seek to identify groups of mutations and estimate the prevalence of each group in the tumour, while considering its puri...

Descripción completa

Detalles Bibliográficos
Autores principales: Vavoulis, Dimitrios V, Cutts, Anthony, Taylor, Jenny C, Schuh, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8055230/
https://www.ncbi.nlm.nih.gov/pubmed/32722772
http://dx.doi.org/10.1093/bioinformatics/btaa672
_version_ 1783680413130031104
author Vavoulis, Dimitrios V
Cutts, Anthony
Taylor, Jenny C
Schuh, Anna
author_facet Vavoulis, Dimitrios V
Cutts, Anthony
Taylor, Jenny C
Schuh, Anna
author_sort Vavoulis, Dimitrios V
collection PubMed
description MOTIVATION: Tumours are composed of distinct cancer cell populations (clones), which continuously adapt to their local micro-environment. Standard methods for clonal deconvolution seek to identify groups of mutations and estimate the prevalence of each group in the tumour, while considering its purity and copy number profile. These methods have been applied on cross-sectional data and on longitudinal data after discarding information on the timing of sample collection. Two key questions are how can we incorporate such information in our analyses and is there any benefit in doing so? RESULTS: We developed a clonal deconvolution method, which incorporates explicitly the temporal spacing of longitudinally sampled tumours. By merging a Dirichlet Process Mixture Model with Gaussian Process priors and using as input a sequence of several sparsely collected samples, our method can reconstruct the temporal profile of the abundance of any mutation cluster supported by the data as a continuous function of time. We benchmarked our method on whole genome, whole exome and targeted sequencing data from patients with chronic lymphocytic leukaemia, on liquid biopsy data from a patient with melanoma and on synthetic data and we found that incorporating information on the timing of tissue collection improves model performance, as long as data of sufficient volume and complexity are available for estimating free model parameters. Thus, our approach is particularly useful when collecting a relatively long sequence of tumour samples is feasible, as in liquid cancers (e.g. leukaemia) and liquid biopsies. AVAILABILITY AND IMPLEMENTATION: The statistical methodology presented in this paper is freely available at github.com/dvav/clonosGP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8055230
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80552302021-04-28 A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data Vavoulis, Dimitrios V Cutts, Anthony Taylor, Jenny C Schuh, Anna Bioinformatics Original Papers MOTIVATION: Tumours are composed of distinct cancer cell populations (clones), which continuously adapt to their local micro-environment. Standard methods for clonal deconvolution seek to identify groups of mutations and estimate the prevalence of each group in the tumour, while considering its purity and copy number profile. These methods have been applied on cross-sectional data and on longitudinal data after discarding information on the timing of sample collection. Two key questions are how can we incorporate such information in our analyses and is there any benefit in doing so? RESULTS: We developed a clonal deconvolution method, which incorporates explicitly the temporal spacing of longitudinally sampled tumours. By merging a Dirichlet Process Mixture Model with Gaussian Process priors and using as input a sequence of several sparsely collected samples, our method can reconstruct the temporal profile of the abundance of any mutation cluster supported by the data as a continuous function of time. We benchmarked our method on whole genome, whole exome and targeted sequencing data from patients with chronic lymphocytic leukaemia, on liquid biopsy data from a patient with melanoma and on synthetic data and we found that incorporating information on the timing of tissue collection improves model performance, as long as data of sufficient volume and complexity are available for estimating free model parameters. Thus, our approach is particularly useful when collecting a relatively long sequence of tumour samples is feasible, as in liquid cancers (e.g. leukaemia) and liquid biopsies. AVAILABILITY AND IMPLEMENTATION: The statistical methodology presented in this paper is freely available at github.com/dvav/clonosGP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07-28 /pmc/articles/PMC8055230/ /pubmed/32722772 http://dx.doi.org/10.1093/bioinformatics/btaa672 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Vavoulis, Dimitrios V
Cutts, Anthony
Taylor, Jenny C
Schuh, Anna
A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data
title A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data
title_full A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data
title_fullStr A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data
title_full_unstemmed A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data
title_short A statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data
title_sort statistical approach for tracking clonal dynamics in cancer using longitudinal next-generation sequencing data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8055230/
https://www.ncbi.nlm.nih.gov/pubmed/32722772
http://dx.doi.org/10.1093/bioinformatics/btaa672
work_keys_str_mv AT vavoulisdimitriosv astatisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata
AT cuttsanthony astatisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata
AT taylorjennyc astatisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata
AT schuhanna astatisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata
AT vavoulisdimitriosv statisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata
AT cuttsanthony statisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata
AT taylorjennyc statisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata
AT schuhanna statisticalapproachfortrackingclonaldynamicsincancerusinglongitudinalnextgenerationsequencingdata