Cargando…

Benchmarking pipelines for subclonal deconvolution of bulk tumour sequencing data

Intratumour heterogeneity provides tumours with the ability to adapt and acquire treatment resistance. The development of more effective and personalised treatments for cancers, therefore, requires accurate characterisation of the clonal architecture of tumours, enabling evolutionary dynamics to be...

Descripción completa

Detalles Bibliográficos
Autores principales: Tanner, Georgette, Westhead, David R., Droop, Alastair, Stead, Lucy F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8569188/
https://www.ncbi.nlm.nih.gov/pubmed/34737285
http://dx.doi.org/10.1038/s41467-021-26698-7
_version_ 1784594598891356160
author Tanner, Georgette
Westhead, David R.
Droop, Alastair
Stead, Lucy F.
author_facet Tanner, Georgette
Westhead, David R.
Droop, Alastair
Stead, Lucy F.
author_sort Tanner, Georgette
collection PubMed
description Intratumour heterogeneity provides tumours with the ability to adapt and acquire treatment resistance. The development of more effective and personalised treatments for cancers, therefore, requires accurate characterisation of the clonal architecture of tumours, enabling evolutionary dynamics to be tracked. Many methods exist for achieving this from bulk tumour sequencing data, involving identifying mutations and performing subclonal deconvolution, but there is a lack of systematic benchmarking to inform researchers on which are most accurate, and how dataset characteristics impact performance. To address this, we use the most comprehensive tumour genome simulation tool available for such purposes to create 80 bulk tumour whole exome sequencing datasets of differing depths, tumour complexities, and purities, and use these to benchmark subclonal deconvolution pipelines. We conclude that i) tumour complexity does not impact accuracy, ii) increasing either purity or purity-corrected sequencing depth improves accuracy, and iii) the optimal pipeline consists of Mutect2, FACETS and PyClone-VI. We have made our benchmarking datasets publicly available for future use.
format Online
Article
Text
id pubmed-8569188
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-85691882021-11-15 Benchmarking pipelines for subclonal deconvolution of bulk tumour sequencing data Tanner, Georgette Westhead, David R. Droop, Alastair Stead, Lucy F. Nat Commun Article Intratumour heterogeneity provides tumours with the ability to adapt and acquire treatment resistance. The development of more effective and personalised treatments for cancers, therefore, requires accurate characterisation of the clonal architecture of tumours, enabling evolutionary dynamics to be tracked. Many methods exist for achieving this from bulk tumour sequencing data, involving identifying mutations and performing subclonal deconvolution, but there is a lack of systematic benchmarking to inform researchers on which are most accurate, and how dataset characteristics impact performance. To address this, we use the most comprehensive tumour genome simulation tool available for such purposes to create 80 bulk tumour whole exome sequencing datasets of differing depths, tumour complexities, and purities, and use these to benchmark subclonal deconvolution pipelines. We conclude that i) tumour complexity does not impact accuracy, ii) increasing either purity or purity-corrected sequencing depth improves accuracy, and iii) the optimal pipeline consists of Mutect2, FACETS and PyClone-VI. We have made our benchmarking datasets publicly available for future use. Nature Publishing Group UK 2021-11-04 /pmc/articles/PMC8569188/ /pubmed/34737285 http://dx.doi.org/10.1038/s41467-021-26698-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Tanner, Georgette
Westhead, David R.
Droop, Alastair
Stead, Lucy F.
Benchmarking pipelines for subclonal deconvolution of bulk tumour sequencing data
title Benchmarking pipelines for subclonal deconvolution of bulk tumour sequencing data
title_full Benchmarking pipelines for subclonal deconvolution of bulk tumour sequencing data
title_fullStr Benchmarking pipelines for subclonal deconvolution of bulk tumour sequencing data
title_full_unstemmed Benchmarking pipelines for subclonal deconvolution of bulk tumour sequencing data
title_short Benchmarking pipelines for subclonal deconvolution of bulk tumour sequencing data
title_sort benchmarking pipelines for subclonal deconvolution of bulk tumour sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8569188/
https://www.ncbi.nlm.nih.gov/pubmed/34737285
http://dx.doi.org/10.1038/s41467-021-26698-7
work_keys_str_mv AT tannergeorgette benchmarkingpipelinesforsubclonaldeconvolutionofbulktumoursequencingdata
AT westheaddavidr benchmarkingpipelinesforsubclonaldeconvolutionofbulktumoursequencingdata
AT droopalastair benchmarkingpipelinesforsubclonaldeconvolutionofbulktumoursequencingdata
AT steadlucyf benchmarkingpipelinesforsubclonaldeconvolutionofbulktumoursequencingdata