Cargando…

Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data

Tumors harbor extensive genetic heterogeneity in the form of distinct clone genotypes that arise over time and across different tissues and regions in cancer. Many computational methods produce clone phylogenies from population bulk sequencing data collected from multiple tumor samples from a patien...

Descripción completa

Detalles Bibliográficos
Autores principales: Miura, Sayaka, Vu, Tracy, Deng, Jiamin, Buturla, Tiffany, Oladeinde, Olumide, Choi, Jiyeong, Kumar, Sudhir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7044161/
https://www.ncbi.nlm.nih.gov/pubmed/32103044
http://dx.doi.org/10.1038/s41598-020-59006-2
_version_ 1783501508106518528
author Miura, Sayaka
Vu, Tracy
Deng, Jiamin
Buturla, Tiffany
Oladeinde, Olumide
Choi, Jiyeong
Kumar, Sudhir
author_facet Miura, Sayaka
Vu, Tracy
Deng, Jiamin
Buturla, Tiffany
Oladeinde, Olumide
Choi, Jiyeong
Kumar, Sudhir
author_sort Miura, Sayaka
collection PubMed
description Tumors harbor extensive genetic heterogeneity in the form of distinct clone genotypes that arise over time and across different tissues and regions in cancer. Many computational methods produce clone phylogenies from population bulk sequencing data collected from multiple tumor samples from a patient. These clone phylogenies are used to infer mutation order and clone origins during tumor progression, rendering the selection of the appropriate clonal deconvolution method critical. Surprisingly, absolute and relative accuracies of these methods in correctly inferring clone phylogenies are yet to consistently assessed. Therefore, we evaluated the performance of seven computational methods. The accuracy of the reconstructed mutation order and inferred clone groupings varied extensively among methods. All the tested methods showed limited ability to identify ancestral clone sequences present in tumor samples correctly. The presence of copy number alterations, the occurrence of multiple seeding events among tumor sites during metastatic tumor evolution, and extensive intermixture of cancer cells among tumors hindered the detection of clones and the inference of clone phylogenies for all methods tested. Overall, CloneFinder, MACHINA, and LICHeE showed the highest overall accuracy, but none of the methods performed well for all simulated datasets. So, we present guidelines for selecting methods for data analysis.
format Online
Article
Text
id pubmed-7044161
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-70441612020-03-03 Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data Miura, Sayaka Vu, Tracy Deng, Jiamin Buturla, Tiffany Oladeinde, Olumide Choi, Jiyeong Kumar, Sudhir Sci Rep Article Tumors harbor extensive genetic heterogeneity in the form of distinct clone genotypes that arise over time and across different tissues and regions in cancer. Many computational methods produce clone phylogenies from population bulk sequencing data collected from multiple tumor samples from a patient. These clone phylogenies are used to infer mutation order and clone origins during tumor progression, rendering the selection of the appropriate clonal deconvolution method critical. Surprisingly, absolute and relative accuracies of these methods in correctly inferring clone phylogenies are yet to consistently assessed. Therefore, we evaluated the performance of seven computational methods. The accuracy of the reconstructed mutation order and inferred clone groupings varied extensively among methods. All the tested methods showed limited ability to identify ancestral clone sequences present in tumor samples correctly. The presence of copy number alterations, the occurrence of multiple seeding events among tumor sites during metastatic tumor evolution, and extensive intermixture of cancer cells among tumors hindered the detection of clones and the inference of clone phylogenies for all methods tested. Overall, CloneFinder, MACHINA, and LICHeE showed the highest overall accuracy, but none of the methods performed well for all simulated datasets. So, we present guidelines for selecting methods for data analysis. Nature Publishing Group UK 2020-02-26 /pmc/articles/PMC7044161/ /pubmed/32103044 http://dx.doi.org/10.1038/s41598-020-59006-2 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Miura, Sayaka
Vu, Tracy
Deng, Jiamin
Buturla, Tiffany
Oladeinde, Olumide
Choi, Jiyeong
Kumar, Sudhir
Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data
title Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data
title_full Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data
title_fullStr Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data
title_full_unstemmed Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data
title_short Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data
title_sort power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7044161/
https://www.ncbi.nlm.nih.gov/pubmed/32103044
http://dx.doi.org/10.1038/s41598-020-59006-2
work_keys_str_mv AT miurasayaka powerandpitfallsofcomputationalmethodsforinferringclonephylogeniesandmutationordersfrombulksequencingdata
AT vutracy powerandpitfallsofcomputationalmethodsforinferringclonephylogeniesandmutationordersfrombulksequencingdata
AT dengjiamin powerandpitfallsofcomputationalmethodsforinferringclonephylogeniesandmutationordersfrombulksequencingdata
AT buturlatiffany powerandpitfallsofcomputationalmethodsforinferringclonephylogeniesandmutationordersfrombulksequencingdata
AT oladeindeolumide powerandpitfallsofcomputationalmethodsforinferringclonephylogeniesandmutationordersfrombulksequencingdata
AT choijiyeong powerandpitfallsofcomputationalmethodsforinferringclonephylogeniesandmutationordersfrombulksequencingdata
AT kumarsudhir powerandpitfallsofcomputationalmethodsforinferringclonephylogeniesandmutationordersfrombulksequencingdata