Cargando…

Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution

Single-cell sequencing provides a new way to explore the evolutionary history of cells. Compared to traditional bulk sequencing, where a population of heterogeneous cells is pooled to form a single observation, single-cell sequencing isolates and amplifies genetic material from individual cells, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Kylie, Moravec, Jiří C, Gavryushkin, Alex, Welch, David, Drummond, Alexei J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9356729/
https://www.ncbi.nlm.nih.gov/pubmed/35733333
http://dx.doi.org/10.1093/molbev/msac143
_version_ 1784763581532733440
author Chen, Kylie
Moravec, Jiří C
Gavryushkin, Alex
Welch, David
Drummond, Alexei J
author_facet Chen, Kylie
Moravec, Jiří C
Gavryushkin, Alex
Welch, David
Drummond, Alexei J
author_sort Chen, Kylie
collection PubMed
description Single-cell sequencing provides a new way to explore the evolutionary history of cells. Compared to traditional bulk sequencing, where a population of heterogeneous cells is pooled to form a single observation, single-cell sequencing isolates and amplifies genetic material from individual cells, thereby preserving the information about the origin of the sequences. However, single-cell data are more error-prone than bulk sequencing data due to the limited genomic material available per cell. Here, we present error and mutation models for evolutionary inference of single-cell data within a mature and extensible Bayesian framework, BEAST2. Our framework enables integration with biologically informative models such as relaxed molecular clocks and population dynamic models. Our simulations show that modeling errors increase the accuracy of relative divergence times and substitution parameters. We reconstruct the phylogenetic history of a colorectal cancer patient and a healthy patient from single-cell DNA sequencing data. We find that the estimated times of terminal splitting events are shifted forward in time compared to models which ignore errors. We observed that not accounting for errors can overestimate the phylogenetic diversity in single-cell DNA sequencing data. We estimate that 30–50% of the apparent diversity can be attributed to error. Our work enables a full Bayesian approach capable of accounting for errors in the data within the integrative Bayesian software framework BEAST2.
format Online
Article
Text
id pubmed-9356729
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93567292022-08-09 Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution Chen, Kylie Moravec, Jiří C Gavryushkin, Alex Welch, David Drummond, Alexei J Mol Biol Evol Methods Single-cell sequencing provides a new way to explore the evolutionary history of cells. Compared to traditional bulk sequencing, where a population of heterogeneous cells is pooled to form a single observation, single-cell sequencing isolates and amplifies genetic material from individual cells, thereby preserving the information about the origin of the sequences. However, single-cell data are more error-prone than bulk sequencing data due to the limited genomic material available per cell. Here, we present error and mutation models for evolutionary inference of single-cell data within a mature and extensible Bayesian framework, BEAST2. Our framework enables integration with biologically informative models such as relaxed molecular clocks and population dynamic models. Our simulations show that modeling errors increase the accuracy of relative divergence times and substitution parameters. We reconstruct the phylogenetic history of a colorectal cancer patient and a healthy patient from single-cell DNA sequencing data. We find that the estimated times of terminal splitting events are shifted forward in time compared to models which ignore errors. We observed that not accounting for errors can overestimate the phylogenetic diversity in single-cell DNA sequencing data. We estimate that 30–50% of the apparent diversity can be attributed to error. Our work enables a full Bayesian approach capable of accounting for errors in the data within the integrative Bayesian software framework BEAST2. Oxford University Press 2022-06-23 /pmc/articles/PMC9356729/ /pubmed/35733333 http://dx.doi.org/10.1093/molbev/msac143 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Chen, Kylie
Moravec, Jiří C
Gavryushkin, Alex
Welch, David
Drummond, Alexei J
Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution
title Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution
title_full Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution
title_fullStr Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution
title_full_unstemmed Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution
title_short Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution
title_sort accounting for errors in data improves divergence time estimates in single-cell cancer evolution
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9356729/
https://www.ncbi.nlm.nih.gov/pubmed/35733333
http://dx.doi.org/10.1093/molbev/msac143
work_keys_str_mv AT chenkylie accountingforerrorsindataimprovesdivergencetimeestimatesinsinglecellcancerevolution
AT moravecjiric accountingforerrorsindataimprovesdivergencetimeestimatesinsinglecellcancerevolution
AT gavryushkinalex accountingforerrorsindataimprovesdivergencetimeestimatesinsinglecellcancerevolution
AT welchdavid accountingforerrorsindataimprovesdivergencetimeestimatesinsinglecellcancerevolution
AT drummondalexeij accountingforerrorsindataimprovesdivergencetimeestimatesinsinglecellcancerevolution