Cargando…
Theoretical guarantees for phylogeny inference from single-cell lineage tracing
Lineage-tracing technologies based on Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9 (CRISPR-Cas9) genome editing have emerged as a powerful tool for investigating development in single-cell contexts, but exact reconstruction of the underlying clonal relati...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041172/ https://www.ncbi.nlm.nih.gov/pubmed/36927151 http://dx.doi.org/10.1073/pnas.2203352120 |
_version_ | 1784912650659954688 |
---|---|
author | Wang, Robert Zhang, Richard Khodaverdian, Alex Yosef, Nir |
author_facet | Wang, Robert Zhang, Richard Khodaverdian, Alex Yosef, Nir |
author_sort | Wang, Robert |
collection | PubMed |
description | Lineage-tracing technologies based on Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9 (CRISPR-Cas9) genome editing have emerged as a powerful tool for investigating development in single-cell contexts, but exact reconstruction of the underlying clonal relationships in experiment is complicated by features of the data. These complications are functions of the experimental parameters in these systems, such as the Cas9 cutting rate, the diversity of indel outcomes, and the rate of missing data. In this paper, we develop two theoretically grounded algorithms for the reconstruction of the underlying single-cell phylogenetic tree as well as asymptotic bounds for the number of recording sites necessary for exact recapitulation of the ground truth phylogeny at high probability. In doing so, we explore the relationship between the problem difficulty and the experimental parameters, with implications for experimental design. Lastly, we provide simulations showing the empirical performance of these algorithms and showing that the trends in the asymptotic bounds hold empirically. Overall, this work provides a theoretical analysis of phylogenetic reconstruction in single-cell CRISPR-Cas9 lineage-tracing technologies. |
format | Online Article Text |
id | pubmed-10041172 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-100411722023-03-28 Theoretical guarantees for phylogeny inference from single-cell lineage tracing Wang, Robert Zhang, Richard Khodaverdian, Alex Yosef, Nir Proc Natl Acad Sci U S A Biological Sciences Lineage-tracing technologies based on Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9 (CRISPR-Cas9) genome editing have emerged as a powerful tool for investigating development in single-cell contexts, but exact reconstruction of the underlying clonal relationships in experiment is complicated by features of the data. These complications are functions of the experimental parameters in these systems, such as the Cas9 cutting rate, the diversity of indel outcomes, and the rate of missing data. In this paper, we develop two theoretically grounded algorithms for the reconstruction of the underlying single-cell phylogenetic tree as well as asymptotic bounds for the number of recording sites necessary for exact recapitulation of the ground truth phylogeny at high probability. In doing so, we explore the relationship between the problem difficulty and the experimental parameters, with implications for experimental design. Lastly, we provide simulations showing the empirical performance of these algorithms and showing that the trends in the asymptotic bounds hold empirically. Overall, this work provides a theoretical analysis of phylogenetic reconstruction in single-cell CRISPR-Cas9 lineage-tracing technologies. National Academy of Sciences 2023-03-16 2023-03-21 /pmc/articles/PMC10041172/ /pubmed/36927151 http://dx.doi.org/10.1073/pnas.2203352120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Biological Sciences Wang, Robert Zhang, Richard Khodaverdian, Alex Yosef, Nir Theoretical guarantees for phylogeny inference from single-cell lineage tracing |
title | Theoretical guarantees for phylogeny inference from single-cell lineage tracing |
title_full | Theoretical guarantees for phylogeny inference from single-cell lineage tracing |
title_fullStr | Theoretical guarantees for phylogeny inference from single-cell lineage tracing |
title_full_unstemmed | Theoretical guarantees for phylogeny inference from single-cell lineage tracing |
title_short | Theoretical guarantees for phylogeny inference from single-cell lineage tracing |
title_sort | theoretical guarantees for phylogeny inference from single-cell lineage tracing |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041172/ https://www.ncbi.nlm.nih.gov/pubmed/36927151 http://dx.doi.org/10.1073/pnas.2203352120 |
work_keys_str_mv | AT wangrobert theoreticalguaranteesforphylogenyinferencefromsinglecelllineagetracing AT zhangrichard theoreticalguaranteesforphylogenyinferencefromsinglecelllineagetracing AT khodaverdianalex theoreticalguaranteesforphylogenyinferencefromsinglecelllineagetracing AT yosefnir theoreticalguaranteesforphylogenyinferencefromsinglecelllineagetracing |