Cargando…

Theoretical guarantees for phylogeny inference from single-cell lineage tracing

Lineage-tracing technologies based on Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9 (CRISPR-Cas9) genome editing have emerged as a powerful tool for investigating development in single-cell contexts, but exact reconstruction of the underlying clonal relati...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Robert, Zhang, Richard, Khodaverdian, Alex, Yosef, Nir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041172/
https://www.ncbi.nlm.nih.gov/pubmed/36927151
http://dx.doi.org/10.1073/pnas.2203352120
_version_ 1784912650659954688
author Wang, Robert
Zhang, Richard
Khodaverdian, Alex
Yosef, Nir
author_facet Wang, Robert
Zhang, Richard
Khodaverdian, Alex
Yosef, Nir
author_sort Wang, Robert
collection PubMed
description Lineage-tracing technologies based on Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9 (CRISPR-Cas9) genome editing have emerged as a powerful tool for investigating development in single-cell contexts, but exact reconstruction of the underlying clonal relationships in experiment is complicated by features of the data. These complications are functions of the experimental parameters in these systems, such as the Cas9 cutting rate, the diversity of indel outcomes, and the rate of missing data. In this paper, we develop two theoretically grounded algorithms for the reconstruction of the underlying single-cell phylogenetic tree as well as asymptotic bounds for the number of recording sites necessary for exact recapitulation of the ground truth phylogeny at high probability. In doing so, we explore the relationship between the problem difficulty and the experimental parameters, with implications for experimental design. Lastly, we provide simulations showing the empirical performance of these algorithms and showing that the trends in the asymptotic bounds hold empirically. Overall, this work provides a theoretical analysis of phylogenetic reconstruction in single-cell CRISPR-Cas9 lineage-tracing technologies.
format Online
Article
Text
id pubmed-10041172
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-100411722023-03-28 Theoretical guarantees for phylogeny inference from single-cell lineage tracing Wang, Robert Zhang, Richard Khodaverdian, Alex Yosef, Nir Proc Natl Acad Sci U S A Biological Sciences Lineage-tracing technologies based on Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9 (CRISPR-Cas9) genome editing have emerged as a powerful tool for investigating development in single-cell contexts, but exact reconstruction of the underlying clonal relationships in experiment is complicated by features of the data. These complications are functions of the experimental parameters in these systems, such as the Cas9 cutting rate, the diversity of indel outcomes, and the rate of missing data. In this paper, we develop two theoretically grounded algorithms for the reconstruction of the underlying single-cell phylogenetic tree as well as asymptotic bounds for the number of recording sites necessary for exact recapitulation of the ground truth phylogeny at high probability. In doing so, we explore the relationship between the problem difficulty and the experimental parameters, with implications for experimental design. Lastly, we provide simulations showing the empirical performance of these algorithms and showing that the trends in the asymptotic bounds hold empirically. Overall, this work provides a theoretical analysis of phylogenetic reconstruction in single-cell CRISPR-Cas9 lineage-tracing technologies. National Academy of Sciences 2023-03-16 2023-03-21 /pmc/articles/PMC10041172/ /pubmed/36927151 http://dx.doi.org/10.1073/pnas.2203352120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Biological Sciences
Wang, Robert
Zhang, Richard
Khodaverdian, Alex
Yosef, Nir
Theoretical guarantees for phylogeny inference from single-cell lineage tracing
title Theoretical guarantees for phylogeny inference from single-cell lineage tracing
title_full Theoretical guarantees for phylogeny inference from single-cell lineage tracing
title_fullStr Theoretical guarantees for phylogeny inference from single-cell lineage tracing
title_full_unstemmed Theoretical guarantees for phylogeny inference from single-cell lineage tracing
title_short Theoretical guarantees for phylogeny inference from single-cell lineage tracing
title_sort theoretical guarantees for phylogeny inference from single-cell lineage tracing
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041172/
https://www.ncbi.nlm.nih.gov/pubmed/36927151
http://dx.doi.org/10.1073/pnas.2203352120
work_keys_str_mv AT wangrobert theoreticalguaranteesforphylogenyinferencefromsinglecelllineagetracing
AT zhangrichard theoreticalguaranteesforphylogenyinferencefromsinglecelllineagetracing
AT khodaverdianalex theoreticalguaranteesforphylogenyinferencefromsinglecelllineagetracing
AT yosefnir theoreticalguaranteesforphylogenyinferencefromsinglecelllineagetracing