Cargando…

Enumeration of binary trees compatible with a perfect phylogeny

Evolutionary models used for describing molecular sequence variation suppose that at a non-recombining genomic segment, sequences share ancestry that can be represented as a genealogy—a rooted, binary, timed tree, with tips corresponding to individual sequences. Under the infinitely-many-sites mutat...

Descripción completa

Detalles Bibliográficos
Autores principales: Palacios, Julia A., Bhaskar, Anand, Disanto, Filippo, Rosenberg, Noah A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9098623/
https://www.ncbi.nlm.nih.gov/pubmed/35552538
http://dx.doi.org/10.1007/s00285-022-01748-w
_version_ 1784706426969522176
author Palacios, Julia A.
Bhaskar, Anand
Disanto, Filippo
Rosenberg, Noah A.
author_facet Palacios, Julia A.
Bhaskar, Anand
Disanto, Filippo
Rosenberg, Noah A.
author_sort Palacios, Julia A.
collection PubMed
description Evolutionary models used for describing molecular sequence variation suppose that at a non-recombining genomic segment, sequences share ancestry that can be represented as a genealogy—a rooted, binary, timed tree, with tips corresponding to individual sequences. Under the infinitely-many-sites mutation model, mutations are randomly superimposed along the branches of the genealogy, so that every mutation occurs at a chromosomal site that has not previously mutated; if a mutation occurs at an interior branch, then all individuals descending from that branch carry the mutation. The implication is that observed patterns of molecular variation from this model impose combinatorial constraints on the hidden state space of genealogies. In particular, observed molecular variation can be represented in the form of a perfect phylogeny, a tree structure that fully encodes the mutational differences among sequences. For a sample of n sequences, a perfect phylogeny might not possess n distinct leaves, and hence might be compatible with many possible binary tree structures that could describe the evolutionary relationships among the n sequences. Here, we investigate enumerative properties of the set of binary ranked and unranked tree shapes that are compatible with a perfect phylogeny, and hence, the binary ranked and unranked tree shapes conditioned on an observed pattern of mutations under the infinitely-many-sites mutation model. We provide a recursive enumeration of these shapes. We consider both perfect phylogenies that can be represented as binary and those that are multifurcating. The results have implications for computational aspects of the statistical inference of evolutionary parameters that underlie sets of molecular sequences.
format Online
Article
Text
id pubmed-9098623
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-90986232022-05-14 Enumeration of binary trees compatible with a perfect phylogeny Palacios, Julia A. Bhaskar, Anand Disanto, Filippo Rosenberg, Noah A. J Math Biol Article Evolutionary models used for describing molecular sequence variation suppose that at a non-recombining genomic segment, sequences share ancestry that can be represented as a genealogy—a rooted, binary, timed tree, with tips corresponding to individual sequences. Under the infinitely-many-sites mutation model, mutations are randomly superimposed along the branches of the genealogy, so that every mutation occurs at a chromosomal site that has not previously mutated; if a mutation occurs at an interior branch, then all individuals descending from that branch carry the mutation. The implication is that observed patterns of molecular variation from this model impose combinatorial constraints on the hidden state space of genealogies. In particular, observed molecular variation can be represented in the form of a perfect phylogeny, a tree structure that fully encodes the mutational differences among sequences. For a sample of n sequences, a perfect phylogeny might not possess n distinct leaves, and hence might be compatible with many possible binary tree structures that could describe the evolutionary relationships among the n sequences. Here, we investigate enumerative properties of the set of binary ranked and unranked tree shapes that are compatible with a perfect phylogeny, and hence, the binary ranked and unranked tree shapes conditioned on an observed pattern of mutations under the infinitely-many-sites mutation model. We provide a recursive enumeration of these shapes. We consider both perfect phylogenies that can be represented as binary and those that are multifurcating. The results have implications for computational aspects of the statistical inference of evolutionary parameters that underlie sets of molecular sequences. Springer Berlin Heidelberg 2022-05-12 2022 /pmc/articles/PMC9098623/ /pubmed/35552538 http://dx.doi.org/10.1007/s00285-022-01748-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. (https://creativecommons.org/licenses/by/4.0/)
spellingShingle Article
Palacios, Julia A.
Bhaskar, Anand
Disanto, Filippo
Rosenberg, Noah A.
Enumeration of binary trees compatible with a perfect phylogeny
title Enumeration of binary trees compatible with a perfect phylogeny
title_full Enumeration of binary trees compatible with a perfect phylogeny
title_fullStr Enumeration of binary trees compatible with a perfect phylogeny
title_full_unstemmed Enumeration of binary trees compatible with a perfect phylogeny
title_short Enumeration of binary trees compatible with a perfect phylogeny
title_sort enumeration of binary trees compatible with a perfect phylogeny
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9098623/
https://www.ncbi.nlm.nih.gov/pubmed/35552538
http://dx.doi.org/10.1007/s00285-022-01748-w
work_keys_str_mv AT palaciosjuliaa enumerationofbinarytreescompatiblewithaperfectphylogeny
AT bhaskaranand enumerationofbinarytreescompatiblewithaperfectphylogeny
AT disantofilippo enumerationofbinarytreescompatiblewithaperfectphylogeny
AT rosenbergnoaha enumerationofbinarytreescompatiblewithaperfectphylogeny