Cargando…

Transmission Trees on a Known Pathogen Phylogeny: Enumeration and Sampling

One approach to the reconstruction of infectious disease transmission trees from pathogen genomic data has been to use a phylogenetic tree, reconstructed from pathogen sequences, and annotate its internal nodes to provide a reconstruction of which host each lineage was in at each point in time. If o...

Descripción completa

Detalles Bibliográficos
Autores principales: Hall, Matthew D, Colijn, Caroline
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6526902/
https://www.ncbi.nlm.nih.gov/pubmed/30873529
http://dx.doi.org/10.1093/molbev/msz058
_version_ 1783419965196468224
author Hall, Matthew D
Colijn, Caroline
author_facet Hall, Matthew D
Colijn, Caroline
author_sort Hall, Matthew D
collection PubMed
description One approach to the reconstruction of infectious disease transmission trees from pathogen genomic data has been to use a phylogenetic tree, reconstructed from pathogen sequences, and annotate its internal nodes to provide a reconstruction of which host each lineage was in at each point in time. If only one pathogen lineage can be transmitted to a new host (i.e., the transmission bottleneck is complete), this corresponds to partitioning the nodes of the phylogeny into connected regions, each of which represents evolution in an individual host. These partitions define the possible transmission trees that are consistent with a given phylogenetic tree. However, the mathematical properties of the transmission trees given a phylogeny remain largely unexplored. Here, we describe a procedure to calculate the number of possible transmission trees for a given phylogeny, and we then show how to uniformly sample from these transmission trees. The procedure is outlined for situations where one sample is available from each host and trees do not have branch lengths, and we also provide extensions for incomplete sampling, multiple sampling, and the application to time trees in a situation where limits on the period during which each host could have been infected and infectious are known. The sampling algorithm is available as an R package (STraTUS).
format Online
Article
Text
id pubmed-6526902
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-65269022019-05-28 Transmission Trees on a Known Pathogen Phylogeny: Enumeration and Sampling Hall, Matthew D Colijn, Caroline Mol Biol Evol Methods One approach to the reconstruction of infectious disease transmission trees from pathogen genomic data has been to use a phylogenetic tree, reconstructed from pathogen sequences, and annotate its internal nodes to provide a reconstruction of which host each lineage was in at each point in time. If only one pathogen lineage can be transmitted to a new host (i.e., the transmission bottleneck is complete), this corresponds to partitioning the nodes of the phylogeny into connected regions, each of which represents evolution in an individual host. These partitions define the possible transmission trees that are consistent with a given phylogenetic tree. However, the mathematical properties of the transmission trees given a phylogeny remain largely unexplored. Here, we describe a procedure to calculate the number of possible transmission trees for a given phylogeny, and we then show how to uniformly sample from these transmission trees. The procedure is outlined for situations where one sample is available from each host and trees do not have branch lengths, and we also provide extensions for incomplete sampling, multiple sampling, and the application to time trees in a situation where limits on the period during which each host could have been infected and infectious are known. The sampling algorithm is available as an R package (STraTUS). Oxford University Press 2019-06 2019-03-14 /pmc/articles/PMC6526902/ /pubmed/30873529 http://dx.doi.org/10.1093/molbev/msz058 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Hall, Matthew D
Colijn, Caroline
Transmission Trees on a Known Pathogen Phylogeny: Enumeration and Sampling
title Transmission Trees on a Known Pathogen Phylogeny: Enumeration and Sampling
title_full Transmission Trees on a Known Pathogen Phylogeny: Enumeration and Sampling
title_fullStr Transmission Trees on a Known Pathogen Phylogeny: Enumeration and Sampling
title_full_unstemmed Transmission Trees on a Known Pathogen Phylogeny: Enumeration and Sampling
title_short Transmission Trees on a Known Pathogen Phylogeny: Enumeration and Sampling
title_sort transmission trees on a known pathogen phylogeny: enumeration and sampling
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6526902/
https://www.ncbi.nlm.nih.gov/pubmed/30873529
http://dx.doi.org/10.1093/molbev/msz058
work_keys_str_mv AT hallmatthewd transmissiontreesonaknownpathogenphylogenyenumerationandsampling
AT colijncaroline transmissiontreesonaknownpathogenphylogenyenumerationandsampling