Cargando…

Sampling and summarizing transmission trees with multi-strain infections

MOTIVATION: The combination of genomic and epidemiological data holds the potential to enable accurate pathogen transmission history inference. However, the inference of outbreak transmission histories remains challenging due to various factors such as within-host pathogen diversity and multi-strain...

Descripción completa

Detalles Bibliográficos
Autores principales: Sashittal, Palash, El-Kebir, Mohammed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355255/
https://www.ncbi.nlm.nih.gov/pubmed/32657399
http://dx.doi.org/10.1093/bioinformatics/btaa438
_version_ 1783558238081384448
author Sashittal, Palash
El-Kebir, Mohammed
author_facet Sashittal, Palash
El-Kebir, Mohammed
author_sort Sashittal, Palash
collection PubMed
description MOTIVATION: The combination of genomic and epidemiological data holds the potential to enable accurate pathogen transmission history inference. However, the inference of outbreak transmission histories remains challenging due to various factors such as within-host pathogen diversity and multi-strain infections. Current computational methods ignore within-host diversity and/or multi-strain infections, often failing to accurately infer the transmission history. Thus, there is a need for efficient computational methods for transmission tree inference that accommodate the complexities of real data. RESULTS: We formulate the direct transmission inference (DTI) problem for inferring transmission trees that support multi-strain infections given a timed phylogeny and additional epidemiological data. We establish hardness for the decision and counting version of the DTI problem. We introduce Transmission Tree Uniform Sampler (TiTUS), a method that uses SATISFIABILITY to almost uniformly sample from the space of transmission trees. We introduce criteria that prioritize parsimonious transmission trees that we subsequently summarize using a novel consensus tree approach. We demonstrate TiTUS’s ability to accurately reconstruct transmission trees on simulated data as well as a documented HIV transmission chain. AVAILABILITY AND IMPLEMENTATION: https://github.com/elkebir-group/TiTUS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7355255
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73552552020-07-16 Sampling and summarizing transmission trees with multi-strain infections Sashittal, Palash El-Kebir, Mohammed Bioinformatics Population Genomics and Molecular Evolution MOTIVATION: The combination of genomic and epidemiological data holds the potential to enable accurate pathogen transmission history inference. However, the inference of outbreak transmission histories remains challenging due to various factors such as within-host pathogen diversity and multi-strain infections. Current computational methods ignore within-host diversity and/or multi-strain infections, often failing to accurately infer the transmission history. Thus, there is a need for efficient computational methods for transmission tree inference that accommodate the complexities of real data. RESULTS: We formulate the direct transmission inference (DTI) problem for inferring transmission trees that support multi-strain infections given a timed phylogeny and additional epidemiological data. We establish hardness for the decision and counting version of the DTI problem. We introduce Transmission Tree Uniform Sampler (TiTUS), a method that uses SATISFIABILITY to almost uniformly sample from the space of transmission trees. We introduce criteria that prioritize parsimonious transmission trees that we subsequently summarize using a novel consensus tree approach. We demonstrate TiTUS’s ability to accurately reconstruct transmission trees on simulated data as well as a documented HIV transmission chain. AVAILABILITY AND IMPLEMENTATION: https://github.com/elkebir-group/TiTUS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355255/ /pubmed/32657399 http://dx.doi.org/10.1093/bioinformatics/btaa438 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Population Genomics and Molecular Evolution
Sashittal, Palash
El-Kebir, Mohammed
Sampling and summarizing transmission trees with multi-strain infections
title Sampling and summarizing transmission trees with multi-strain infections
title_full Sampling and summarizing transmission trees with multi-strain infections
title_fullStr Sampling and summarizing transmission trees with multi-strain infections
title_full_unstemmed Sampling and summarizing transmission trees with multi-strain infections
title_short Sampling and summarizing transmission trees with multi-strain infections
title_sort sampling and summarizing transmission trees with multi-strain infections
topic Population Genomics and Molecular Evolution
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355255/
https://www.ncbi.nlm.nih.gov/pubmed/32657399
http://dx.doi.org/10.1093/bioinformatics/btaa438
work_keys_str_mv AT sashittalpalash samplingandsummarizingtransmissiontreeswithmultistraininfections
AT elkebirmohammed samplingandsummarizingtransmissiontreeswithmultistraininfections