Cargando…
Sampling and summarizing transmission trees with multi-strain infections
MOTIVATION: The combination of genomic and epidemiological data holds the potential to enable accurate pathogen transmission history inference. However, the inference of outbreak transmission histories remains challenging due to various factors such as within-host pathogen diversity and multi-strain...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355255/ https://www.ncbi.nlm.nih.gov/pubmed/32657399 http://dx.doi.org/10.1093/bioinformatics/btaa438 |
_version_ | 1783558238081384448 |
---|---|
author | Sashittal, Palash El-Kebir, Mohammed |
author_facet | Sashittal, Palash El-Kebir, Mohammed |
author_sort | Sashittal, Palash |
collection | PubMed |
description | MOTIVATION: The combination of genomic and epidemiological data holds the potential to enable accurate pathogen transmission history inference. However, the inference of outbreak transmission histories remains challenging due to various factors such as within-host pathogen diversity and multi-strain infections. Current computational methods ignore within-host diversity and/or multi-strain infections, often failing to accurately infer the transmission history. Thus, there is a need for efficient computational methods for transmission tree inference that accommodate the complexities of real data. RESULTS: We formulate the direct transmission inference (DTI) problem for inferring transmission trees that support multi-strain infections given a timed phylogeny and additional epidemiological data. We establish hardness for the decision and counting version of the DTI problem. We introduce Transmission Tree Uniform Sampler (TiTUS), a method that uses SATISFIABILITY to almost uniformly sample from the space of transmission trees. We introduce criteria that prioritize parsimonious transmission trees that we subsequently summarize using a novel consensus tree approach. We demonstrate TiTUS’s ability to accurately reconstruct transmission trees on simulated data as well as a documented HIV transmission chain. AVAILABILITY AND IMPLEMENTATION: https://github.com/elkebir-group/TiTUS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7355255 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-73552552020-07-16 Sampling and summarizing transmission trees with multi-strain infections Sashittal, Palash El-Kebir, Mohammed Bioinformatics Population Genomics and Molecular Evolution MOTIVATION: The combination of genomic and epidemiological data holds the potential to enable accurate pathogen transmission history inference. However, the inference of outbreak transmission histories remains challenging due to various factors such as within-host pathogen diversity and multi-strain infections. Current computational methods ignore within-host diversity and/or multi-strain infections, often failing to accurately infer the transmission history. Thus, there is a need for efficient computational methods for transmission tree inference that accommodate the complexities of real data. RESULTS: We formulate the direct transmission inference (DTI) problem for inferring transmission trees that support multi-strain infections given a timed phylogeny and additional epidemiological data. We establish hardness for the decision and counting version of the DTI problem. We introduce Transmission Tree Uniform Sampler (TiTUS), a method that uses SATISFIABILITY to almost uniformly sample from the space of transmission trees. We introduce criteria that prioritize parsimonious transmission trees that we subsequently summarize using a novel consensus tree approach. We demonstrate TiTUS’s ability to accurately reconstruct transmission trees on simulated data as well as a documented HIV transmission chain. AVAILABILITY AND IMPLEMENTATION: https://github.com/elkebir-group/TiTUS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355255/ /pubmed/32657399 http://dx.doi.org/10.1093/bioinformatics/btaa438 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Population Genomics and Molecular Evolution Sashittal, Palash El-Kebir, Mohammed Sampling and summarizing transmission trees with multi-strain infections |
title | Sampling and summarizing transmission trees with multi-strain infections |
title_full | Sampling and summarizing transmission trees with multi-strain infections |
title_fullStr | Sampling and summarizing transmission trees with multi-strain infections |
title_full_unstemmed | Sampling and summarizing transmission trees with multi-strain infections |
title_short | Sampling and summarizing transmission trees with multi-strain infections |
title_sort | sampling and summarizing transmission trees with multi-strain infections |
topic | Population Genomics and Molecular Evolution |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355255/ https://www.ncbi.nlm.nih.gov/pubmed/32657399 http://dx.doi.org/10.1093/bioinformatics/btaa438 |
work_keys_str_mv | AT sashittalpalash samplingandsummarizingtransmissiontreeswithmultistraininfections AT elkebirmohammed samplingandsummarizingtransmissiontreeswithmultistraininfections |