Cargando…

Two-phase importance sampling for inference about transmission trees

There has been growing interest in the statistics community to develop methods for inferring transmission pathways of infectious pathogens from molecular sequence data. For many datasets, the computational challenge lies in the huge dimension of the missing data. Here, we introduce an importance sam...

Descripción completa

Detalles Bibliográficos
Autores principales: Numminen, Elina, Chewapreecha, Claire, Sirén, Jukka, Turner, Claudia, Turner, Paul, Bentley, Stephen D., Corander, Jukka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4211445/
https://www.ncbi.nlm.nih.gov/pubmed/25253455
http://dx.doi.org/10.1098/rspb.2014.1324
_version_ 1782341570443345920
author Numminen, Elina
Chewapreecha, Claire
Sirén, Jukka
Turner, Claudia
Turner, Paul
Bentley, Stephen D.
Corander, Jukka
author_facet Numminen, Elina
Chewapreecha, Claire
Sirén, Jukka
Turner, Claudia
Turner, Paul
Bentley, Stephen D.
Corander, Jukka
author_sort Numminen, Elina
collection PubMed
description There has been growing interest in the statistics community to develop methods for inferring transmission pathways of infectious pathogens from molecular sequence data. For many datasets, the computational challenge lies in the huge dimension of the missing data. Here, we introduce an importance sampling scheme in which the transmission trees and phylogenies of pathogens are both sampled from reasonable importance distributions, alleviating the inference. Using this approach, arbitrary models of transmission could be considered, contrary to many earlier proposed methods. We illustrate the scheme by analysing transmissions of Streptococcus pneumoniae from household to household within a refugee camp, using data in which only a fraction of hosts is observed, but which is still rich enough to unravel the within-household transmission dynamics and pairs of households between whom transmission is plausible. We observe that while probability of direct transmission is low even for the most prominent cases of transmission, still those pairs of households are geographically much closer to each other than expected under random proximity.
format Online
Article
Text
id pubmed-4211445
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-42114452014-11-07 Two-phase importance sampling for inference about transmission trees Numminen, Elina Chewapreecha, Claire Sirén, Jukka Turner, Claudia Turner, Paul Bentley, Stephen D. Corander, Jukka Proc Biol Sci Research Articles There has been growing interest in the statistics community to develop methods for inferring transmission pathways of infectious pathogens from molecular sequence data. For many datasets, the computational challenge lies in the huge dimension of the missing data. Here, we introduce an importance sampling scheme in which the transmission trees and phylogenies of pathogens are both sampled from reasonable importance distributions, alleviating the inference. Using this approach, arbitrary models of transmission could be considered, contrary to many earlier proposed methods. We illustrate the scheme by analysing transmissions of Streptococcus pneumoniae from household to household within a refugee camp, using data in which only a fraction of hosts is observed, but which is still rich enough to unravel the within-household transmission dynamics and pairs of households between whom transmission is plausible. We observe that while probability of direct transmission is low even for the most prominent cases of transmission, still those pairs of households are geographically much closer to each other than expected under random proximity. The Royal Society 2014-11-07 /pmc/articles/PMC4211445/ /pubmed/25253455 http://dx.doi.org/10.1098/rspb.2014.1324 Text en http://creativecommons.org/licenses/by/4.0/ © 2014 The Authors. Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Research Articles
Numminen, Elina
Chewapreecha, Claire
Sirén, Jukka
Turner, Claudia
Turner, Paul
Bentley, Stephen D.
Corander, Jukka
Two-phase importance sampling for inference about transmission trees
title Two-phase importance sampling for inference about transmission trees
title_full Two-phase importance sampling for inference about transmission trees
title_fullStr Two-phase importance sampling for inference about transmission trees
title_full_unstemmed Two-phase importance sampling for inference about transmission trees
title_short Two-phase importance sampling for inference about transmission trees
title_sort two-phase importance sampling for inference about transmission trees
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4211445/
https://www.ncbi.nlm.nih.gov/pubmed/25253455
http://dx.doi.org/10.1098/rspb.2014.1324
work_keys_str_mv AT numminenelina twophaseimportancesamplingforinferenceabouttransmissiontrees
AT chewapreechaclaire twophaseimportancesamplingforinferenceabouttransmissiontrees
AT sirenjukka twophaseimportancesamplingforinferenceabouttransmissiontrees
AT turnerclaudia twophaseimportancesamplingforinferenceabouttransmissiontrees
AT turnerpaul twophaseimportancesamplingforinferenceabouttransmissiontrees
AT bentleystephend twophaseimportancesamplingforinferenceabouttransmissiontrees
AT coranderjukka twophaseimportancesamplingforinferenceabouttransmissiontrees