Cargando…

Using genetic data to identify transmission risk factors: Statistical assessment and application to tuberculosis transmission

Identifying host factors that influence infectious disease transmission is an important step toward developing interventions to reduce disease incidence. Recent advances in methods for reconstructing infectious disease transmission events using pathogen genomic and epidemiological data open the door...

Descripción completa

Detalles Bibliográficos
Autores principales: Goldstein, Isaac H., Bayer, Damon, Barilar, Ivan, Kizito, Balladiah, Matsiri, Ogopotse, Modongo, Chawangwa, Zetola, Nicola M., Niemann, Stefan, Minin, Volodymyr M., Shin, Sanghyuk S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9754595/
https://www.ncbi.nlm.nih.gov/pubmed/36469509
http://dx.doi.org/10.1371/journal.pcbi.1010696
_version_ 1784851234651373568
author Goldstein, Isaac H.
Bayer, Damon
Barilar, Ivan
Kizito, Balladiah
Matsiri, Ogopotse
Modongo, Chawangwa
Zetola, Nicola M.
Niemann, Stefan
Minin, Volodymyr M.
Shin, Sanghyuk S.
author_facet Goldstein, Isaac H.
Bayer, Damon
Barilar, Ivan
Kizito, Balladiah
Matsiri, Ogopotse
Modongo, Chawangwa
Zetola, Nicola M.
Niemann, Stefan
Minin, Volodymyr M.
Shin, Sanghyuk S.
author_sort Goldstein, Isaac H.
collection PubMed
description Identifying host factors that influence infectious disease transmission is an important step toward developing interventions to reduce disease incidence. Recent advances in methods for reconstructing infectious disease transmission events using pathogen genomic and epidemiological data open the door for investigation of host factors that affect onward transmission. While most transmission reconstruction methods are designed to work with densely sampled outbreaks, these methods are making their way into surveillance studies, where the fraction of sampled cases with sequenced pathogens could be relatively low. Surveillance studies that use transmission event reconstruction then use the reconstructed events as response variables (i.e., infection source status of each sampled case) and use host characteristics as predictors (e.g., presence of HIV infection) in regression models. We use simulations to study estimation of the effect of a host factor on probability of being an infection source via this multi-step inferential procedure. Using TransPhylo—a widely-used method for Bayesian estimation of infectious disease transmission events—and logistic regression, we find that low sensitivity of identifying infection sources leads to dilution of the signal, biasing logistic regression coefficients toward zero. We show that increasing the proportion of sampled cases improves sensitivity and some, but not all properties of the logistic regression inference. Application of these approaches to real world data from a population-based TB study in Botswana fails to detect an association between HIV infection and probability of being a TB infection source. We conclude that application of a pipeline, where one first uses TransPhylo and sparsely sampled surveillance data to infer transmission events and then estimates effects of host characteristics on probabilities of these events, should be accompanied by a realistic simulation study to better understand biases stemming from imprecise transmission event inference.
format Online
Article
Text
id pubmed-9754595
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-97545952022-12-16 Using genetic data to identify transmission risk factors: Statistical assessment and application to tuberculosis transmission Goldstein, Isaac H. Bayer, Damon Barilar, Ivan Kizito, Balladiah Matsiri, Ogopotse Modongo, Chawangwa Zetola, Nicola M. Niemann, Stefan Minin, Volodymyr M. Shin, Sanghyuk S. PLoS Comput Biol Research Article Identifying host factors that influence infectious disease transmission is an important step toward developing interventions to reduce disease incidence. Recent advances in methods for reconstructing infectious disease transmission events using pathogen genomic and epidemiological data open the door for investigation of host factors that affect onward transmission. While most transmission reconstruction methods are designed to work with densely sampled outbreaks, these methods are making their way into surveillance studies, where the fraction of sampled cases with sequenced pathogens could be relatively low. Surveillance studies that use transmission event reconstruction then use the reconstructed events as response variables (i.e., infection source status of each sampled case) and use host characteristics as predictors (e.g., presence of HIV infection) in regression models. We use simulations to study estimation of the effect of a host factor on probability of being an infection source via this multi-step inferential procedure. Using TransPhylo—a widely-used method for Bayesian estimation of infectious disease transmission events—and logistic regression, we find that low sensitivity of identifying infection sources leads to dilution of the signal, biasing logistic regression coefficients toward zero. We show that increasing the proportion of sampled cases improves sensitivity and some, but not all properties of the logistic regression inference. Application of these approaches to real world data from a population-based TB study in Botswana fails to detect an association between HIV infection and probability of being a TB infection source. We conclude that application of a pipeline, where one first uses TransPhylo and sparsely sampled surveillance data to infer transmission events and then estimates effects of host characteristics on probabilities of these events, should be accompanied by a realistic simulation study to better understand biases stemming from imprecise transmission event inference. Public Library of Science 2022-12-05 /pmc/articles/PMC9754595/ /pubmed/36469509 http://dx.doi.org/10.1371/journal.pcbi.1010696 Text en © 2022 Goldstein et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Goldstein, Isaac H.
Bayer, Damon
Barilar, Ivan
Kizito, Balladiah
Matsiri, Ogopotse
Modongo, Chawangwa
Zetola, Nicola M.
Niemann, Stefan
Minin, Volodymyr M.
Shin, Sanghyuk S.
Using genetic data to identify transmission risk factors: Statistical assessment and application to tuberculosis transmission
title Using genetic data to identify transmission risk factors: Statistical assessment and application to tuberculosis transmission
title_full Using genetic data to identify transmission risk factors: Statistical assessment and application to tuberculosis transmission
title_fullStr Using genetic data to identify transmission risk factors: Statistical assessment and application to tuberculosis transmission
title_full_unstemmed Using genetic data to identify transmission risk factors: Statistical assessment and application to tuberculosis transmission
title_short Using genetic data to identify transmission risk factors: Statistical assessment and application to tuberculosis transmission
title_sort using genetic data to identify transmission risk factors: statistical assessment and application to tuberculosis transmission
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9754595/
https://www.ncbi.nlm.nih.gov/pubmed/36469509
http://dx.doi.org/10.1371/journal.pcbi.1010696
work_keys_str_mv AT goldsteinisaach usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission
AT bayerdamon usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission
AT barilarivan usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission
AT kizitoballadiah usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission
AT matsiriogopotse usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission
AT modongochawangwa usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission
AT zetolanicolam usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission
AT niemannstefan usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission
AT mininvolodymyrm usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission
AT shinsanghyuks usinggeneticdatatoidentifytransmissionriskfactorsstatisticalassessmentandapplicationtotuberculosistransmission