Cargando…

Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification

To identify and stop active HIV transmission chains new epidemiological techniques are needed. Here, we describe the development of a multi-biomarker augmentation to phylogenetic inference of the underlying transmission history in a local population. HIV biomarkers are measurable biological quantiti...

Descripción completa

Detalles Bibliográficos
Autores principales: Lundgren, Erik, Romero-Severson, Ethan, Albert, Jan, Leitner, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9455879/
https://www.ncbi.nlm.nih.gov/pubmed/36026480
http://dx.doi.org/10.1371/journal.pcbi.1009741
_version_ 1784785676709920768
author Lundgren, Erik
Romero-Severson, Ethan
Albert, Jan
Leitner, Thomas
author_facet Lundgren, Erik
Romero-Severson, Ethan
Albert, Jan
Leitner, Thomas
author_sort Lundgren, Erik
collection PubMed
description To identify and stop active HIV transmission chains new epidemiological techniques are needed. Here, we describe the development of a multi-biomarker augmentation to phylogenetic inference of the underlying transmission history in a local population. HIV biomarkers are measurable biological quantities that have some relationship to the amount of time someone has been infected with HIV. To train our model, we used five biomarkers based on real data from serological assays, HIV sequence data, and target cell counts in longitudinally followed, untreated patients with known infection times. The biomarkers were modeled with a mixed effects framework to allow for patient specific variation and general trends, and fit to patient data using Markov Chain Monte Carlo (MCMC) methods. Subsequently, the density of the unobserved infection time conditional on observed biomarkers were obtained by integrating out the random effects from the model fit. This probabilistic information about infection times was incorporated into the likelihood function for the transmission history and phylogenetic tree reconstruction, informed by the HIV sequence data. To critically test our methodology, we developed a coalescent-based simulation framework that generates phylogenies and biomarkers given a specific or general transmission history. Testing on many epidemiological scenarios showed that biomarker augmented phylogenetics can reach 90% accuracy under idealized situations. Under realistic within-host HIV-1 evolution, involving substantial within-host diversification and frequent transmission of multiple lineages, the average accuracy was at about 50% in transmission clusters involving 5–50 hosts. Realistic biomarker data added on average 16 percentage points over using the phylogeny alone. Using more biomarkers improved the performance. Shorter temporal spacing between transmission events and increased transmission heterogeneity reduced reconstruction accuracy, but larger clusters were not harder to get right. More sequence data per infected host also improved accuracy. We show that the method is robust to incomplete sampling and that adding biomarkers improves reconstructions of real HIV-1 transmission histories. The technology presented here could allow for better prevention programs by providing data for locally informed and tailored strategies.
format Online
Article
Text
id pubmed-9455879
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-94558792022-09-09 Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification Lundgren, Erik Romero-Severson, Ethan Albert, Jan Leitner, Thomas PLoS Comput Biol Research Article To identify and stop active HIV transmission chains new epidemiological techniques are needed. Here, we describe the development of a multi-biomarker augmentation to phylogenetic inference of the underlying transmission history in a local population. HIV biomarkers are measurable biological quantities that have some relationship to the amount of time someone has been infected with HIV. To train our model, we used five biomarkers based on real data from serological assays, HIV sequence data, and target cell counts in longitudinally followed, untreated patients with known infection times. The biomarkers were modeled with a mixed effects framework to allow for patient specific variation and general trends, and fit to patient data using Markov Chain Monte Carlo (MCMC) methods. Subsequently, the density of the unobserved infection time conditional on observed biomarkers were obtained by integrating out the random effects from the model fit. This probabilistic information about infection times was incorporated into the likelihood function for the transmission history and phylogenetic tree reconstruction, informed by the HIV sequence data. To critically test our methodology, we developed a coalescent-based simulation framework that generates phylogenies and biomarkers given a specific or general transmission history. Testing on many epidemiological scenarios showed that biomarker augmented phylogenetics can reach 90% accuracy under idealized situations. Under realistic within-host HIV-1 evolution, involving substantial within-host diversification and frequent transmission of multiple lineages, the average accuracy was at about 50% in transmission clusters involving 5–50 hosts. Realistic biomarker data added on average 16 percentage points over using the phylogeny alone. Using more biomarkers improved the performance. Shorter temporal spacing between transmission events and increased transmission heterogeneity reduced reconstruction accuracy, but larger clusters were not harder to get right. More sequence data per infected host also improved accuracy. We show that the method is robust to incomplete sampling and that adding biomarkers improves reconstructions of real HIV-1 transmission histories. The technology presented here could allow for better prevention programs by providing data for locally informed and tailored strategies. Public Library of Science 2022-08-26 /pmc/articles/PMC9455879/ /pubmed/36026480 http://dx.doi.org/10.1371/journal.pcbi.1009741 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Lundgren, Erik
Romero-Severson, Ethan
Albert, Jan
Leitner, Thomas
Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification
title Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification
title_full Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification
title_fullStr Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification
title_full_unstemmed Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification
title_short Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification
title_sort combining biomarker and virus phylogenetic models improves hiv-1 epidemiological source identification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9455879/
https://www.ncbi.nlm.nih.gov/pubmed/36026480
http://dx.doi.org/10.1371/journal.pcbi.1009741
work_keys_str_mv AT lundgrenerik combiningbiomarkerandvirusphylogeneticmodelsimproveshiv1epidemiologicalsourceidentification
AT romeroseversonethan combiningbiomarkerandvirusphylogeneticmodelsimproveshiv1epidemiologicalsourceidentification
AT albertjan combiningbiomarkerandvirusphylogeneticmodelsimproveshiv1epidemiologicalsourceidentification
AT leitnerthomas combiningbiomarkerandvirusphylogeneticmodelsimproveshiv1epidemiologicalsourceidentification