Cargando…

Transmission bottleneck size estimation from de novo viral genetic variation

Sequencing of viral infections has become increasingly common over the last decade. Deep sequencing data in particular have proven useful in characterizing the roles that genetic drift and natural selection play in shaping within-host viral populations. They have also been used to estimate transmiss...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Teresa, Harris, Jeremy D., Martin, Michael A., Koelle, Katia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462048/
https://www.ncbi.nlm.nih.gov/pubmed/37645981
http://dx.doi.org/10.1101/2023.08.14.553219
_version_ 1785097979805302784
author Shi, Teresa
Harris, Jeremy D.
Martin, Michael A.
Koelle, Katia
author_facet Shi, Teresa
Harris, Jeremy D.
Martin, Michael A.
Koelle, Katia
author_sort Shi, Teresa
collection PubMed
description Sequencing of viral infections has become increasingly common over the last decade. Deep sequencing data in particular have proven useful in characterizing the roles that genetic drift and natural selection play in shaping within-host viral populations. They have also been used to estimate transmission bottleneck sizes from identified donor-recipient pairs. These bottleneck sizes quantify the number of viral particles that establish genetic lineages in the recipient host and are important to estimate due to their impact on viral evolution. Current approaches for estimating bottleneck sizes exclusively consider the subset of viral sites that are observed as polymorphic in the donor individual. However, allele frequencies can change dramatically over the course of an individual’s infection, such that sites that are polymorphic in the donor at the time of transmission may not be polymorphic in the donor at the time of sampling and allele frequencies at donor-polymorphic sites may change dramatically over the course of a recipient’s infection. Because of this, transmission bottleneck sizes estimated using allele frequencies observed at a donor’s polymorphic sites may be considerable underestimates of true bottleneck sizes. Here, we present a new statistical approach for instead estimating bottleneck sizes using patterns of viral genetic variation that arose de novo within a recipient individual. Specifically, our approach makes use of the number of clonal viral variants observed in a transmission pair, defined as the number of viral sites that are monomorphic in both the donor and the recipient but carry different alleles. We first test our approach on a simulated dataset and then apply it to both influenza A virus sequence data and SARS-CoV-2 sequence data from identified transmission pairs. Our results confirm the existence of extremely tight transmission bottlenecks for these two respiratory viruses, using an approach that does not tend to underestimate transmission bottleneck sizes.
format Online
Article
Text
id pubmed-10462048
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104620482023-08-29 Transmission bottleneck size estimation from de novo viral genetic variation Shi, Teresa Harris, Jeremy D. Martin, Michael A. Koelle, Katia bioRxiv Article Sequencing of viral infections has become increasingly common over the last decade. Deep sequencing data in particular have proven useful in characterizing the roles that genetic drift and natural selection play in shaping within-host viral populations. They have also been used to estimate transmission bottleneck sizes from identified donor-recipient pairs. These bottleneck sizes quantify the number of viral particles that establish genetic lineages in the recipient host and are important to estimate due to their impact on viral evolution. Current approaches for estimating bottleneck sizes exclusively consider the subset of viral sites that are observed as polymorphic in the donor individual. However, allele frequencies can change dramatically over the course of an individual’s infection, such that sites that are polymorphic in the donor at the time of transmission may not be polymorphic in the donor at the time of sampling and allele frequencies at donor-polymorphic sites may change dramatically over the course of a recipient’s infection. Because of this, transmission bottleneck sizes estimated using allele frequencies observed at a donor’s polymorphic sites may be considerable underestimates of true bottleneck sizes. Here, we present a new statistical approach for instead estimating bottleneck sizes using patterns of viral genetic variation that arose de novo within a recipient individual. Specifically, our approach makes use of the number of clonal viral variants observed in a transmission pair, defined as the number of viral sites that are monomorphic in both the donor and the recipient but carry different alleles. We first test our approach on a simulated dataset and then apply it to both influenza A virus sequence data and SARS-CoV-2 sequence data from identified transmission pairs. Our results confirm the existence of extremely tight transmission bottlenecks for these two respiratory viruses, using an approach that does not tend to underestimate transmission bottleneck sizes. Cold Spring Harbor Laboratory 2023-08-14 /pmc/articles/PMC10462048/ /pubmed/37645981 http://dx.doi.org/10.1101/2023.08.14.553219 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Shi, Teresa
Harris, Jeremy D.
Martin, Michael A.
Koelle, Katia
Transmission bottleneck size estimation from de novo viral genetic variation
title Transmission bottleneck size estimation from de novo viral genetic variation
title_full Transmission bottleneck size estimation from de novo viral genetic variation
title_fullStr Transmission bottleneck size estimation from de novo viral genetic variation
title_full_unstemmed Transmission bottleneck size estimation from de novo viral genetic variation
title_short Transmission bottleneck size estimation from de novo viral genetic variation
title_sort transmission bottleneck size estimation from de novo viral genetic variation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462048/
https://www.ncbi.nlm.nih.gov/pubmed/37645981
http://dx.doi.org/10.1101/2023.08.14.553219
work_keys_str_mv AT shiteresa transmissionbottlenecksizeestimationfromdenovoviralgeneticvariation
AT harrisjeremyd transmissionbottlenecksizeestimationfromdenovoviralgeneticvariation
AT martinmichaela transmissionbottlenecksizeestimationfromdenovoviralgeneticvariation
AT koellekatia transmissionbottlenecksizeestimationfromdenovoviralgeneticvariation