Cargando…
Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection
The global effort to sequence millions of SARS-CoV-2 genomes has provided an unprecedented view of viral evolution. Characterizing how selection acts on SARS-CoV-2 is critical to developing effective, long-lasting vaccines and other treatments, but the scale and complexity of genomic surveillance da...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9779722/ https://www.ncbi.nlm.nih.gov/pubmed/36508459 http://dx.doi.org/10.1371/journal.pgen.1010540 |
_version_ | 1784856679403225088 |
---|---|
author | Jankowiak, Martin Obermeyer, Fritz H. Lemieux, Jacob E. |
author_facet | Jankowiak, Martin Obermeyer, Fritz H. Lemieux, Jacob E. |
author_sort | Jankowiak, Martin |
collection | PubMed |
description | The global effort to sequence millions of SARS-CoV-2 genomes has provided an unprecedented view of viral evolution. Characterizing how selection acts on SARS-CoV-2 is critical to developing effective, long-lasting vaccines and other treatments, but the scale and complexity of genomic surveillance data make rigorous analysis challenging. To meet this challenge, we develop Bayesian Viral Allele Selection (BVAS), a principled and scalable probabilistic method for inferring the genetic determinants of differential viral fitness and the relative growth rates of viral lineages, including newly emergent lineages. After demonstrating the accuracy and efficacy of our method through simulation, we apply BVAS to 6.9 million SARS-CoV-2 genomes. We identify numerous mutations that increase fitness, including previously identified mutations in the SARS-CoV-2 Spike and Nucleocapsid proteins, as well as mutations in non-structural proteins whose contribution to fitness is less well characterized. In addition, we extend our baseline model to identify mutations whose fitness exhibits strong dependence on vaccination status as well as pairwise interaction effects, i.e. epistasis. Strikingly, both these analyses point to the pivotal role played by the N501 residue in the Spike protein. Our method, which couples Bayesian variable selection with a diffusion approximation in allele frequency space, lays a foundation for identifying fitness-associated mutations under the assumption that most alleles are neutral. |
format | Online Article Text |
id | pubmed-9779722 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-97797222022-12-23 Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection Jankowiak, Martin Obermeyer, Fritz H. Lemieux, Jacob E. PLoS Genet Research Article The global effort to sequence millions of SARS-CoV-2 genomes has provided an unprecedented view of viral evolution. Characterizing how selection acts on SARS-CoV-2 is critical to developing effective, long-lasting vaccines and other treatments, but the scale and complexity of genomic surveillance data make rigorous analysis challenging. To meet this challenge, we develop Bayesian Viral Allele Selection (BVAS), a principled and scalable probabilistic method for inferring the genetic determinants of differential viral fitness and the relative growth rates of viral lineages, including newly emergent lineages. After demonstrating the accuracy and efficacy of our method through simulation, we apply BVAS to 6.9 million SARS-CoV-2 genomes. We identify numerous mutations that increase fitness, including previously identified mutations in the SARS-CoV-2 Spike and Nucleocapsid proteins, as well as mutations in non-structural proteins whose contribution to fitness is less well characterized. In addition, we extend our baseline model to identify mutations whose fitness exhibits strong dependence on vaccination status as well as pairwise interaction effects, i.e. epistasis. Strikingly, both these analyses point to the pivotal role played by the N501 residue in the Spike protein. Our method, which couples Bayesian variable selection with a diffusion approximation in allele frequency space, lays a foundation for identifying fitness-associated mutations under the assumption that most alleles are neutral. Public Library of Science 2022-12-12 /pmc/articles/PMC9779722/ /pubmed/36508459 http://dx.doi.org/10.1371/journal.pgen.1010540 Text en © 2022 Jankowiak et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Jankowiak, Martin Obermeyer, Fritz H. Lemieux, Jacob E. Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection |
title | Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection |
title_full | Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection |
title_fullStr | Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection |
title_full_unstemmed | Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection |
title_short | Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection |
title_sort | inferring selection effects in sars-cov-2 with bayesian viral allele selection |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9779722/ https://www.ncbi.nlm.nih.gov/pubmed/36508459 http://dx.doi.org/10.1371/journal.pgen.1010540 |
work_keys_str_mv | AT jankowiakmartin inferringselectioneffectsinsarscov2withbayesianviralalleleselection AT obermeyerfritzh inferringselectioneffectsinsarscov2withbayesianviralalleleselection AT lemieuxjacobe inferringselectioneffectsinsarscov2withbayesianviralalleleselection |