Cargando…

Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection

The global effort to sequence millions of SARS-CoV-2 genomes has provided an unprecedented view of viral evolution. Characterizing how selection acts on SARS-CoV-2 is critical to developing effective, long-lasting vaccines and other treatments, but the scale and complexity of genomic surveillance da...

Descripción completa

Detalles Bibliográficos
Autores principales: Jankowiak, Martin, Obermeyer, Fritz H., Lemieux, Jacob E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9779722/
https://www.ncbi.nlm.nih.gov/pubmed/36508459
http://dx.doi.org/10.1371/journal.pgen.1010540
_version_ 1784856679403225088
author Jankowiak, Martin
Obermeyer, Fritz H.
Lemieux, Jacob E.
author_facet Jankowiak, Martin
Obermeyer, Fritz H.
Lemieux, Jacob E.
author_sort Jankowiak, Martin
collection PubMed
description The global effort to sequence millions of SARS-CoV-2 genomes has provided an unprecedented view of viral evolution. Characterizing how selection acts on SARS-CoV-2 is critical to developing effective, long-lasting vaccines and other treatments, but the scale and complexity of genomic surveillance data make rigorous analysis challenging. To meet this challenge, we develop Bayesian Viral Allele Selection (BVAS), a principled and scalable probabilistic method for inferring the genetic determinants of differential viral fitness and the relative growth rates of viral lineages, including newly emergent lineages. After demonstrating the accuracy and efficacy of our method through simulation, we apply BVAS to 6.9 million SARS-CoV-2 genomes. We identify numerous mutations that increase fitness, including previously identified mutations in the SARS-CoV-2 Spike and Nucleocapsid proteins, as well as mutations in non-structural proteins whose contribution to fitness is less well characterized. In addition, we extend our baseline model to identify mutations whose fitness exhibits strong dependence on vaccination status as well as pairwise interaction effects, i.e. epistasis. Strikingly, both these analyses point to the pivotal role played by the N501 residue in the Spike protein. Our method, which couples Bayesian variable selection with a diffusion approximation in allele frequency space, lays a foundation for identifying fitness-associated mutations under the assumption that most alleles are neutral.
format Online
Article
Text
id pubmed-9779722
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-97797222022-12-23 Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection Jankowiak, Martin Obermeyer, Fritz H. Lemieux, Jacob E. PLoS Genet Research Article The global effort to sequence millions of SARS-CoV-2 genomes has provided an unprecedented view of viral evolution. Characterizing how selection acts on SARS-CoV-2 is critical to developing effective, long-lasting vaccines and other treatments, but the scale and complexity of genomic surveillance data make rigorous analysis challenging. To meet this challenge, we develop Bayesian Viral Allele Selection (BVAS), a principled and scalable probabilistic method for inferring the genetic determinants of differential viral fitness and the relative growth rates of viral lineages, including newly emergent lineages. After demonstrating the accuracy and efficacy of our method through simulation, we apply BVAS to 6.9 million SARS-CoV-2 genomes. We identify numerous mutations that increase fitness, including previously identified mutations in the SARS-CoV-2 Spike and Nucleocapsid proteins, as well as mutations in non-structural proteins whose contribution to fitness is less well characterized. In addition, we extend our baseline model to identify mutations whose fitness exhibits strong dependence on vaccination status as well as pairwise interaction effects, i.e. epistasis. Strikingly, both these analyses point to the pivotal role played by the N501 residue in the Spike protein. Our method, which couples Bayesian variable selection with a diffusion approximation in allele frequency space, lays a foundation for identifying fitness-associated mutations under the assumption that most alleles are neutral. Public Library of Science 2022-12-12 /pmc/articles/PMC9779722/ /pubmed/36508459 http://dx.doi.org/10.1371/journal.pgen.1010540 Text en © 2022 Jankowiak et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Jankowiak, Martin
Obermeyer, Fritz H.
Lemieux, Jacob E.
Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection
title Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection
title_full Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection
title_fullStr Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection
title_full_unstemmed Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection
title_short Inferring selection effects in SARS-CoV-2 with Bayesian Viral Allele Selection
title_sort inferring selection effects in sars-cov-2 with bayesian viral allele selection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9779722/
https://www.ncbi.nlm.nih.gov/pubmed/36508459
http://dx.doi.org/10.1371/journal.pgen.1010540
work_keys_str_mv AT jankowiakmartin inferringselectioneffectsinsarscov2withbayesianviralalleleselection
AT obermeyerfritzh inferringselectioneffectsinsarscov2withbayesianviralalleleselection
AT lemieuxjacobe inferringselectioneffectsinsarscov2withbayesianviralalleleselection