Cargando…

Mutation rate, selection, and epistasis inferred from RNA virus haplotypes via neural posterior estimation

RNA viruses are particularly notorious for their high levels of genetic diversity, which is generated through the forces of mutation and natural selection. However, disentangling these two forces is a considerable challenge, and this may lead to widely divergent estimates of viral mutation rates, as...

Descripción completa

Detalles Bibliográficos
Autores principales: Caspi, Itamar, Meir, Moran, Ben Nun, Nadav, Abu Rass, Reem, Yakhini, Uri, Stern, Adi, Ram, Yoav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256221/
https://www.ncbi.nlm.nih.gov/pubmed/37305706
http://dx.doi.org/10.1093/ve/vead033
_version_ 1785057056708886528
author Caspi, Itamar
Meir, Moran
Ben Nun, Nadav
Abu Rass, Reem
Yakhini, Uri
Stern, Adi
Ram, Yoav
author_facet Caspi, Itamar
Meir, Moran
Ben Nun, Nadav
Abu Rass, Reem
Yakhini, Uri
Stern, Adi
Ram, Yoav
author_sort Caspi, Itamar
collection PubMed
description RNA viruses are particularly notorious for their high levels of genetic diversity, which is generated through the forces of mutation and natural selection. However, disentangling these two forces is a considerable challenge, and this may lead to widely divergent estimates of viral mutation rates, as well as difficulties in inferring the fitness effects of mutations. Here, we develop, test, and apply an approach aimed at inferring the mutation rate and key parameters that govern natural selection, from haplotype sequences covering full-length genomes of an evolving virus population. Our approach employs neural posterior estimation, a computational technique that applies simulation-based inference with neural networks to jointly infer multiple model parameters. We first tested our approach on synthetic data simulated using different mutation rates and selection parameters while accounting for sequencing errors. Reassuringly, the inferred parameter estimates were accurate and unbiased. We then applied our approach to haplotype sequencing data from a serial passaging experiment with the MS2 bacteriophage, a virus that parasites Escherichia coli. We estimated that the mutation rate of this phage is around 0.2 mutations per genome per replication cycle (95% highest density interval: 0.051–0.56). We validated this finding with two different approaches based on single-locus models that gave similar estimates but with much broader posterior distributions. Furthermore, we found evidence for reciprocal sign epistasis between four strongly beneficial mutations that all reside in an RNA stem loop that controls the expression of the viral lysis protein, responsible for lysing host cells and viral egress. We surmise that there is a fine balance between over- and underexpression of lysis that leads to this pattern of epistasis. To recap, we have developed an approach for joint inference of the mutation rate and selection parameters from full haplotype data with sequencing errors and used it to reveal features governing MS2 evolution.
format Online
Article
Text
id pubmed-10256221
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102562212023-06-10 Mutation rate, selection, and epistasis inferred from RNA virus haplotypes via neural posterior estimation Caspi, Itamar Meir, Moran Ben Nun, Nadav Abu Rass, Reem Yakhini, Uri Stern, Adi Ram, Yoav Virus Evol Research Article RNA viruses are particularly notorious for their high levels of genetic diversity, which is generated through the forces of mutation and natural selection. However, disentangling these two forces is a considerable challenge, and this may lead to widely divergent estimates of viral mutation rates, as well as difficulties in inferring the fitness effects of mutations. Here, we develop, test, and apply an approach aimed at inferring the mutation rate and key parameters that govern natural selection, from haplotype sequences covering full-length genomes of an evolving virus population. Our approach employs neural posterior estimation, a computational technique that applies simulation-based inference with neural networks to jointly infer multiple model parameters. We first tested our approach on synthetic data simulated using different mutation rates and selection parameters while accounting for sequencing errors. Reassuringly, the inferred parameter estimates were accurate and unbiased. We then applied our approach to haplotype sequencing data from a serial passaging experiment with the MS2 bacteriophage, a virus that parasites Escherichia coli. We estimated that the mutation rate of this phage is around 0.2 mutations per genome per replication cycle (95% highest density interval: 0.051–0.56). We validated this finding with two different approaches based on single-locus models that gave similar estimates but with much broader posterior distributions. Furthermore, we found evidence for reciprocal sign epistasis between four strongly beneficial mutations that all reside in an RNA stem loop that controls the expression of the viral lysis protein, responsible for lysing host cells and viral egress. We surmise that there is a fine balance between over- and underexpression of lysis that leads to this pattern of epistasis. To recap, we have developed an approach for joint inference of the mutation rate and selection parameters from full haplotype data with sequencing errors and used it to reveal features governing MS2 evolution. Oxford University Press 2023-05-20 /pmc/articles/PMC10256221/ /pubmed/37305706 http://dx.doi.org/10.1093/ve/vead033 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Caspi, Itamar
Meir, Moran
Ben Nun, Nadav
Abu Rass, Reem
Yakhini, Uri
Stern, Adi
Ram, Yoav
Mutation rate, selection, and epistasis inferred from RNA virus haplotypes via neural posterior estimation
title Mutation rate, selection, and epistasis inferred from RNA virus haplotypes via neural posterior estimation
title_full Mutation rate, selection, and epistasis inferred from RNA virus haplotypes via neural posterior estimation
title_fullStr Mutation rate, selection, and epistasis inferred from RNA virus haplotypes via neural posterior estimation
title_full_unstemmed Mutation rate, selection, and epistasis inferred from RNA virus haplotypes via neural posterior estimation
title_short Mutation rate, selection, and epistasis inferred from RNA virus haplotypes via neural posterior estimation
title_sort mutation rate, selection, and epistasis inferred from rna virus haplotypes via neural posterior estimation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256221/
https://www.ncbi.nlm.nih.gov/pubmed/37305706
http://dx.doi.org/10.1093/ve/vead033
work_keys_str_mv AT caspiitamar mutationrateselectionandepistasisinferredfromrnavirushaplotypesvianeuralposteriorestimation
AT meirmoran mutationrateselectionandepistasisinferredfromrnavirushaplotypesvianeuralposteriorestimation
AT bennunnadav mutationrateselectionandepistasisinferredfromrnavirushaplotypesvianeuralposteriorestimation
AT aburassreem mutationrateselectionandepistasisinferredfromrnavirushaplotypesvianeuralposteriorestimation
AT yakhiniuri mutationrateselectionandepistasisinferredfromrnavirushaplotypesvianeuralposteriorestimation
AT sternadi mutationrateselectionandepistasisinferredfromrnavirushaplotypesvianeuralposteriorestimation
AT ramyoav mutationrateselectionandepistasisinferredfromrnavirushaplotypesvianeuralposteriorestimation