Cargando…

Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools

Quantifying SARS-like coronavirus (SL-CoV) evolution is critical to understanding the origins of SARS-CoV-2 and the molecular processes that could underlie future epidemic viruses. While genomic analyses suggest recombination was a factor in the emergence of SARS-CoV-2, few studies have quantified r...

Descripción completa

Detalles Bibliográficos
Autores principales: Preska Steinberg, Asher, Silander, Olin K., Kussell, Edo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945976/
https://www.ncbi.nlm.nih.gov/pubmed/36693089
http://dx.doi.org/10.1073/pnas.2206945119
_version_ 1784892239541960704
author Preska Steinberg, Asher
Silander, Olin K.
Kussell, Edo
author_facet Preska Steinberg, Asher
Silander, Olin K.
Kussell, Edo
author_sort Preska Steinberg, Asher
collection PubMed
description Quantifying SARS-like coronavirus (SL-CoV) evolution is critical to understanding the origins of SARS-CoV-2 and the molecular processes that could underlie future epidemic viruses. While genomic analyses suggest recombination was a factor in the emergence of SARS-CoV-2, few studies have quantified recombination rates among SL-CoVs. Here, we infer recombination rates of SL-CoVs from correlated substitutions in sequencing data using a coalescent model with recombination. Our computationally-efficient, non-phylogenetic method infers recombination parameters of both sampled sequences and the unsampled gene pools with which they recombine. We apply this approach to infer recombination parameters for a range of positive-sense RNA viruses. We then analyze a set of 191 SL-CoV sequences (including SARS-CoV-2) and find that ORF1ab and S genes frequently undergo recombination. We identify which SL-CoV sequence clusters have recombined with shared gene pools, and show that these pools have distinct structures and high recombination rates, with multiple recombination events occurring per synonymous substitution. We find that individual genes have recombined with different viral reservoirs. By decoupling contributions from mutation and recombination, we recover the phylogeny of non-recombined portions for many of these SL-CoVs, including the position of SARS-CoV-2 in this clonal phylogeny. Lastly, by analyzing >400,000 SARS-CoV-2 whole genome sequences, we show current diversity levels are insufficient to infer the within-population recombination rate of the virus since the pandemic began. Our work offers new methods for inferring recombination rates in RNA viruses with implications for understanding recombination in SARS-CoV-2 evolution and the structure of clonal relationships and gene pools shaping its origins.
format Online
Article
Text
id pubmed-9945976
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-99459762023-02-23 Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools Preska Steinberg, Asher Silander, Olin K. Kussell, Edo Proc Natl Acad Sci U S A Biological Sciences Quantifying SARS-like coronavirus (SL-CoV) evolution is critical to understanding the origins of SARS-CoV-2 and the molecular processes that could underlie future epidemic viruses. While genomic analyses suggest recombination was a factor in the emergence of SARS-CoV-2, few studies have quantified recombination rates among SL-CoVs. Here, we infer recombination rates of SL-CoVs from correlated substitutions in sequencing data using a coalescent model with recombination. Our computationally-efficient, non-phylogenetic method infers recombination parameters of both sampled sequences and the unsampled gene pools with which they recombine. We apply this approach to infer recombination parameters for a range of positive-sense RNA viruses. We then analyze a set of 191 SL-CoV sequences (including SARS-CoV-2) and find that ORF1ab and S genes frequently undergo recombination. We identify which SL-CoV sequence clusters have recombined with shared gene pools, and show that these pools have distinct structures and high recombination rates, with multiple recombination events occurring per synonymous substitution. We find that individual genes have recombined with different viral reservoirs. By decoupling contributions from mutation and recombination, we recover the phylogeny of non-recombined portions for many of these SL-CoVs, including the position of SARS-CoV-2 in this clonal phylogeny. Lastly, by analyzing >400,000 SARS-CoV-2 whole genome sequences, we show current diversity levels are insufficient to infer the within-population recombination rate of the virus since the pandemic began. Our work offers new methods for inferring recombination rates in RNA viruses with implications for understanding recombination in SARS-CoV-2 evolution and the structure of clonal relationships and gene pools shaping its origins. National Academy of Sciences 2023-01-24 2023-01-31 /pmc/articles/PMC9945976/ /pubmed/36693089 http://dx.doi.org/10.1073/pnas.2206945119 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Preska Steinberg, Asher
Silander, Olin K.
Kussell, Edo
Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools
title Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools
title_full Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools
title_fullStr Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools
title_full_unstemmed Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools
title_short Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools
title_sort correlated substitutions reveal sars-like coronaviruses recombine frequently with a diverse set of structured gene pools
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945976/
https://www.ncbi.nlm.nih.gov/pubmed/36693089
http://dx.doi.org/10.1073/pnas.2206945119
work_keys_str_mv AT preskasteinbergasher correlatedsubstitutionsrevealsarslikecoronavirusesrecombinefrequentlywithadiversesetofstructuredgenepools
AT silanderolink correlatedsubstitutionsrevealsarslikecoronavirusesrecombinefrequentlywithadiversesetofstructuredgenepools
AT kusselledo correlatedsubstitutionsrevealsarslikecoronavirusesrecombinefrequentlywithadiversesetofstructuredgenepools