Cargando…

Genome analysis of SARS-CoV-2 isolates from a population reveals the rapid selective sweep of a haplotype carrying many pre-existing and new mutations

To understand the mechanism underlying the evolution of SARS-CoV-2 in a population, we sequenced 92 viral genomes from Assam, India. Analysis of these and database sequences revealed a complete selective sweep of a haplotype in Assam carrying 13 pre-existing variants, including a high leap in freque...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhattacharjee, Maloyjo Joyraj, Bhattacharya, Anupam, Kashyap, Bhaswati, Taw, Manash Jyoti, Li, Wen-Hsiung, Mukherjee, Ashis K., Khan, Mojibur Rohman
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10474745/
https://www.ncbi.nlm.nih.gov/pubmed/37658381
http://dx.doi.org/10.1186/s12985-023-02139-3
_version_ 1785100568762515456
author Bhattacharjee, Maloyjo Joyraj
Bhattacharya, Anupam
Kashyap, Bhaswati
Taw, Manash Jyoti
Li, Wen-Hsiung
Mukherjee, Ashis K.
Khan, Mojibur Rohman
author_facet Bhattacharjee, Maloyjo Joyraj
Bhattacharya, Anupam
Kashyap, Bhaswati
Taw, Manash Jyoti
Li, Wen-Hsiung
Mukherjee, Ashis K.
Khan, Mojibur Rohman
author_sort Bhattacharjee, Maloyjo Joyraj
collection PubMed
description To understand the mechanism underlying the evolution of SARS-CoV-2 in a population, we sequenced 92 viral genomes from Assam, India. Analysis of these and database sequences revealed a complete selective sweep of a haplotype in Assam carrying 13 pre-existing variants, including a high leap in frequency of a variant on ORF8, which is involved in immune evasion. A comparative study between sequences of same lineage and similar time frames in and outside Assam showed that 10 of the 13 pre-existing variants had a frequency ranging from 96 to 99%, and the remaining 3 had a low frequency outside Assam. Using a phylogenetic approach to infer sequential occurrences of variants we found that the variant Phe120del on ORF8, which had a low frequency (1.75%) outside Assam, is at the base of the phylogenetic tree of variants and became totally fixed (100%) in Assam population. Based on this observation, we inferred that the variant on ORF8 had a selective advantage, so it carried the haplotype to reach the100% frequency. The haplotype also carried 32 pre-existing variants at a frequency from 1.00 to 80.00% outside Assam. Those of these variants that are more closely linked to the S-protein locus, which often carries advantageous mutations and is tightly linked to the ORF8 locus, retained higher frequencies, while the less tightly linked variants showed lower frequencies, likely due to recombination among co- circulating variants in Assam. The ratios of non-synonymous substitutions to synonymous substitutions suggested that some genes such as those coding for the S-protein and non-structural proteins underwent positive selection while others were subject to purifying selection during their evolution in Assam. Furthermore, we observed negative correlation of the Ct value of qRT-PCR of the patients with abundant ORF6 transcripts, suggesting that ORF6 can be used as a marker for estimating viral titer. In conclusion, our in-depth analysis of SARS-CoV-2 genomes in a regional population reveals the mechanism and dynamics of viral evolution. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12985-023-02139-3.
format Online
Article
Text
id pubmed-10474745
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-104747452023-09-03 Genome analysis of SARS-CoV-2 isolates from a population reveals the rapid selective sweep of a haplotype carrying many pre-existing and new mutations Bhattacharjee, Maloyjo Joyraj Bhattacharya, Anupam Kashyap, Bhaswati Taw, Manash Jyoti Li, Wen-Hsiung Mukherjee, Ashis K. Khan, Mojibur Rohman Virol J Research To understand the mechanism underlying the evolution of SARS-CoV-2 in a population, we sequenced 92 viral genomes from Assam, India. Analysis of these and database sequences revealed a complete selective sweep of a haplotype in Assam carrying 13 pre-existing variants, including a high leap in frequency of a variant on ORF8, which is involved in immune evasion. A comparative study between sequences of same lineage and similar time frames in and outside Assam showed that 10 of the 13 pre-existing variants had a frequency ranging from 96 to 99%, and the remaining 3 had a low frequency outside Assam. Using a phylogenetic approach to infer sequential occurrences of variants we found that the variant Phe120del on ORF8, which had a low frequency (1.75%) outside Assam, is at the base of the phylogenetic tree of variants and became totally fixed (100%) in Assam population. Based on this observation, we inferred that the variant on ORF8 had a selective advantage, so it carried the haplotype to reach the100% frequency. The haplotype also carried 32 pre-existing variants at a frequency from 1.00 to 80.00% outside Assam. Those of these variants that are more closely linked to the S-protein locus, which often carries advantageous mutations and is tightly linked to the ORF8 locus, retained higher frequencies, while the less tightly linked variants showed lower frequencies, likely due to recombination among co- circulating variants in Assam. The ratios of non-synonymous substitutions to synonymous substitutions suggested that some genes such as those coding for the S-protein and non-structural proteins underwent positive selection while others were subject to purifying selection during their evolution in Assam. Furthermore, we observed negative correlation of the Ct value of qRT-PCR of the patients with abundant ORF6 transcripts, suggesting that ORF6 can be used as a marker for estimating viral titer. In conclusion, our in-depth analysis of SARS-CoV-2 genomes in a regional population reveals the mechanism and dynamics of viral evolution. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12985-023-02139-3. BioMed Central 2023-09-01 /pmc/articles/PMC10474745/ /pubmed/37658381 http://dx.doi.org/10.1186/s12985-023-02139-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Bhattacharjee, Maloyjo Joyraj
Bhattacharya, Anupam
Kashyap, Bhaswati
Taw, Manash Jyoti
Li, Wen-Hsiung
Mukherjee, Ashis K.
Khan, Mojibur Rohman
Genome analysis of SARS-CoV-2 isolates from a population reveals the rapid selective sweep of a haplotype carrying many pre-existing and new mutations
title Genome analysis of SARS-CoV-2 isolates from a population reveals the rapid selective sweep of a haplotype carrying many pre-existing and new mutations
title_full Genome analysis of SARS-CoV-2 isolates from a population reveals the rapid selective sweep of a haplotype carrying many pre-existing and new mutations
title_fullStr Genome analysis of SARS-CoV-2 isolates from a population reveals the rapid selective sweep of a haplotype carrying many pre-existing and new mutations
title_full_unstemmed Genome analysis of SARS-CoV-2 isolates from a population reveals the rapid selective sweep of a haplotype carrying many pre-existing and new mutations
title_short Genome analysis of SARS-CoV-2 isolates from a population reveals the rapid selective sweep of a haplotype carrying many pre-existing and new mutations
title_sort genome analysis of sars-cov-2 isolates from a population reveals the rapid selective sweep of a haplotype carrying many pre-existing and new mutations
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10474745/
https://www.ncbi.nlm.nih.gov/pubmed/37658381
http://dx.doi.org/10.1186/s12985-023-02139-3
work_keys_str_mv AT bhattacharjeemaloyjojoyraj genomeanalysisofsarscov2isolatesfromapopulationrevealstherapidselectivesweepofahaplotypecarryingmanypreexistingandnewmutations
AT bhattacharyaanupam genomeanalysisofsarscov2isolatesfromapopulationrevealstherapidselectivesweepofahaplotypecarryingmanypreexistingandnewmutations
AT kashyapbhaswati genomeanalysisofsarscov2isolatesfromapopulationrevealstherapidselectivesweepofahaplotypecarryingmanypreexistingandnewmutations
AT tawmanashjyoti genomeanalysisofsarscov2isolatesfromapopulationrevealstherapidselectivesweepofahaplotypecarryingmanypreexistingandnewmutations
AT liwenhsiung genomeanalysisofsarscov2isolatesfromapopulationrevealstherapidselectivesweepofahaplotypecarryingmanypreexistingandnewmutations
AT mukherjeeashisk genomeanalysisofsarscov2isolatesfromapopulationrevealstherapidselectivesweepofahaplotypecarryingmanypreexistingandnewmutations
AT khanmojiburrohman genomeanalysisofsarscov2isolatesfromapopulationrevealstherapidselectivesweepofahaplotypecarryingmanypreexistingandnewmutations