Cargando…

Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19

The subgenus Sarbecovirus includes two human viruses, SARS-CoV and SARS-CoV-2, respectively responsible for the SARS epidemic and COVID-19 pandemic, as well as many bat viruses and two pangolin viruses. Here, the synonymous nucleotide composition (SNC) of Sarbecovirus genomes was analysed by examini...

Descripción completa

Detalles Bibliográficos
Autor principal: Hassanin, Alexandre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9200079/
https://www.ncbi.nlm.nih.gov/pubmed/35700806
http://dx.doi.org/10.1016/j.gene.2022.146641
_version_ 1784727987286966272
author Hassanin, Alexandre
author_facet Hassanin, Alexandre
author_sort Hassanin, Alexandre
collection PubMed
description The subgenus Sarbecovirus includes two human viruses, SARS-CoV and SARS-CoV-2, respectively responsible for the SARS epidemic and COVID-19 pandemic, as well as many bat viruses and two pangolin viruses. Here, the synonymous nucleotide composition (SNC) of Sarbecovirus genomes was analysed by examining third codon-positions, dinucleotides, and degenerate codons. The results show evidence for the eight following groups: (i) SARS-CoV related coronaviruses (SCoVrC including many bat viruses from China), (ii) SARS-CoV-2 related coronaviruses (SCoV2rC; including five bat viruses from Cambodia, Thailand and Yunnan), (iii) pangolin sarbecoviruses, (iv) three bat sarbecoviruses showing evidence of recombination between SCoVrC and SCoV2rC genomes, (v) two highly divergent bat sarbecoviruses from Yunnan, (vi) the bat sarbecovirus from Japan, (vii) the bat sarbecovirus from Bulgaria, and (viii) the bat sarbecovirus from Kenya. All these groups can be diagnosed by specific nucleotide compositional features except the one concerned by recombination between SCoVrC and SCoV2rC. In particular, SCoV2rC genomes have less cytosines and more uracils at third codon-positions than other sarbecoviruses, whereas the genomes of pangolin sarbecoviruses show more adenines at third codon-positions. I suggest that taxonomic differences in the imbalanced nucleotide pools available in host cells during viral replication can explain the eight groups of SNC here detected among Sarbecovirus genomes. A related effect due to hibernating bats and their latitudinal distribution is also discussed. I conclude that the two independent host switches from Rhinolophus bats to pangolins resulted in convergent mutational constraints and that SARS-CoV-2 emerged directly from a horseshoe bat sarbecovirus.
format Online
Article
Text
id pubmed-9200079
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-92000792022-06-16 Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19 Hassanin, Alexandre Gene Article The subgenus Sarbecovirus includes two human viruses, SARS-CoV and SARS-CoV-2, respectively responsible for the SARS epidemic and COVID-19 pandemic, as well as many bat viruses and two pangolin viruses. Here, the synonymous nucleotide composition (SNC) of Sarbecovirus genomes was analysed by examining third codon-positions, dinucleotides, and degenerate codons. The results show evidence for the eight following groups: (i) SARS-CoV related coronaviruses (SCoVrC including many bat viruses from China), (ii) SARS-CoV-2 related coronaviruses (SCoV2rC; including five bat viruses from Cambodia, Thailand and Yunnan), (iii) pangolin sarbecoviruses, (iv) three bat sarbecoviruses showing evidence of recombination between SCoVrC and SCoV2rC genomes, (v) two highly divergent bat sarbecoviruses from Yunnan, (vi) the bat sarbecovirus from Japan, (vii) the bat sarbecovirus from Bulgaria, and (viii) the bat sarbecovirus from Kenya. All these groups can be diagnosed by specific nucleotide compositional features except the one concerned by recombination between SCoVrC and SCoV2rC. In particular, SCoV2rC genomes have less cytosines and more uracils at third codon-positions than other sarbecoviruses, whereas the genomes of pangolin sarbecoviruses show more adenines at third codon-positions. I suggest that taxonomic differences in the imbalanced nucleotide pools available in host cells during viral replication can explain the eight groups of SNC here detected among Sarbecovirus genomes. A related effect due to hibernating bats and their latitudinal distribution is also discussed. I conclude that the two independent host switches from Rhinolophus bats to pangolins resulted in convergent mutational constraints and that SARS-CoV-2 emerged directly from a horseshoe bat sarbecovirus. Elsevier B.V. 2022-08-15 2022-06-11 /pmc/articles/PMC9200079/ /pubmed/35700806 http://dx.doi.org/10.1016/j.gene.2022.146641 Text en © 2022 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Hassanin, Alexandre
Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19
title Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19
title_full Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19
title_fullStr Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19
title_full_unstemmed Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19
title_short Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19
title_sort variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of covid-19
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9200079/
https://www.ncbi.nlm.nih.gov/pubmed/35700806
http://dx.doi.org/10.1016/j.gene.2022.146641
work_keys_str_mv AT hassaninalexandre variationinsynonymousnucleotidecompositionamonggenomesofsarbecovirusesandconsequencesfortheoriginofcovid19