Cargando…
Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19
The subgenus Sarbecovirus includes two human viruses, SARS-CoV and SARS-CoV-2, respectively responsible for the SARS epidemic and COVID-19 pandemic, as well as many bat viruses and two pangolin viruses. Here, the synonymous nucleotide composition (SNC) of Sarbecovirus genomes was analysed by examini...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier B.V.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9200079/ https://www.ncbi.nlm.nih.gov/pubmed/35700806 http://dx.doi.org/10.1016/j.gene.2022.146641 |
_version_ | 1784727987286966272 |
---|---|
author | Hassanin, Alexandre |
author_facet | Hassanin, Alexandre |
author_sort | Hassanin, Alexandre |
collection | PubMed |
description | The subgenus Sarbecovirus includes two human viruses, SARS-CoV and SARS-CoV-2, respectively responsible for the SARS epidemic and COVID-19 pandemic, as well as many bat viruses and two pangolin viruses. Here, the synonymous nucleotide composition (SNC) of Sarbecovirus genomes was analysed by examining third codon-positions, dinucleotides, and degenerate codons. The results show evidence for the eight following groups: (i) SARS-CoV related coronaviruses (SCoVrC including many bat viruses from China), (ii) SARS-CoV-2 related coronaviruses (SCoV2rC; including five bat viruses from Cambodia, Thailand and Yunnan), (iii) pangolin sarbecoviruses, (iv) three bat sarbecoviruses showing evidence of recombination between SCoVrC and SCoV2rC genomes, (v) two highly divergent bat sarbecoviruses from Yunnan, (vi) the bat sarbecovirus from Japan, (vii) the bat sarbecovirus from Bulgaria, and (viii) the bat sarbecovirus from Kenya. All these groups can be diagnosed by specific nucleotide compositional features except the one concerned by recombination between SCoVrC and SCoV2rC. In particular, SCoV2rC genomes have less cytosines and more uracils at third codon-positions than other sarbecoviruses, whereas the genomes of pangolin sarbecoviruses show more adenines at third codon-positions. I suggest that taxonomic differences in the imbalanced nucleotide pools available in host cells during viral replication can explain the eight groups of SNC here detected among Sarbecovirus genomes. A related effect due to hibernating bats and their latitudinal distribution is also discussed. I conclude that the two independent host switches from Rhinolophus bats to pangolins resulted in convergent mutational constraints and that SARS-CoV-2 emerged directly from a horseshoe bat sarbecovirus. |
format | Online Article Text |
id | pubmed-9200079 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92000792022-06-16 Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19 Hassanin, Alexandre Gene Article The subgenus Sarbecovirus includes two human viruses, SARS-CoV and SARS-CoV-2, respectively responsible for the SARS epidemic and COVID-19 pandemic, as well as many bat viruses and two pangolin viruses. Here, the synonymous nucleotide composition (SNC) of Sarbecovirus genomes was analysed by examining third codon-positions, dinucleotides, and degenerate codons. The results show evidence for the eight following groups: (i) SARS-CoV related coronaviruses (SCoVrC including many bat viruses from China), (ii) SARS-CoV-2 related coronaviruses (SCoV2rC; including five bat viruses from Cambodia, Thailand and Yunnan), (iii) pangolin sarbecoviruses, (iv) three bat sarbecoviruses showing evidence of recombination between SCoVrC and SCoV2rC genomes, (v) two highly divergent bat sarbecoviruses from Yunnan, (vi) the bat sarbecovirus from Japan, (vii) the bat sarbecovirus from Bulgaria, and (viii) the bat sarbecovirus from Kenya. All these groups can be diagnosed by specific nucleotide compositional features except the one concerned by recombination between SCoVrC and SCoV2rC. In particular, SCoV2rC genomes have less cytosines and more uracils at third codon-positions than other sarbecoviruses, whereas the genomes of pangolin sarbecoviruses show more adenines at third codon-positions. I suggest that taxonomic differences in the imbalanced nucleotide pools available in host cells during viral replication can explain the eight groups of SNC here detected among Sarbecovirus genomes. A related effect due to hibernating bats and their latitudinal distribution is also discussed. I conclude that the two independent host switches from Rhinolophus bats to pangolins resulted in convergent mutational constraints and that SARS-CoV-2 emerged directly from a horseshoe bat sarbecovirus. Elsevier B.V. 2022-08-15 2022-06-11 /pmc/articles/PMC9200079/ /pubmed/35700806 http://dx.doi.org/10.1016/j.gene.2022.146641 Text en © 2022 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Hassanin, Alexandre Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19 |
title | Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19 |
title_full | Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19 |
title_fullStr | Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19 |
title_full_unstemmed | Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19 |
title_short | Variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of COVID-19 |
title_sort | variation in synonymous nucleotide composition among genomes of sarbecoviruses and consequences for the origin of covid-19 |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9200079/ https://www.ncbi.nlm.nih.gov/pubmed/35700806 http://dx.doi.org/10.1016/j.gene.2022.146641 |
work_keys_str_mv | AT hassaninalexandre variationinsynonymousnucleotidecompositionamonggenomesofsarbecovirusesandconsequencesfortheoriginofcovid19 |