Cargando…

Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades

SARS-CoV-2 is currently causing major havoc worldwide with its efficient transmission and propagation. To track the emergence as well as the persistence of mutations during the early stage of the pandemic, a comparative analysis of SARS-CoV-2 whole proteome sequences has been performed by considerin...

Descripción completa

Detalles Bibliográficos
Autores principales: Patro, L Ponoop Prasad, Sathyaseelan, Chakkarai, Uttamrao, Patil Pranita, Rathinavelan, Thenmalarchelvi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8233849/
https://www.ncbi.nlm.nih.gov/pubmed/34147651
http://dx.doi.org/10.1016/j.meegid.2021.104973
_version_ 1783713945318588416
author Patro, L Ponoop Prasad
Sathyaseelan, Chakkarai
Uttamrao, Patil Pranita
Rathinavelan, Thenmalarchelvi
author_facet Patro, L Ponoop Prasad
Sathyaseelan, Chakkarai
Uttamrao, Patil Pranita
Rathinavelan, Thenmalarchelvi
author_sort Patro, L Ponoop Prasad
collection PubMed
description SARS-CoV-2 is currently causing major havoc worldwide with its efficient transmission and propagation. To track the emergence as well as the persistence of mutations during the early stage of the pandemic, a comparative analysis of SARS-CoV-2 whole proteome sequences has been performed by considering manually curated 31,389 whole genome sequences from 84 countries. Among the 7 highly recurring (percentage frequency≥10%) mutations (Nsp2:T85I, Nsp6:L37F, Nsp12:P323L, Spike:D614G, ORF3a:Q57H, N protein:R203K and N protein:G204R), N protein:R203K and N protein: G204R are co-occurring (dependent) mutations. Nsp12:P323L and Spike:D614G often appear simultaneously. The highly recurring Spike:D614G, Nsp12:P323L and Nsp6:L37F as well as moderately recurring (percentage frequency between ≥1 and <10%) ORF3a:G251V and ORF8:L84S mutations have led to4 major clades in addition to a clade that lacks high recurring mutations. Further, the occurrence of ORF3a:Q57H&Nsp2:T85I, ORF3a:Q57H and N protein:R203K&G204R along with Nsp12:P323L&Spike:D614G has led to 3 additional sub-clades. Similarly, occurrence of Nsp6:L37F and ORF3a:G251V together has led to the emergence of a sub-clade. Nonetheless, ORF8:L84S does not occur along with ORF3a:G251V or Nsp6:L37F. Intriguingly, ORF3a:G251V and ORF8:L84S are found to occur independent of Nsp12:P323L and Spike:D614G mutations. These clades have evolved during the early stage of the pandemic and have disseminated across several countries. Further, Nsp10 is found to be highly resistant to mutations, thus, it can be exploited for drug/vaccine development and the corresponding gene sequence can be used for the diagnosis. Concisely, the study reports the SARS-CoV-2 antigens diversity across the globe during the early stage of the pandemic and facilitates the understanding of viral evolution.
format Online
Article
Text
id pubmed-8233849
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-82338492021-06-28 Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades Patro, L Ponoop Prasad Sathyaseelan, Chakkarai Uttamrao, Patil Pranita Rathinavelan, Thenmalarchelvi Infect Genet Evol Research Paper SARS-CoV-2 is currently causing major havoc worldwide with its efficient transmission and propagation. To track the emergence as well as the persistence of mutations during the early stage of the pandemic, a comparative analysis of SARS-CoV-2 whole proteome sequences has been performed by considering manually curated 31,389 whole genome sequences from 84 countries. Among the 7 highly recurring (percentage frequency≥10%) mutations (Nsp2:T85I, Nsp6:L37F, Nsp12:P323L, Spike:D614G, ORF3a:Q57H, N protein:R203K and N protein:G204R), N protein:R203K and N protein: G204R are co-occurring (dependent) mutations. Nsp12:P323L and Spike:D614G often appear simultaneously. The highly recurring Spike:D614G, Nsp12:P323L and Nsp6:L37F as well as moderately recurring (percentage frequency between ≥1 and <10%) ORF3a:G251V and ORF8:L84S mutations have led to4 major clades in addition to a clade that lacks high recurring mutations. Further, the occurrence of ORF3a:Q57H&Nsp2:T85I, ORF3a:Q57H and N protein:R203K&G204R along with Nsp12:P323L&Spike:D614G has led to 3 additional sub-clades. Similarly, occurrence of Nsp6:L37F and ORF3a:G251V together has led to the emergence of a sub-clade. Nonetheless, ORF8:L84S does not occur along with ORF3a:G251V or Nsp6:L37F. Intriguingly, ORF3a:G251V and ORF8:L84S are found to occur independent of Nsp12:P323L and Spike:D614G mutations. These clades have evolved during the early stage of the pandemic and have disseminated across several countries. Further, Nsp10 is found to be highly resistant to mutations, thus, it can be exploited for drug/vaccine development and the corresponding gene sequence can be used for the diagnosis. Concisely, the study reports the SARS-CoV-2 antigens diversity across the globe during the early stage of the pandemic and facilitates the understanding of viral evolution. Elsevier B.V. 2021-09 2021-06-18 /pmc/articles/PMC8233849/ /pubmed/34147651 http://dx.doi.org/10.1016/j.meegid.2021.104973 Text en © 2021 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Research Paper
Patro, L Ponoop Prasad
Sathyaseelan, Chakkarai
Uttamrao, Patil Pranita
Rathinavelan, Thenmalarchelvi
Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades
title Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades
title_full Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades
title_fullStr Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades
title_full_unstemmed Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades
title_short Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades
title_sort global variation in sars-cov-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant sars-cov-2 clades
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8233849/
https://www.ncbi.nlm.nih.gov/pubmed/34147651
http://dx.doi.org/10.1016/j.meegid.2021.104973
work_keys_str_mv AT patrolponoopprasad globalvariationinsarscov2proteomeanditsimplicationinprelockdownemergenceanddisseminationof5dominantsarscov2clades
AT sathyaseelanchakkarai globalvariationinsarscov2proteomeanditsimplicationinprelockdownemergenceanddisseminationof5dominantsarscov2clades
AT uttamraopatilpranita globalvariationinsarscov2proteomeanditsimplicationinprelockdownemergenceanddisseminationof5dominantsarscov2clades
AT rathinavelanthenmalarchelvi globalvariationinsarscov2proteomeanditsimplicationinprelockdownemergenceanddisseminationof5dominantsarscov2clades