Cargando…

Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity

Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), a novel evolutionary divergent RNA virus, is responsible for the present devastating COVID-19 pandemic. To explore the genomic signatures, we comprehensively analyzed 2,492 complete and/or near-complete genome sequences of SARS-CoV-2 stra...

Descripción completa

Detalles Bibliográficos
Autores principales: Islam, M. Rafiul, Hoque, M. Nazmul, Rahman, M. Shaminur, Alam, A. S. M. Rubayet Ul, Akther, Masuda, Puspo, J. Akter, Akter, Salma, Sultana, Munawar, Crandall, Keith A., Hossain, M. Anwar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7438523/
https://www.ncbi.nlm.nih.gov/pubmed/32814791
http://dx.doi.org/10.1038/s41598-020-70812-6
_version_ 1783572807896006656
author Islam, M. Rafiul
Hoque, M. Nazmul
Rahman, M. Shaminur
Alam, A. S. M. Rubayet Ul
Akther, Masuda
Puspo, J. Akter
Akter, Salma
Sultana, Munawar
Crandall, Keith A.
Hossain, M. Anwar
author_facet Islam, M. Rafiul
Hoque, M. Nazmul
Rahman, M. Shaminur
Alam, A. S. M. Rubayet Ul
Akther, Masuda
Puspo, J. Akter
Akter, Salma
Sultana, Munawar
Crandall, Keith A.
Hossain, M. Anwar
author_sort Islam, M. Rafiul
collection PubMed
description Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), a novel evolutionary divergent RNA virus, is responsible for the present devastating COVID-19 pandemic. To explore the genomic signatures, we comprehensively analyzed 2,492 complete and/or near-complete genome sequences of SARS-CoV-2 strains reported from across the globe to the GISAID database up to 30 March 2020. Genome-wide annotations revealed 1,516 nucleotide-level variations at different positions throughout the entire genome of SARS-CoV-2. Moreover, nucleotide (nt) deletion analysis found twelve deletion sites throughout the genome other than previously reported deletions at coding sequence of the ORF8 (open reading frame), spike, and ORF7a proteins, specifically in polyprotein ORF1ab (n = 9), ORF10 (n = 1), and 3´-UTR (n = 2). Evidence from the systematic gene-level mutational and protein profile analyses revealed a large number of amino acid (aa) substitutions (n = 744), demonstrating the viral proteins heterogeneous. Notably, residues of receptor-binding domain (RBD) showing crucial interactions with angiotensin-converting enzyme 2 (ACE2) and cross-reacting neutralizing antibody were found to be conserved among the analyzed virus strains, except for replacement of lysine with arginine at 378th position of the cryptic epitope of a Shanghai isolate, hCoV-19/Shanghai/SH0007/2020 (EPI_ISL_416320). Furthermore, our results of the preliminary epidemiological data on SARS-CoV-2 infections revealed that frequency of aa mutations were relatively higher in the SARS-CoV-2 genome sequences of Europe (43.07%) followed by Asia (38.09%), and North America (29.64%) while case fatality rates remained higher in the European temperate countries, such as Italy, Spain, Netherlands, France, England and Belgium. Thus, the present method of genome annotation employed at this early pandemic stage could be a promising tool for monitoring and tracking the continuously evolving pandemic situation, the associated genetic variants, and their implications for the development of effective control and prophylaxis strategies.
format Online
Article
Text
id pubmed-7438523
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-74385232020-08-21 Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity Islam, M. Rafiul Hoque, M. Nazmul Rahman, M. Shaminur Alam, A. S. M. Rubayet Ul Akther, Masuda Puspo, J. Akter Akter, Salma Sultana, Munawar Crandall, Keith A. Hossain, M. Anwar Sci Rep Article Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), a novel evolutionary divergent RNA virus, is responsible for the present devastating COVID-19 pandemic. To explore the genomic signatures, we comprehensively analyzed 2,492 complete and/or near-complete genome sequences of SARS-CoV-2 strains reported from across the globe to the GISAID database up to 30 March 2020. Genome-wide annotations revealed 1,516 nucleotide-level variations at different positions throughout the entire genome of SARS-CoV-2. Moreover, nucleotide (nt) deletion analysis found twelve deletion sites throughout the genome other than previously reported deletions at coding sequence of the ORF8 (open reading frame), spike, and ORF7a proteins, specifically in polyprotein ORF1ab (n = 9), ORF10 (n = 1), and 3´-UTR (n = 2). Evidence from the systematic gene-level mutational and protein profile analyses revealed a large number of amino acid (aa) substitutions (n = 744), demonstrating the viral proteins heterogeneous. Notably, residues of receptor-binding domain (RBD) showing crucial interactions with angiotensin-converting enzyme 2 (ACE2) and cross-reacting neutralizing antibody were found to be conserved among the analyzed virus strains, except for replacement of lysine with arginine at 378th position of the cryptic epitope of a Shanghai isolate, hCoV-19/Shanghai/SH0007/2020 (EPI_ISL_416320). Furthermore, our results of the preliminary epidemiological data on SARS-CoV-2 infections revealed that frequency of aa mutations were relatively higher in the SARS-CoV-2 genome sequences of Europe (43.07%) followed by Asia (38.09%), and North America (29.64%) while case fatality rates remained higher in the European temperate countries, such as Italy, Spain, Netherlands, France, England and Belgium. Thus, the present method of genome annotation employed at this early pandemic stage could be a promising tool for monitoring and tracking the continuously evolving pandemic situation, the associated genetic variants, and their implications for the development of effective control and prophylaxis strategies. Nature Publishing Group UK 2020-08-19 /pmc/articles/PMC7438523/ /pubmed/32814791 http://dx.doi.org/10.1038/s41598-020-70812-6 Text en © The Author(s) 2020, corrected publication 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Islam, M. Rafiul
Hoque, M. Nazmul
Rahman, M. Shaminur
Alam, A. S. M. Rubayet Ul
Akther, Masuda
Puspo, J. Akter
Akter, Salma
Sultana, Munawar
Crandall, Keith A.
Hossain, M. Anwar
Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity
title Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity
title_full Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity
title_fullStr Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity
title_full_unstemmed Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity
title_short Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity
title_sort genome-wide analysis of sars-cov-2 virus strains circulating worldwide implicates heterogeneity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7438523/
https://www.ncbi.nlm.nih.gov/pubmed/32814791
http://dx.doi.org/10.1038/s41598-020-70812-6
work_keys_str_mv AT islammrafiul genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity
AT hoquemnazmul genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity
AT rahmanmshaminur genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity
AT alamasmrubayetul genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity
AT akthermasuda genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity
AT puspojakter genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity
AT aktersalma genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity
AT sultanamunawar genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity
AT crandallkeitha genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity
AT hossainmanwar genomewideanalysisofsarscov2virusstrainscirculatingworldwideimplicatesheterogeneity