Cargando…
Analysis of the mutation dynamics of SARS-CoV-2 genome in the samples from Georgia State of the United States
BACKGROUND: The COVID-19 is caused by a novel coronavirus SARS-CoV-2, which started from China. It spread rapidly throughout the world and was later declared a pandemic by the WHO. Over the course of time, SARS-CoV-2 has mutated for survival advantages, and this led to multiple variants. Multiple st...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Published by Elsevier B.V.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9323210/ https://www.ncbi.nlm.nih.gov/pubmed/35905853 http://dx.doi.org/10.1016/j.gene.2022.146774 |
_version_ | 1784756494392098816 |
---|---|
author | Ahmad, Waqas Ahmad, Sarfraz Basha, Riyaz |
author_facet | Ahmad, Waqas Ahmad, Sarfraz Basha, Riyaz |
author_sort | Ahmad, Waqas |
collection | PubMed |
description | BACKGROUND: The COVID-19 is caused by a novel coronavirus SARS-CoV-2, which started from China. It spread rapidly throughout the world and was later declared a pandemic by the WHO. Over the course of time, SARS-CoV-2 has mutated for survival advantages, and this led to multiple variants. Multiple studies on mutations identification in SARS-CoV2 have been published covering extensive sample areas. The purpose of this study was to limit the sample area to the Georgia state in the U.S. and to analyze the genome sequences for mutation profiling across the genome and origin of variants. METHODS: The genome sequences (n = 3,970) were obtained from the NCBI database as of June 12, 2021, with the filter of being complete sequenced genomes, homo-sapiens host, and only from Georgia State of the U.S. NextClade, an online tool was used for the analysis of the sequences using Wuhan-Hu-1/2019 as a reference genome. The algorithm was sequence alignment, translation, mutation calling, phylogenetic placement, clade assignment, and quality control (QC). Thirty-six samples with bad QC were removed from the mutational analysis. RESULTS: A total 117,743 mutations in the nucleotides were identified (averaging 31.5 mutations per sample). The mutations A23403G, C3037T, C241T, and C14408T were detected in 98% of the samples. Also, a total of 75,517 mutations in the amino acid were identified (averaging 20.2 mutations per sample). The mutations D614G and P314L were identified in >97% samples whereas R203K, G204R, P681H, and N501Y were detected in >50% samples. Analysis also revealed 16 different clades with 20I (49.6%). Clades 20G (24.2%) and 20A (5.5%) being the most abundant, showed that SARS-CoV-2 in the Georgia State originated mainly from Southeast England, other parts of the U.S., and several countries in Western Europe. CONCLUSION: Looking at the three most common variants in Georgia State of the U.S., we could determine the primary locations of transmission or origin for the virus, and our analyses indicates that majority of the cases originated from Southeast England (Clade 20I), the U.S. itself (Clade 20G), and from Western Europe (Clade 20C). |
format | Online Article Text |
id | pubmed-9323210 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Published by Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93232102022-07-27 Analysis of the mutation dynamics of SARS-CoV-2 genome in the samples from Georgia State of the United States Ahmad, Waqas Ahmad, Sarfraz Basha, Riyaz Gene Article BACKGROUND: The COVID-19 is caused by a novel coronavirus SARS-CoV-2, which started from China. It spread rapidly throughout the world and was later declared a pandemic by the WHO. Over the course of time, SARS-CoV-2 has mutated for survival advantages, and this led to multiple variants. Multiple studies on mutations identification in SARS-CoV2 have been published covering extensive sample areas. The purpose of this study was to limit the sample area to the Georgia state in the U.S. and to analyze the genome sequences for mutation profiling across the genome and origin of variants. METHODS: The genome sequences (n = 3,970) were obtained from the NCBI database as of June 12, 2021, with the filter of being complete sequenced genomes, homo-sapiens host, and only from Georgia State of the U.S. NextClade, an online tool was used for the analysis of the sequences using Wuhan-Hu-1/2019 as a reference genome. The algorithm was sequence alignment, translation, mutation calling, phylogenetic placement, clade assignment, and quality control (QC). Thirty-six samples with bad QC were removed from the mutational analysis. RESULTS: A total 117,743 mutations in the nucleotides were identified (averaging 31.5 mutations per sample). The mutations A23403G, C3037T, C241T, and C14408T were detected in 98% of the samples. Also, a total of 75,517 mutations in the amino acid were identified (averaging 20.2 mutations per sample). The mutations D614G and P314L were identified in >97% samples whereas R203K, G204R, P681H, and N501Y were detected in >50% samples. Analysis also revealed 16 different clades with 20I (49.6%). Clades 20G (24.2%) and 20A (5.5%) being the most abundant, showed that SARS-CoV-2 in the Georgia State originated mainly from Southeast England, other parts of the U.S., and several countries in Western Europe. CONCLUSION: Looking at the three most common variants in Georgia State of the U.S., we could determine the primary locations of transmission or origin for the virus, and our analyses indicates that majority of the cases originated from Southeast England (Clade 20I), the U.S. itself (Clade 20G), and from Western Europe (Clade 20C). Published by Elsevier B.V. 2022-10-20 2022-07-26 /pmc/articles/PMC9323210/ /pubmed/35905853 http://dx.doi.org/10.1016/j.gene.2022.146774 Text en © 2022 Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Ahmad, Waqas Ahmad, Sarfraz Basha, Riyaz Analysis of the mutation dynamics of SARS-CoV-2 genome in the samples from Georgia State of the United States |
title | Analysis of the mutation dynamics of SARS-CoV-2 genome in the samples from Georgia State of the United States |
title_full | Analysis of the mutation dynamics of SARS-CoV-2 genome in the samples from Georgia State of the United States |
title_fullStr | Analysis of the mutation dynamics of SARS-CoV-2 genome in the samples from Georgia State of the United States |
title_full_unstemmed | Analysis of the mutation dynamics of SARS-CoV-2 genome in the samples from Georgia State of the United States |
title_short | Analysis of the mutation dynamics of SARS-CoV-2 genome in the samples from Georgia State of the United States |
title_sort | analysis of the mutation dynamics of sars-cov-2 genome in the samples from georgia state of the united states |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9323210/ https://www.ncbi.nlm.nih.gov/pubmed/35905853 http://dx.doi.org/10.1016/j.gene.2022.146774 |
work_keys_str_mv | AT ahmadwaqas analysisofthemutationdynamicsofsarscov2genomeinthesamplesfromgeorgiastateoftheunitedstates AT ahmadsarfraz analysisofthemutationdynamicsofsarscov2genomeinthesamplesfromgeorgiastateoftheunitedstates AT bashariyaz analysisofthemutationdynamicsofsarscov2genomeinthesamplesfromgeorgiastateoftheunitedstates |