Cargando…

Genomic chronicle of SARS-CoV-2: a mutational analysis with over 1 million genome sequences

Use of information technologies to analyse big data on SARS-CoV-2 genome provides an insight for tracking variations and examining the evolution of the virus. Nevertheless, storing, processing, alignment and analyses of these numerous genomes are still a challenge. In this study, over 1 million SARS...

Descripción completa

Detalles Bibliográficos
Autores principales: UĞUREL, Osman Mutluhan, ATA, Oğuz, TURGUT-BALIK, Dilek
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Scientific and Technological Research Council of Turkey 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573839/
https://www.ncbi.nlm.nih.gov/pubmed/34803444
http://dx.doi.org/10.3906/biy-2106-8
_version_ 1784595499584585728
author UĞUREL, Osman Mutluhan
ATA, Oğuz
TURGUT-BALIK, Dilek
author_facet UĞUREL, Osman Mutluhan
ATA, Oğuz
TURGUT-BALIK, Dilek
author_sort UĞUREL, Osman Mutluhan
collection PubMed
description Use of information technologies to analyse big data on SARS-CoV-2 genome provides an insight for tracking variations and examining the evolution of the virus. Nevertheless, storing, processing, alignment and analyses of these numerous genomes are still a challenge. In this study, over 1 million SARS-CoV-2 genomes have been analysed to show distribution and relationship of variations that could enlighten development and evolution of the virus. In all genomes analysed in this study, a total of over 215M SNVs have been detected and average number of SNV per isolate was found to be 21.83. Single nucleotide variant (SNV) average is observed to reach 31.25 just in March 2021. The average variation number of isolates is increasing and compromising with total case numbers around the world. Remarkably, cytosine deamination, which is one of the most important biochemical processes in the evolutionary development of coronaviruses, accounts for 46% of all SNVs seen in SARS-CoV-2 genomes within 16 months. This study is one of the most comprehensive SARS-CoV-2 genomic analysis study in terms of number of genomes analysed in an academic publication so far, and reported results could be useful in monitoring the development of SARS-CoV-2.
format Online
Article
Text
id pubmed-8573839
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Scientific and Technological Research Council of Turkey
record_format MEDLINE/PubMed
spelling pubmed-85738392021-11-18 Genomic chronicle of SARS-CoV-2: a mutational analysis with over 1 million genome sequences UĞUREL, Osman Mutluhan ATA, Oğuz TURGUT-BALIK, Dilek Turk J Biol Article Use of information technologies to analyse big data on SARS-CoV-2 genome provides an insight for tracking variations and examining the evolution of the virus. Nevertheless, storing, processing, alignment and analyses of these numerous genomes are still a challenge. In this study, over 1 million SARS-CoV-2 genomes have been analysed to show distribution and relationship of variations that could enlighten development and evolution of the virus. In all genomes analysed in this study, a total of over 215M SNVs have been detected and average number of SNV per isolate was found to be 21.83. Single nucleotide variant (SNV) average is observed to reach 31.25 just in March 2021. The average variation number of isolates is increasing and compromising with total case numbers around the world. Remarkably, cytosine deamination, which is one of the most important biochemical processes in the evolutionary development of coronaviruses, accounts for 46% of all SNVs seen in SARS-CoV-2 genomes within 16 months. This study is one of the most comprehensive SARS-CoV-2 genomic analysis study in terms of number of genomes analysed in an academic publication so far, and reported results could be useful in monitoring the development of SARS-CoV-2. The Scientific and Technological Research Council of Turkey 2021-08-30 /pmc/articles/PMC8573839/ /pubmed/34803444 http://dx.doi.org/10.3906/biy-2106-8 Text en Copyright © 2021 The Author(s) https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use and redistribution provided that the original author and source are credited.
spellingShingle Article
UĞUREL, Osman Mutluhan
ATA, Oğuz
TURGUT-BALIK, Dilek
Genomic chronicle of SARS-CoV-2: a mutational analysis with over 1 million genome sequences
title Genomic chronicle of SARS-CoV-2: a mutational analysis with over 1 million genome sequences
title_full Genomic chronicle of SARS-CoV-2: a mutational analysis with over 1 million genome sequences
title_fullStr Genomic chronicle of SARS-CoV-2: a mutational analysis with over 1 million genome sequences
title_full_unstemmed Genomic chronicle of SARS-CoV-2: a mutational analysis with over 1 million genome sequences
title_short Genomic chronicle of SARS-CoV-2: a mutational analysis with over 1 million genome sequences
title_sort genomic chronicle of sars-cov-2: a mutational analysis with over 1 million genome sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573839/
https://www.ncbi.nlm.nih.gov/pubmed/34803444
http://dx.doi.org/10.3906/biy-2106-8
work_keys_str_mv AT ugurelosmanmutluhan genomicchronicleofsarscov2amutationalanalysiswithover1milliongenomesequences
AT ataoguz genomicchronicleofsarscov2amutationalanalysiswithover1milliongenomesequences
AT turgutbalikdilek genomicchronicleofsarscov2amutationalanalysiswithover1milliongenomesequences