Cargando…

Genome-wide analysis of Indian SARS-CoV-2 genomes for the identification of genetic mutation and SNP

The wave of COVID-19 is a big threat to the human population. Presently, the world is going through different phases of lock down in order to stop this wave of pandemic; India being no exception. We have also started the lock down on 23rd March 2020. In this current situation, apart from social dist...

Descripción completa

Detalles Bibliográficos
Autores principales: Saha, Indrajit, Ghosh, Nimisha, Maity, Debasree, Sharma, Nikhil, Sarkar, Jnanendra Prasad, Mitra, Kaushik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7351671/
https://www.ncbi.nlm.nih.gov/pubmed/32659347
http://dx.doi.org/10.1016/j.meegid.2020.104457
_version_ 1783557484424724480
author Saha, Indrajit
Ghosh, Nimisha
Maity, Debasree
Sharma, Nikhil
Sarkar, Jnanendra Prasad
Mitra, Kaushik
author_facet Saha, Indrajit
Ghosh, Nimisha
Maity, Debasree
Sharma, Nikhil
Sarkar, Jnanendra Prasad
Mitra, Kaushik
author_sort Saha, Indrajit
collection PubMed
description The wave of COVID-19 is a big threat to the human population. Presently, the world is going through different phases of lock down in order to stop this wave of pandemic; India being no exception. We have also started the lock down on 23rd March 2020. In this current situation, apart from social distancing only a vaccine can be the proper solution to serve the population of human being. Thus it is important for all the nations to perform the genome-wide analysis in order to identify the genetic variation in Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) so that proper vaccine can be designed. This fast motivated us to analyze publicly available 566 Indian complete or near complete SARS-CoV-2 genomes to find the mutation points as substitution, deletion and insertion. In this regard, we have performed the multiple sequence alignment in presence of reference sequence from NCBI. After the alignment, a consensus sequence is built to analyze each genome in order to identify the mutation points. As a consequence, we have found 933 substitutions, 2449 deletions and 2 insertions, in total 3384 unique mutation points, in 566 genomes across 29.9 K bp. Further, it has been classified into three groups as 100 clusters of mutations (mostly deletions), 1609 point mutations as substitution, deletion and insertion and 64 SNPs. These outcomes are visualized using BioCircos and bar plots as well as plotting entropy value of each genomic location. Moreover, phylogenetic analysis has also been performed to see the evolution of SARS-CoV-2 virus in India. It also shows the wide variation in tree which indeed vivid in genomic analysis. Finally, these SNPs can be the useful target for virus classification, designing and defining the effective dose of vaccine for the heterogeneous population.
format Online
Article
Text
id pubmed-7351671
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-73516712020-07-13 Genome-wide analysis of Indian SARS-CoV-2 genomes for the identification of genetic mutation and SNP Saha, Indrajit Ghosh, Nimisha Maity, Debasree Sharma, Nikhil Sarkar, Jnanendra Prasad Mitra, Kaushik Infect Genet Evol Research Paper The wave of COVID-19 is a big threat to the human population. Presently, the world is going through different phases of lock down in order to stop this wave of pandemic; India being no exception. We have also started the lock down on 23rd March 2020. In this current situation, apart from social distancing only a vaccine can be the proper solution to serve the population of human being. Thus it is important for all the nations to perform the genome-wide analysis in order to identify the genetic variation in Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) so that proper vaccine can be designed. This fast motivated us to analyze publicly available 566 Indian complete or near complete SARS-CoV-2 genomes to find the mutation points as substitution, deletion and insertion. In this regard, we have performed the multiple sequence alignment in presence of reference sequence from NCBI. After the alignment, a consensus sequence is built to analyze each genome in order to identify the mutation points. As a consequence, we have found 933 substitutions, 2449 deletions and 2 insertions, in total 3384 unique mutation points, in 566 genomes across 29.9 K bp. Further, it has been classified into three groups as 100 clusters of mutations (mostly deletions), 1609 point mutations as substitution, deletion and insertion and 64 SNPs. These outcomes are visualized using BioCircos and bar plots as well as plotting entropy value of each genomic location. Moreover, phylogenetic analysis has also been performed to see the evolution of SARS-CoV-2 virus in India. It also shows the wide variation in tree which indeed vivid in genomic analysis. Finally, these SNPs can be the useful target for virus classification, designing and defining the effective dose of vaccine for the heterogeneous population. Elsevier B.V. 2020-11 2020-07-11 /pmc/articles/PMC7351671/ /pubmed/32659347 http://dx.doi.org/10.1016/j.meegid.2020.104457 Text en © 2020 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Research Paper
Saha, Indrajit
Ghosh, Nimisha
Maity, Debasree
Sharma, Nikhil
Sarkar, Jnanendra Prasad
Mitra, Kaushik
Genome-wide analysis of Indian SARS-CoV-2 genomes for the identification of genetic mutation and SNP
title Genome-wide analysis of Indian SARS-CoV-2 genomes for the identification of genetic mutation and SNP
title_full Genome-wide analysis of Indian SARS-CoV-2 genomes for the identification of genetic mutation and SNP
title_fullStr Genome-wide analysis of Indian SARS-CoV-2 genomes for the identification of genetic mutation and SNP
title_full_unstemmed Genome-wide analysis of Indian SARS-CoV-2 genomes for the identification of genetic mutation and SNP
title_short Genome-wide analysis of Indian SARS-CoV-2 genomes for the identification of genetic mutation and SNP
title_sort genome-wide analysis of indian sars-cov-2 genomes for the identification of genetic mutation and snp
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7351671/
https://www.ncbi.nlm.nih.gov/pubmed/32659347
http://dx.doi.org/10.1016/j.meegid.2020.104457
work_keys_str_mv AT sahaindrajit genomewideanalysisofindiansarscov2genomesfortheidentificationofgeneticmutationandsnp
AT ghoshnimisha genomewideanalysisofindiansarscov2genomesfortheidentificationofgeneticmutationandsnp
AT maitydebasree genomewideanalysisofindiansarscov2genomesfortheidentificationofgeneticmutationandsnp
AT sharmanikhil genomewideanalysisofindiansarscov2genomesfortheidentificationofgeneticmutationandsnp
AT sarkarjnanendraprasad genomewideanalysisofindiansarscov2genomesfortheidentificationofgeneticmutationandsnp
AT mitrakaushik genomewideanalysisofindiansarscov2genomesfortheidentificationofgeneticmutationandsnp