Cargando…

Bioinformatics pipeline unveils genetic variability to synthetic vaccine design for Indian SARS-CoV-2 genomes

In the worrisome scenarios of various waves of SARS-CoV-2 pandemic, a comprehensive bioinformatics pipeline is essential to analyse the virus genomes in order to understand its evolution, thereby identifying mutations as signature SNPs, conserved regions and subsequently to design epitope based synt...

Descripción completa

Detalles Bibliográficos
Autores principales: Ghosh, Nimisha, Saha, Indrajit, Sharma, Nikhil, Nandi, Suman
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9444899/
https://www.ncbi.nlm.nih.gov/pubmed/36116149
http://dx.doi.org/10.1016/j.intimp.2022.109224
_version_ 1784783330020950016
author Ghosh, Nimisha
Saha, Indrajit
Sharma, Nikhil
Nandi, Suman
author_facet Ghosh, Nimisha
Saha, Indrajit
Sharma, Nikhil
Nandi, Suman
author_sort Ghosh, Nimisha
collection PubMed
description In the worrisome scenarios of various waves of SARS-CoV-2 pandemic, a comprehensive bioinformatics pipeline is essential to analyse the virus genomes in order to understand its evolution, thereby identifying mutations as signature SNPs, conserved regions and subsequently to design epitope based synthetic vaccine. We have thus performed multiple sequence alignment of 4996 Indian SARS-CoV-2 genomes as a case study using MAFFT followed by phylogenetic analysis using Nextstrain to identify virus clades. Furthermore, based on the entropy of each genomic coordinate of the aligned sequences, conserved regions are identified. After refinement of the conserved regions, based on its length, one conserved region is identified for which the primers and probes are reported for virus detection. The refined conserved regions are also used to identify T-cell and B-cell epitopes along with their immunogenic and antigenic scores. Such scores are used for selecting the most immunogenic and antigenic epitopes. By executing this pipeline, 40 unique signature SNPs are identified resulting in 23 non-synonymous signature SNPs which provide 28 amino acid changes in protein. On the other hand, 12 conserved regions are selected based on refinement criteria out of which one is selected as the potential target for virus detection. Additionally, 22 MHC-I and 21 MHC-II restricted T-cell epitopes with 10 unique HLA alleles each and 17 B-cell epitopes are obtained for 12 conserved regions. All the results are validated both quantitatively and qualitatively which show that from genetic variability to synthetic vaccine design, the proposed pipeline can be used effectively to combat SARS-CoV-2.
format Online
Article
Text
id pubmed-9444899
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-94448992022-09-06 Bioinformatics pipeline unveils genetic variability to synthetic vaccine design for Indian SARS-CoV-2 genomes Ghosh, Nimisha Saha, Indrajit Sharma, Nikhil Nandi, Suman Int Immunopharmacol Article In the worrisome scenarios of various waves of SARS-CoV-2 pandemic, a comprehensive bioinformatics pipeline is essential to analyse the virus genomes in order to understand its evolution, thereby identifying mutations as signature SNPs, conserved regions and subsequently to design epitope based synthetic vaccine. We have thus performed multiple sequence alignment of 4996 Indian SARS-CoV-2 genomes as a case study using MAFFT followed by phylogenetic analysis using Nextstrain to identify virus clades. Furthermore, based on the entropy of each genomic coordinate of the aligned sequences, conserved regions are identified. After refinement of the conserved regions, based on its length, one conserved region is identified for which the primers and probes are reported for virus detection. The refined conserved regions are also used to identify T-cell and B-cell epitopes along with their immunogenic and antigenic scores. Such scores are used for selecting the most immunogenic and antigenic epitopes. By executing this pipeline, 40 unique signature SNPs are identified resulting in 23 non-synonymous signature SNPs which provide 28 amino acid changes in protein. On the other hand, 12 conserved regions are selected based on refinement criteria out of which one is selected as the potential target for virus detection. Additionally, 22 MHC-I and 21 MHC-II restricted T-cell epitopes with 10 unique HLA alleles each and 17 B-cell epitopes are obtained for 12 conserved regions. All the results are validated both quantitatively and qualitatively which show that from genetic variability to synthetic vaccine design, the proposed pipeline can be used effectively to combat SARS-CoV-2. Elsevier B.V. 2022-11 2022-09-06 /pmc/articles/PMC9444899/ /pubmed/36116149 http://dx.doi.org/10.1016/j.intimp.2022.109224 Text en © 2022 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Ghosh, Nimisha
Saha, Indrajit
Sharma, Nikhil
Nandi, Suman
Bioinformatics pipeline unveils genetic variability to synthetic vaccine design for Indian SARS-CoV-2 genomes
title Bioinformatics pipeline unveils genetic variability to synthetic vaccine design for Indian SARS-CoV-2 genomes
title_full Bioinformatics pipeline unveils genetic variability to synthetic vaccine design for Indian SARS-CoV-2 genomes
title_fullStr Bioinformatics pipeline unveils genetic variability to synthetic vaccine design for Indian SARS-CoV-2 genomes
title_full_unstemmed Bioinformatics pipeline unveils genetic variability to synthetic vaccine design for Indian SARS-CoV-2 genomes
title_short Bioinformatics pipeline unveils genetic variability to synthetic vaccine design for Indian SARS-CoV-2 genomes
title_sort bioinformatics pipeline unveils genetic variability to synthetic vaccine design for indian sars-cov-2 genomes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9444899/
https://www.ncbi.nlm.nih.gov/pubmed/36116149
http://dx.doi.org/10.1016/j.intimp.2022.109224
work_keys_str_mv AT ghoshnimisha bioinformaticspipelineunveilsgeneticvariabilitytosyntheticvaccinedesignforindiansarscov2genomes
AT sahaindrajit bioinformaticspipelineunveilsgeneticvariabilitytosyntheticvaccinedesignforindiansarscov2genomes
AT sharmanikhil bioinformaticspipelineunveilsgeneticvariabilitytosyntheticvaccinedesignforindiansarscov2genomes
AT nandisuman bioinformaticspipelineunveilsgeneticvariabilitytosyntheticvaccinedesignforindiansarscov2genomes