Cargando…

Comparative Genomics and Integrated Network Approach Unveiled Undirected Phylogeny Patterns, Co-mutational Hot Spots, Functional Cross Talk, and Regulatory Interactions in SARS-CoV-2

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has resulted in 92 million cases in a span of 1 year. The study focuses on understanding population-specific variations attributing its high rate of infections in specific geographical regions particularly in the United States...

Descripción completa

Detalles Bibliográficos
Autores principales: Gupta, Vipin, Haider, Shazia, Verma, Mansi, Singhvi, Nirjara, Ponnusamy, Kalaisaran, Malik, Md. Zubbair, Verma, Helianthous, Kumar, Roshan, Sood, Utkarsh, Hira, Princy, Satija, Shiva, Singh, Yogendra, Lal, Rup
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573956/
https://www.ncbi.nlm.nih.gov/pubmed/33622851
http://dx.doi.org/10.1128/mSystems.00030-21
_version_ 1784595522999287808
author Gupta, Vipin
Haider, Shazia
Verma, Mansi
Singhvi, Nirjara
Ponnusamy, Kalaisaran
Malik, Md. Zubbair
Verma, Helianthous
Kumar, Roshan
Sood, Utkarsh
Hira, Princy
Satija, Shiva
Singh, Yogendra
Lal, Rup
author_facet Gupta, Vipin
Haider, Shazia
Verma, Mansi
Singhvi, Nirjara
Ponnusamy, Kalaisaran
Malik, Md. Zubbair
Verma, Helianthous
Kumar, Roshan
Sood, Utkarsh
Hira, Princy
Satija, Shiva
Singh, Yogendra
Lal, Rup
author_sort Gupta, Vipin
collection PubMed
description The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has resulted in 92 million cases in a span of 1 year. The study focuses on understanding population-specific variations attributing its high rate of infections in specific geographical regions particularly in the United States. Rigorous phylogenomic network analysis of complete SARS-CoV-2 genomes (245) inferred five central clades named a (ancestral), b, c, d, and e (subtypes e1 and e2). Clade d and subclade e2 were found exclusively comprised of U.S. strains. Clades were distinguished by 10 co-mutational combinations in Nsp3, ORF8, Nsp13, S, Nsp12, Nsp2, and Nsp6. Our analysis revealed that only 67.46% of single nucleotide polymorphism (SNP) mutations were at the amino acid level. T1103P mutation in Nsp3 was predicted to increase protein stability in 238 strains except for 6 strains which were marked as ancestral type, whereas co-mutation (P409L and Y446C) in Nsp13 were found in 64 genomes from the United States highlighting its 100% co-occurrence. Docking highlighted mutation (D614G) caused reduction in binding of spike proteins with angiotensin-converting enzyme 2 (ACE2), but it also showed better interaction with the TMPRSS2 receptor contributing to high transmissibility among U.S. strains. We also found host proteins, MYO5A, MYO5B, and MYO5C, that had maximum interaction with viral proteins (nucleocapsid [N], spike [S], and membrane [M] proteins). Thus, blocking the internalization pathway by inhibiting MYO5 proteins which could be an effective target for coronavirus disease 2019 (COVID-19) treatment. The functional annotations of the host-pathogen interaction (HPI) network were found to be closely associated with hypoxia and thrombotic conditions, confirming the vulnerability and severity of infection. We also screened CpG islands in Nsp1 and N conferring the ability of SARS-CoV-2 to enter and trigger zinc antiviral protein (ZAP) activity inside the host cell. IMPORTANCE In the current study, we presented a global view of mutational pattern observed in SARS-CoV-2 virus transmission. This provided a who-infect-whom geographical model since the early pandemic. This is hitherto the most comprehensive comparative genomics analysis of full-length genomes for co-mutations at different geographical regions especially in U.S. strains. Compositional structural biology results suggested that mutations have a balance of opposing forces affecting pathogenicity suggesting that only a few mutations are effective at the translation level. Novel HPI analysis and CpG predictions elucidate the proof of concept of hypoxia and thrombotic conditions in several patients. Thus, the current study focuses the understanding of population-specific variations attributing a high rate of SARS-CoV-2 infections in specific geographical regions which may eventually be vital for the most severely affected countries and regions for sharp development of custom-made vindication strategies.
format Online
Article
Text
id pubmed-8573956
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-85739562021-11-08 Comparative Genomics and Integrated Network Approach Unveiled Undirected Phylogeny Patterns, Co-mutational Hot Spots, Functional Cross Talk, and Regulatory Interactions in SARS-CoV-2 Gupta, Vipin Haider, Shazia Verma, Mansi Singhvi, Nirjara Ponnusamy, Kalaisaran Malik, Md. Zubbair Verma, Helianthous Kumar, Roshan Sood, Utkarsh Hira, Princy Satija, Shiva Singh, Yogendra Lal, Rup mSystems Research Article The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has resulted in 92 million cases in a span of 1 year. The study focuses on understanding population-specific variations attributing its high rate of infections in specific geographical regions particularly in the United States. Rigorous phylogenomic network analysis of complete SARS-CoV-2 genomes (245) inferred five central clades named a (ancestral), b, c, d, and e (subtypes e1 and e2). Clade d and subclade e2 were found exclusively comprised of U.S. strains. Clades were distinguished by 10 co-mutational combinations in Nsp3, ORF8, Nsp13, S, Nsp12, Nsp2, and Nsp6. Our analysis revealed that only 67.46% of single nucleotide polymorphism (SNP) mutations were at the amino acid level. T1103P mutation in Nsp3 was predicted to increase protein stability in 238 strains except for 6 strains which were marked as ancestral type, whereas co-mutation (P409L and Y446C) in Nsp13 were found in 64 genomes from the United States highlighting its 100% co-occurrence. Docking highlighted mutation (D614G) caused reduction in binding of spike proteins with angiotensin-converting enzyme 2 (ACE2), but it also showed better interaction with the TMPRSS2 receptor contributing to high transmissibility among U.S. strains. We also found host proteins, MYO5A, MYO5B, and MYO5C, that had maximum interaction with viral proteins (nucleocapsid [N], spike [S], and membrane [M] proteins). Thus, blocking the internalization pathway by inhibiting MYO5 proteins which could be an effective target for coronavirus disease 2019 (COVID-19) treatment. The functional annotations of the host-pathogen interaction (HPI) network were found to be closely associated with hypoxia and thrombotic conditions, confirming the vulnerability and severity of infection. We also screened CpG islands in Nsp1 and N conferring the ability of SARS-CoV-2 to enter and trigger zinc antiviral protein (ZAP) activity inside the host cell. IMPORTANCE In the current study, we presented a global view of mutational pattern observed in SARS-CoV-2 virus transmission. This provided a who-infect-whom geographical model since the early pandemic. This is hitherto the most comprehensive comparative genomics analysis of full-length genomes for co-mutations at different geographical regions especially in U.S. strains. Compositional structural biology results suggested that mutations have a balance of opposing forces affecting pathogenicity suggesting that only a few mutations are effective at the translation level. Novel HPI analysis and CpG predictions elucidate the proof of concept of hypoxia and thrombotic conditions in several patients. Thus, the current study focuses the understanding of population-specific variations attributing a high rate of SARS-CoV-2 infections in specific geographical regions which may eventually be vital for the most severely affected countries and regions for sharp development of custom-made vindication strategies. American Society for Microbiology 2021-02-23 /pmc/articles/PMC8573956/ /pubmed/33622851 http://dx.doi.org/10.1128/mSystems.00030-21 Text en Copyright © 2021 Gupta et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Gupta, Vipin
Haider, Shazia
Verma, Mansi
Singhvi, Nirjara
Ponnusamy, Kalaisaran
Malik, Md. Zubbair
Verma, Helianthous
Kumar, Roshan
Sood, Utkarsh
Hira, Princy
Satija, Shiva
Singh, Yogendra
Lal, Rup
Comparative Genomics and Integrated Network Approach Unveiled Undirected Phylogeny Patterns, Co-mutational Hot Spots, Functional Cross Talk, and Regulatory Interactions in SARS-CoV-2
title Comparative Genomics and Integrated Network Approach Unveiled Undirected Phylogeny Patterns, Co-mutational Hot Spots, Functional Cross Talk, and Regulatory Interactions in SARS-CoV-2
title_full Comparative Genomics and Integrated Network Approach Unveiled Undirected Phylogeny Patterns, Co-mutational Hot Spots, Functional Cross Talk, and Regulatory Interactions in SARS-CoV-2
title_fullStr Comparative Genomics and Integrated Network Approach Unveiled Undirected Phylogeny Patterns, Co-mutational Hot Spots, Functional Cross Talk, and Regulatory Interactions in SARS-CoV-2
title_full_unstemmed Comparative Genomics and Integrated Network Approach Unveiled Undirected Phylogeny Patterns, Co-mutational Hot Spots, Functional Cross Talk, and Regulatory Interactions in SARS-CoV-2
title_short Comparative Genomics and Integrated Network Approach Unveiled Undirected Phylogeny Patterns, Co-mutational Hot Spots, Functional Cross Talk, and Regulatory Interactions in SARS-CoV-2
title_sort comparative genomics and integrated network approach unveiled undirected phylogeny patterns, co-mutational hot spots, functional cross talk, and regulatory interactions in sars-cov-2
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573956/
https://www.ncbi.nlm.nih.gov/pubmed/33622851
http://dx.doi.org/10.1128/mSystems.00030-21
work_keys_str_mv AT guptavipin comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT haidershazia comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT vermamansi comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT singhvinirjara comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT ponnusamykalaisaran comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT malikmdzubbair comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT vermahelianthous comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT kumarroshan comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT soodutkarsh comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT hiraprincy comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT satijashiva comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT singhyogendra comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2
AT lalrup comparativegenomicsandintegratednetworkapproachunveiledundirectedphylogenypatternscomutationalhotspotsfunctionalcrosstalkandregulatoryinteractionsinsarscov2