Cargando…
Data-driven platform for identifying variants of interest in COVID-19 virus
New SARS-CoV-2 variants emerge as part of the virus’ adaptation to the human host. The Health Organizations are monitoring newly emerging variants with suspected impact on disease or vaccination efficacy as Variants Being Monitored (VBM), like Delta and Omicron. Genetic changes (SNVs) compared to th...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9162986/ https://www.ncbi.nlm.nih.gov/pubmed/35677774 http://dx.doi.org/10.1016/j.csbj.2022.06.005 |
_version_ | 1784719830101786624 |
---|---|
author | Ramarao-Milne, Priya Jain, Yatish Sng, Letitia M.F. Hosking, Brendan Lee, Carol Bayat, Arash Kuiper, Michael Wilson, Laurence O.W. Twine, Natalie A. Bauer, Denis C. |
author_facet | Ramarao-Milne, Priya Jain, Yatish Sng, Letitia M.F. Hosking, Brendan Lee, Carol Bayat, Arash Kuiper, Michael Wilson, Laurence O.W. Twine, Natalie A. Bauer, Denis C. |
author_sort | Ramarao-Milne, Priya |
collection | PubMed |
description | New SARS-CoV-2 variants emerge as part of the virus’ adaptation to the human host. The Health Organizations are monitoring newly emerging variants with suspected impact on disease or vaccination efficacy as Variants Being Monitored (VBM), like Delta and Omicron. Genetic changes (SNVs) compared to the Wuhan variant characterize VBMs with current emphasis on the spike protein and lineage markers. However, monitoring VBMs in such a way might miss SNVs with functional effect on disease. Here we introduce a lineage-agnostic genome-wide approach to identify SNVs associated with disease. We curated a case-control dataset of 10,520 samples and identified 117 SNVs significantly associated with adverse patient outcome. While 40% (47) SNV are already monitored and 36% (43) are in the spike protein, we also identified 70 new SNVs that are associated with disease outcome. 31 of these are disease-worsening and predominantly located in the 3′-5′ exonuclease (NSP14) with structural modelling revealing a concise cluster in the Zn binding domain that has known host-immune modulating function. Furthermore, we generate clade-independent VBM groupings by identifying interacting SNVs (epistasis). We find 37 sets of higher-order epistatic interactions joining 5 genomic regions (nsp3, nsp14, Spike S1, ORF3a, N). Structural modelling of these regions provides insights into potential mechanistic pathways of increased virulence as well as orthogonal methods of validation. Clade-independent monitoring of functionally interacting (epistasis, co-evolution) SNVs detected emerging VBM a week before they were flagged by Health Organizations and in conjunction with structural modelling provides faster, mechanistic insight into emerging strains to guide public health interventions. |
format | Online Article Text |
id | pubmed-9162986 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-91629862022-06-04 Data-driven platform for identifying variants of interest in COVID-19 virus Ramarao-Milne, Priya Jain, Yatish Sng, Letitia M.F. Hosking, Brendan Lee, Carol Bayat, Arash Kuiper, Michael Wilson, Laurence O.W. Twine, Natalie A. Bauer, Denis C. Comput Struct Biotechnol J Research Article New SARS-CoV-2 variants emerge as part of the virus’ adaptation to the human host. The Health Organizations are monitoring newly emerging variants with suspected impact on disease or vaccination efficacy as Variants Being Monitored (VBM), like Delta and Omicron. Genetic changes (SNVs) compared to the Wuhan variant characterize VBMs with current emphasis on the spike protein and lineage markers. However, monitoring VBMs in such a way might miss SNVs with functional effect on disease. Here we introduce a lineage-agnostic genome-wide approach to identify SNVs associated with disease. We curated a case-control dataset of 10,520 samples and identified 117 SNVs significantly associated with adverse patient outcome. While 40% (47) SNV are already monitored and 36% (43) are in the spike protein, we also identified 70 new SNVs that are associated with disease outcome. 31 of these are disease-worsening and predominantly located in the 3′-5′ exonuclease (NSP14) with structural modelling revealing a concise cluster in the Zn binding domain that has known host-immune modulating function. Furthermore, we generate clade-independent VBM groupings by identifying interacting SNVs (epistasis). We find 37 sets of higher-order epistatic interactions joining 5 genomic regions (nsp3, nsp14, Spike S1, ORF3a, N). Structural modelling of these regions provides insights into potential mechanistic pathways of increased virulence as well as orthogonal methods of validation. Clade-independent monitoring of functionally interacting (epistasis, co-evolution) SNVs detected emerging VBM a week before they were flagged by Health Organizations and in conjunction with structural modelling provides faster, mechanistic insight into emerging strains to guide public health interventions. Research Network of Computational and Structural Biotechnology 2022-06-03 /pmc/articles/PMC9162986/ /pubmed/35677774 http://dx.doi.org/10.1016/j.csbj.2022.06.005 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Ramarao-Milne, Priya Jain, Yatish Sng, Letitia M.F. Hosking, Brendan Lee, Carol Bayat, Arash Kuiper, Michael Wilson, Laurence O.W. Twine, Natalie A. Bauer, Denis C. Data-driven platform for identifying variants of interest in COVID-19 virus |
title | Data-driven platform for identifying variants of interest in COVID-19 virus |
title_full | Data-driven platform for identifying variants of interest in COVID-19 virus |
title_fullStr | Data-driven platform for identifying variants of interest in COVID-19 virus |
title_full_unstemmed | Data-driven platform for identifying variants of interest in COVID-19 virus |
title_short | Data-driven platform for identifying variants of interest in COVID-19 virus |
title_sort | data-driven platform for identifying variants of interest in covid-19 virus |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9162986/ https://www.ncbi.nlm.nih.gov/pubmed/35677774 http://dx.doi.org/10.1016/j.csbj.2022.06.005 |
work_keys_str_mv | AT ramaraomilnepriya datadrivenplatformforidentifyingvariantsofinterestincovid19virus AT jainyatish datadrivenplatformforidentifyingvariantsofinterestincovid19virus AT sngletitiamf datadrivenplatformforidentifyingvariantsofinterestincovid19virus AT hoskingbrendan datadrivenplatformforidentifyingvariantsofinterestincovid19virus AT leecarol datadrivenplatformforidentifyingvariantsofinterestincovid19virus AT bayatarash datadrivenplatformforidentifyingvariantsofinterestincovid19virus AT kuipermichael datadrivenplatformforidentifyingvariantsofinterestincovid19virus AT wilsonlaurenceow datadrivenplatformforidentifyingvariantsofinterestincovid19virus AT twinenataliea datadrivenplatformforidentifyingvariantsofinterestincovid19virus AT bauerdenisc datadrivenplatformforidentifyingvariantsofinterestincovid19virus |