Cargando…

Data-driven platform for identifying variants of interest in COVID-19 virus

New SARS-CoV-2 variants emerge as part of the virus’ adaptation to the human host. The Health Organizations are monitoring newly emerging variants with suspected impact on disease or vaccination efficacy as Variants Being Monitored (VBM), like Delta and Omicron. Genetic changes (SNVs) compared to th...

Descripción completa

Detalles Bibliográficos
Autores principales: Ramarao-Milne, Priya, Jain, Yatish, Sng, Letitia M.F., Hosking, Brendan, Lee, Carol, Bayat, Arash, Kuiper, Michael, Wilson, Laurence O.W., Twine, Natalie A., Bauer, Denis C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9162986/
https://www.ncbi.nlm.nih.gov/pubmed/35677774
http://dx.doi.org/10.1016/j.csbj.2022.06.005
_version_ 1784719830101786624
author Ramarao-Milne, Priya
Jain, Yatish
Sng, Letitia M.F.
Hosking, Brendan
Lee, Carol
Bayat, Arash
Kuiper, Michael
Wilson, Laurence O.W.
Twine, Natalie A.
Bauer, Denis C.
author_facet Ramarao-Milne, Priya
Jain, Yatish
Sng, Letitia M.F.
Hosking, Brendan
Lee, Carol
Bayat, Arash
Kuiper, Michael
Wilson, Laurence O.W.
Twine, Natalie A.
Bauer, Denis C.
author_sort Ramarao-Milne, Priya
collection PubMed
description New SARS-CoV-2 variants emerge as part of the virus’ adaptation to the human host. The Health Organizations are monitoring newly emerging variants with suspected impact on disease or vaccination efficacy as Variants Being Monitored (VBM), like Delta and Omicron. Genetic changes (SNVs) compared to the Wuhan variant characterize VBMs with current emphasis on the spike protein and lineage markers. However, monitoring VBMs in such a way might miss SNVs with functional effect on disease. Here we introduce a lineage-agnostic genome-wide approach to identify SNVs associated with disease. We curated a case-control dataset of 10,520 samples and identified 117 SNVs significantly associated with adverse patient outcome. While 40% (47) SNV are already monitored and 36% (43) are in the spike protein, we also identified 70 new SNVs that are associated with disease outcome. 31 of these are disease-worsening and predominantly located in the 3′-5′ exonuclease (NSP14) with structural modelling revealing a concise cluster in the Zn binding domain that has known host-immune modulating function. Furthermore, we generate clade-independent VBM groupings by identifying interacting SNVs (epistasis). We find 37 sets of higher-order epistatic interactions joining 5 genomic regions (nsp3, nsp14, Spike S1, ORF3a, N). Structural modelling of these regions provides insights into potential mechanistic pathways of increased virulence as well as orthogonal methods of validation. Clade-independent monitoring of functionally interacting (epistasis, co-evolution) SNVs detected emerging VBM a week before they were flagged by Health Organizations and in conjunction with structural modelling provides faster, mechanistic insight into emerging strains to guide public health interventions.
format Online
Article
Text
id pubmed-9162986
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-91629862022-06-04 Data-driven platform for identifying variants of interest in COVID-19 virus Ramarao-Milne, Priya Jain, Yatish Sng, Letitia M.F. Hosking, Brendan Lee, Carol Bayat, Arash Kuiper, Michael Wilson, Laurence O.W. Twine, Natalie A. Bauer, Denis C. Comput Struct Biotechnol J Research Article New SARS-CoV-2 variants emerge as part of the virus’ adaptation to the human host. The Health Organizations are monitoring newly emerging variants with suspected impact on disease or vaccination efficacy as Variants Being Monitored (VBM), like Delta and Omicron. Genetic changes (SNVs) compared to the Wuhan variant characterize VBMs with current emphasis on the spike protein and lineage markers. However, monitoring VBMs in such a way might miss SNVs with functional effect on disease. Here we introduce a lineage-agnostic genome-wide approach to identify SNVs associated with disease. We curated a case-control dataset of 10,520 samples and identified 117 SNVs significantly associated with adverse patient outcome. While 40% (47) SNV are already monitored and 36% (43) are in the spike protein, we also identified 70 new SNVs that are associated with disease outcome. 31 of these are disease-worsening and predominantly located in the 3′-5′ exonuclease (NSP14) with structural modelling revealing a concise cluster in the Zn binding domain that has known host-immune modulating function. Furthermore, we generate clade-independent VBM groupings by identifying interacting SNVs (epistasis). We find 37 sets of higher-order epistatic interactions joining 5 genomic regions (nsp3, nsp14, Spike S1, ORF3a, N). Structural modelling of these regions provides insights into potential mechanistic pathways of increased virulence as well as orthogonal methods of validation. Clade-independent monitoring of functionally interacting (epistasis, co-evolution) SNVs detected emerging VBM a week before they were flagged by Health Organizations and in conjunction with structural modelling provides faster, mechanistic insight into emerging strains to guide public health interventions. Research Network of Computational and Structural Biotechnology 2022-06-03 /pmc/articles/PMC9162986/ /pubmed/35677774 http://dx.doi.org/10.1016/j.csbj.2022.06.005 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Ramarao-Milne, Priya
Jain, Yatish
Sng, Letitia M.F.
Hosking, Brendan
Lee, Carol
Bayat, Arash
Kuiper, Michael
Wilson, Laurence O.W.
Twine, Natalie A.
Bauer, Denis C.
Data-driven platform for identifying variants of interest in COVID-19 virus
title Data-driven platform for identifying variants of interest in COVID-19 virus
title_full Data-driven platform for identifying variants of interest in COVID-19 virus
title_fullStr Data-driven platform for identifying variants of interest in COVID-19 virus
title_full_unstemmed Data-driven platform for identifying variants of interest in COVID-19 virus
title_short Data-driven platform for identifying variants of interest in COVID-19 virus
title_sort data-driven platform for identifying variants of interest in covid-19 virus
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9162986/
https://www.ncbi.nlm.nih.gov/pubmed/35677774
http://dx.doi.org/10.1016/j.csbj.2022.06.005
work_keys_str_mv AT ramaraomilnepriya datadrivenplatformforidentifyingvariantsofinterestincovid19virus
AT jainyatish datadrivenplatformforidentifyingvariantsofinterestincovid19virus
AT sngletitiamf datadrivenplatformforidentifyingvariantsofinterestincovid19virus
AT hoskingbrendan datadrivenplatformforidentifyingvariantsofinterestincovid19virus
AT leecarol datadrivenplatformforidentifyingvariantsofinterestincovid19virus
AT bayatarash datadrivenplatformforidentifyingvariantsofinterestincovid19virus
AT kuipermichael datadrivenplatformforidentifyingvariantsofinterestincovid19virus
AT wilsonlaurenceow datadrivenplatformforidentifyingvariantsofinterestincovid19virus
AT twinenataliea datadrivenplatformforidentifyingvariantsofinterestincovid19virus
AT bauerdenisc datadrivenplatformforidentifyingvariantsofinterestincovid19virus