Cargando…

Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes

The emergence of new variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a major concern given their potential impact on the transmissibility and pathogenicity of the virus as well as the efficacy of therapeutic interventions. Here, we predict the mutability of all positions...

Descripción completa

Detalles Bibliográficos
Autores principales: Rodriguez-Rivas, Juan, Croce, Giancarlo, Muscat, Maureen, Weigt, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8795541/
https://www.ncbi.nlm.nih.gov/pubmed/35022216
http://dx.doi.org/10.1073/pnas.2113118119
_version_ 1784641089432453120
author Rodriguez-Rivas, Juan
Croce, Giancarlo
Muscat, Maureen
Weigt, Martin
author_facet Rodriguez-Rivas, Juan
Croce, Giancarlo
Muscat, Maureen
Weigt, Martin
author_sort Rodriguez-Rivas, Juan
collection PubMed
description The emergence of new variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a major concern given their potential impact on the transmissibility and pathogenicity of the virus as well as the efficacy of therapeutic interventions. Here, we predict the mutability of all positions in SARS-CoV-2 protein domains to forecast the appearance of unseen variants. Using sequence data from other coronaviruses, preexisting to SARS-CoV-2, we build statistical models that not only capture amino acid conservation but also more complex patterns resulting from epistasis. We show that these models are notably superior to conservation profiles in estimating the already observable SARS-CoV-2 variability. In the receptor binding domain of the spike protein, we observe that the predicted mutability correlates well with experimental measures of protein stability and that both are reliable mutability predictors (receiver operating characteristic areas under the curve ∼0.8). Most interestingly, we observe an increasing agreement between our model and the observed variability as more data become available over time, proving the anticipatory capacity of our model. When combined with data concerning the immune response, our approach identifies positions where current variants of concern are highly overrepresented. These results could assist studies on viral evolution and future viral outbreaks and, in particular, guide the exploration and anticipation of potentially harmful future SARS-CoV-2 variants.
format Online
Article
Text
id pubmed-8795541
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-87955412022-02-03 Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes Rodriguez-Rivas, Juan Croce, Giancarlo Muscat, Maureen Weigt, Martin Proc Natl Acad Sci U S A Physical Sciences The emergence of new variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a major concern given their potential impact on the transmissibility and pathogenicity of the virus as well as the efficacy of therapeutic interventions. Here, we predict the mutability of all positions in SARS-CoV-2 protein domains to forecast the appearance of unseen variants. Using sequence data from other coronaviruses, preexisting to SARS-CoV-2, we build statistical models that not only capture amino acid conservation but also more complex patterns resulting from epistasis. We show that these models are notably superior to conservation profiles in estimating the already observable SARS-CoV-2 variability. In the receptor binding domain of the spike protein, we observe that the predicted mutability correlates well with experimental measures of protein stability and that both are reliable mutability predictors (receiver operating characteristic areas under the curve ∼0.8). Most interestingly, we observe an increasing agreement between our model and the observed variability as more data become available over time, proving the anticipatory capacity of our model. When combined with data concerning the immune response, our approach identifies positions where current variants of concern are highly overrepresented. These results could assist studies on viral evolution and future viral outbreaks and, in particular, guide the exploration and anticipation of potentially harmful future SARS-CoV-2 variants. National Academy of Sciences 2022-01-12 2022-01-25 /pmc/articles/PMC8795541/ /pubmed/35022216 http://dx.doi.org/10.1073/pnas.2113118119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Physical Sciences
Rodriguez-Rivas, Juan
Croce, Giancarlo
Muscat, Maureen
Weigt, Martin
Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes
title Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes
title_full Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes
title_fullStr Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes
title_full_unstemmed Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes
title_short Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes
title_sort epistatic models predict mutable sites in sars-cov-2 proteins and epitopes
topic Physical Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8795541/
https://www.ncbi.nlm.nih.gov/pubmed/35022216
http://dx.doi.org/10.1073/pnas.2113118119
work_keys_str_mv AT rodriguezrivasjuan epistaticmodelspredictmutablesitesinsarscov2proteinsandepitopes
AT crocegiancarlo epistaticmodelspredictmutablesitesinsarscov2proteinsandepitopes
AT muscatmaureen epistaticmodelspredictmutablesitesinsarscov2proteinsandepitopes
AT weigtmartin epistaticmodelspredictmutablesitesinsarscov2proteinsandepitopes