Cargando…

Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity

Understanding how proteins evolve under selective pressure is a longstanding challenge. The immensity of the search space has limited efforts to systematically evaluate the impact of multiple simultaneous mutations, so mutations have typically been assessed individually. However, epistasis, or the w...

Descripción completa

Detalles Bibliográficos
Autores principales: Colom, Mireia Solà, Vucinic, Jelena, Adolf-Bryfogle, Jared, Bowman, James W., Verel, Sébastien, Moczygemba, Isabelle, Schiex, Thomas, Simoncini, David, Bahl, Christopher D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Journal Experts 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9727770/
https://www.ncbi.nlm.nih.gov/pubmed/36482980
http://dx.doi.org/10.21203/rs.3.rs-2248327/v2
_version_ 1784845096531787776
author Colom, Mireia Solà
Vucinic, Jelena
Adolf-Bryfogle, Jared
Bowman, James W.
Verel, Sébastien
Moczygemba, Isabelle
Schiex, Thomas
Simoncini, David
Bahl, Christopher D.
author_facet Colom, Mireia Solà
Vucinic, Jelena
Adolf-Bryfogle, Jared
Bowman, James W.
Verel, Sébastien
Moczygemba, Isabelle
Schiex, Thomas
Simoncini, David
Bahl, Christopher D.
author_sort Colom, Mireia Solà
collection PubMed
description Understanding how proteins evolve under selective pressure is a longstanding challenge. The immensity of the search space has limited efforts to systematically evaluate the impact of multiple simultaneous mutations, so mutations have typically been assessed individually. However, epistasis, or the way in which mutations interact, prevents accurate prediction of combinatorial mutations based on measurements of individual mutations. Here, we use artificial intelligence to define the entire functional sequence landscape of a protein binding site in silico, and we call this approach Complete Combinatorial Mutational Enumeration (CCME). By leveraging CCME, we are able to construct a comprehensive map of the evolutionary connectivity within this functional sequence landscape. As a proof of concept, we applied CCME to the ACE2 binding site of the SARS-CoV-2 spike protein receptor binding domain. We selected representative variants from across the functional sequence landscape for testing in the laboratory. We identified variants that retained functionality to bind ACE2 despite changing over 40% of evaluated residue positions, and the variants now escape binding and neutralization by monoclonal antibodies. This work represents a crucial initial stride towards achieving precise predictions of pathogen evolution, opening avenues for proactive mitigation.
format Online
Article
Text
id pubmed-9727770
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Journal Experts
record_format MEDLINE/PubMed
spelling pubmed-97277702022-12-08 Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity Colom, Mireia Solà Vucinic, Jelena Adolf-Bryfogle, Jared Bowman, James W. Verel, Sébastien Moczygemba, Isabelle Schiex, Thomas Simoncini, David Bahl, Christopher D. Res Sq Article Understanding how proteins evolve under selective pressure is a longstanding challenge. The immensity of the search space has limited efforts to systematically evaluate the impact of multiple simultaneous mutations, so mutations have typically been assessed individually. However, epistasis, or the way in which mutations interact, prevents accurate prediction of combinatorial mutations based on measurements of individual mutations. Here, we use artificial intelligence to define the entire functional sequence landscape of a protein binding site in silico, and we call this approach Complete Combinatorial Mutational Enumeration (CCME). By leveraging CCME, we are able to construct a comprehensive map of the evolutionary connectivity within this functional sequence landscape. As a proof of concept, we applied CCME to the ACE2 binding site of the SARS-CoV-2 spike protein receptor binding domain. We selected representative variants from across the functional sequence landscape for testing in the laboratory. We identified variants that retained functionality to bind ACE2 despite changing over 40% of evaluated residue positions, and the variants now escape binding and neutralization by monoclonal antibodies. This work represents a crucial initial stride towards achieving precise predictions of pathogen evolution, opening avenues for proactive mitigation. American Journal Experts 2023-09-11 /pmc/articles/PMC9727770/ /pubmed/36482980 http://dx.doi.org/10.21203/rs.3.rs-2248327/v2 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Colom, Mireia Solà
Vucinic, Jelena
Adolf-Bryfogle, Jared
Bowman, James W.
Verel, Sébastien
Moczygemba, Isabelle
Schiex, Thomas
Simoncini, David
Bahl, Christopher D.
Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity
title Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity
title_full Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity
title_fullStr Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity
title_full_unstemmed Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity
title_short Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity
title_sort complete combinatorial mutational enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9727770/
https://www.ncbi.nlm.nih.gov/pubmed/36482980
http://dx.doi.org/10.21203/rs.3.rs-2248327/v2
work_keys_str_mv AT colommireiasola completecombinatorialmutationalenumerationofaproteinfunctionalsiteenablessequencelandscapemappingandidentifieshighlymutatedvariantsthatretainactivity
AT vucinicjelena completecombinatorialmutationalenumerationofaproteinfunctionalsiteenablessequencelandscapemappingandidentifieshighlymutatedvariantsthatretainactivity
AT adolfbryfoglejared completecombinatorialmutationalenumerationofaproteinfunctionalsiteenablessequencelandscapemappingandidentifieshighlymutatedvariantsthatretainactivity
AT bowmanjamesw completecombinatorialmutationalenumerationofaproteinfunctionalsiteenablessequencelandscapemappingandidentifieshighlymutatedvariantsthatretainactivity
AT verelsebastien completecombinatorialmutationalenumerationofaproteinfunctionalsiteenablessequencelandscapemappingandidentifieshighlymutatedvariantsthatretainactivity
AT moczygembaisabelle completecombinatorialmutationalenumerationofaproteinfunctionalsiteenablessequencelandscapemappingandidentifieshighlymutatedvariantsthatretainactivity
AT schiexthomas completecombinatorialmutationalenumerationofaproteinfunctionalsiteenablessequencelandscapemappingandidentifieshighlymutatedvariantsthatretainactivity
AT simoncinidavid completecombinatorialmutationalenumerationofaproteinfunctionalsiteenablessequencelandscapemappingandidentifieshighlymutatedvariantsthatretainactivity
AT bahlchristopherd completecombinatorialmutationalenumerationofaproteinfunctionalsiteenablessequencelandscapemappingandidentifieshighlymutatedvariantsthatretainactivity