Cargando…

De novo protein fold design through sequence-independent fragment assembly simulations

De novo protein design generally consists of two steps, including structure and sequence design. Many protein design studies have focused on sequence design with scaffolds adapted from native structures in the PDB, which renders novel areas of protein structure and function space unexplored. We deve...

Descripción completa

Detalles Bibliográficos
Autores principales: Pearce, Robin, Huang, Xiaoqiang, Omenn, Gilbert S., Zhang, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9942881/
https://www.ncbi.nlm.nih.gov/pubmed/36656852
http://dx.doi.org/10.1073/pnas.2208275120
_version_ 1784891594506240000
author Pearce, Robin
Huang, Xiaoqiang
Omenn, Gilbert S.
Zhang, Yang
author_facet Pearce, Robin
Huang, Xiaoqiang
Omenn, Gilbert S.
Zhang, Yang
author_sort Pearce, Robin
collection PubMed
description De novo protein design generally consists of two steps, including structure and sequence design. Many protein design studies have focused on sequence design with scaffolds adapted from native structures in the PDB, which renders novel areas of protein structure and function space unexplored. We developed FoldDesign to create novel protein folds from specific secondary structure (SS) assignments through sequence-independent replica-exchange Monte Carlo (REMC) simulations. The method was tested on 354 non-redundant topologies, where FoldDesign consistently created stable structural folds, while recapitulating on average 87.7% of the SS elements. Meanwhile, the FoldDesign scaffolds had well-formed structures with buried residues and solvent-exposed areas closely matching their native counterparts. Despite the high fidelity to the input SS restraints and local structural characteristics of native proteins, a large portion of the designed scaffolds possessed global folds completely different from natural proteins in the PDB, highlighting the ability of FoldDesign to explore novel areas of protein fold space. Detailed data analyses revealed that the major contributions to the successful structure design lay in the optimal energy force field, which contains a balanced set of SS packing terms, and REMC simulations, which were coupled with multiple auxiliary movements to efficiently search the conformational space. Additionally, the ability to recognize and assemble uncommon super-SS geometries, rather than the unique arrangement of common SS motifs, was the key to generating novel folds. These results demonstrate a strong potential to explore both structural and functional spaces through computational design simulations that natural proteins have not reached through evolution.
format Online
Article
Text
id pubmed-9942881
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-99428812023-07-19 De novo protein fold design through sequence-independent fragment assembly simulations Pearce, Robin Huang, Xiaoqiang Omenn, Gilbert S. Zhang, Yang Proc Natl Acad Sci U S A Biological Sciences De novo protein design generally consists of two steps, including structure and sequence design. Many protein design studies have focused on sequence design with scaffolds adapted from native structures in the PDB, which renders novel areas of protein structure and function space unexplored. We developed FoldDesign to create novel protein folds from specific secondary structure (SS) assignments through sequence-independent replica-exchange Monte Carlo (REMC) simulations. The method was tested on 354 non-redundant topologies, where FoldDesign consistently created stable structural folds, while recapitulating on average 87.7% of the SS elements. Meanwhile, the FoldDesign scaffolds had well-formed structures with buried residues and solvent-exposed areas closely matching their native counterparts. Despite the high fidelity to the input SS restraints and local structural characteristics of native proteins, a large portion of the designed scaffolds possessed global folds completely different from natural proteins in the PDB, highlighting the ability of FoldDesign to explore novel areas of protein fold space. Detailed data analyses revealed that the major contributions to the successful structure design lay in the optimal energy force field, which contains a balanced set of SS packing terms, and REMC simulations, which were coupled with multiple auxiliary movements to efficiently search the conformational space. Additionally, the ability to recognize and assemble uncommon super-SS geometries, rather than the unique arrangement of common SS motifs, was the key to generating novel folds. These results demonstrate a strong potential to explore both structural and functional spaces through computational design simulations that natural proteins have not reached through evolution. National Academy of Sciences 2023-01-19 2023-01-24 /pmc/articles/PMC9942881/ /pubmed/36656852 http://dx.doi.org/10.1073/pnas.2208275120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Pearce, Robin
Huang, Xiaoqiang
Omenn, Gilbert S.
Zhang, Yang
De novo protein fold design through sequence-independent fragment assembly simulations
title De novo protein fold design through sequence-independent fragment assembly simulations
title_full De novo protein fold design through sequence-independent fragment assembly simulations
title_fullStr De novo protein fold design through sequence-independent fragment assembly simulations
title_full_unstemmed De novo protein fold design through sequence-independent fragment assembly simulations
title_short De novo protein fold design through sequence-independent fragment assembly simulations
title_sort de novo protein fold design through sequence-independent fragment assembly simulations
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9942881/
https://www.ncbi.nlm.nih.gov/pubmed/36656852
http://dx.doi.org/10.1073/pnas.2208275120
work_keys_str_mv AT pearcerobin denovoproteinfolddesignthroughsequenceindependentfragmentassemblysimulations
AT huangxiaoqiang denovoproteinfolddesignthroughsequenceindependentfragmentassemblysimulations
AT omenngilberts denovoproteinfolddesignthroughsequenceindependentfragmentassemblysimulations
AT zhangyang denovoproteinfolddesignthroughsequenceindependentfragmentassemblysimulations