Cargando…
De novo protein fold design through sequence-independent fragment assembly simulations
De novo protein design generally consists of two steps, including structure and sequence design. Many protein design studies have focused on sequence design with scaffolds adapted from native structures in the PDB, which renders novel areas of protein structure and function space unexplored. We deve...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9942881/ https://www.ncbi.nlm.nih.gov/pubmed/36656852 http://dx.doi.org/10.1073/pnas.2208275120 |
_version_ | 1784891594506240000 |
---|---|
author | Pearce, Robin Huang, Xiaoqiang Omenn, Gilbert S. Zhang, Yang |
author_facet | Pearce, Robin Huang, Xiaoqiang Omenn, Gilbert S. Zhang, Yang |
author_sort | Pearce, Robin |
collection | PubMed |
description | De novo protein design generally consists of two steps, including structure and sequence design. Many protein design studies have focused on sequence design with scaffolds adapted from native structures in the PDB, which renders novel areas of protein structure and function space unexplored. We developed FoldDesign to create novel protein folds from specific secondary structure (SS) assignments through sequence-independent replica-exchange Monte Carlo (REMC) simulations. The method was tested on 354 non-redundant topologies, where FoldDesign consistently created stable structural folds, while recapitulating on average 87.7% of the SS elements. Meanwhile, the FoldDesign scaffolds had well-formed structures with buried residues and solvent-exposed areas closely matching their native counterparts. Despite the high fidelity to the input SS restraints and local structural characteristics of native proteins, a large portion of the designed scaffolds possessed global folds completely different from natural proteins in the PDB, highlighting the ability of FoldDesign to explore novel areas of protein fold space. Detailed data analyses revealed that the major contributions to the successful structure design lay in the optimal energy force field, which contains a balanced set of SS packing terms, and REMC simulations, which were coupled with multiple auxiliary movements to efficiently search the conformational space. Additionally, the ability to recognize and assemble uncommon super-SS geometries, rather than the unique arrangement of common SS motifs, was the key to generating novel folds. These results demonstrate a strong potential to explore both structural and functional spaces through computational design simulations that natural proteins have not reached through evolution. |
format | Online Article Text |
id | pubmed-9942881 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-99428812023-07-19 De novo protein fold design through sequence-independent fragment assembly simulations Pearce, Robin Huang, Xiaoqiang Omenn, Gilbert S. Zhang, Yang Proc Natl Acad Sci U S A Biological Sciences De novo protein design generally consists of two steps, including structure and sequence design. Many protein design studies have focused on sequence design with scaffolds adapted from native structures in the PDB, which renders novel areas of protein structure and function space unexplored. We developed FoldDesign to create novel protein folds from specific secondary structure (SS) assignments through sequence-independent replica-exchange Monte Carlo (REMC) simulations. The method was tested on 354 non-redundant topologies, where FoldDesign consistently created stable structural folds, while recapitulating on average 87.7% of the SS elements. Meanwhile, the FoldDesign scaffolds had well-formed structures with buried residues and solvent-exposed areas closely matching their native counterparts. Despite the high fidelity to the input SS restraints and local structural characteristics of native proteins, a large portion of the designed scaffolds possessed global folds completely different from natural proteins in the PDB, highlighting the ability of FoldDesign to explore novel areas of protein fold space. Detailed data analyses revealed that the major contributions to the successful structure design lay in the optimal energy force field, which contains a balanced set of SS packing terms, and REMC simulations, which were coupled with multiple auxiliary movements to efficiently search the conformational space. Additionally, the ability to recognize and assemble uncommon super-SS geometries, rather than the unique arrangement of common SS motifs, was the key to generating novel folds. These results demonstrate a strong potential to explore both structural and functional spaces through computational design simulations that natural proteins have not reached through evolution. National Academy of Sciences 2023-01-19 2023-01-24 /pmc/articles/PMC9942881/ /pubmed/36656852 http://dx.doi.org/10.1073/pnas.2208275120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Biological Sciences Pearce, Robin Huang, Xiaoqiang Omenn, Gilbert S. Zhang, Yang De novo protein fold design through sequence-independent fragment assembly simulations |
title | De novo protein fold design through sequence-independent fragment assembly simulations |
title_full | De novo protein fold design through sequence-independent fragment assembly simulations |
title_fullStr | De novo protein fold design through sequence-independent fragment assembly simulations |
title_full_unstemmed | De novo protein fold design through sequence-independent fragment assembly simulations |
title_short | De novo protein fold design through sequence-independent fragment assembly simulations |
title_sort | de novo protein fold design through sequence-independent fragment assembly simulations |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9942881/ https://www.ncbi.nlm.nih.gov/pubmed/36656852 http://dx.doi.org/10.1073/pnas.2208275120 |
work_keys_str_mv | AT pearcerobin denovoproteinfolddesignthroughsequenceindependentfragmentassemblysimulations AT huangxiaoqiang denovoproteinfolddesignthroughsequenceindependentfragmentassemblysimulations AT omenngilberts denovoproteinfolddesignthroughsequenceindependentfragmentassemblysimulations AT zhangyang denovoproteinfolddesignthroughsequenceindependentfragmentassemblysimulations |