Cargando…

A general-purpose protein design framework based on mining sequence–structure relationships in known protein structures

Current state-of-the-art approaches to computational protein design (CPD) aim to capture the determinants of structure from physical principles. While this has led to many successful designs, it does have strong limitations associated with inaccuracies in physical modeling, such that a reliable gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Jianfu, Panaitiu, Alexandra E., Grigoryan, Gevorg
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6969538/
https://www.ncbi.nlm.nih.gov/pubmed/31892539
http://dx.doi.org/10.1073/pnas.1908723117
_version_ 1783489343787106304
author Zhou, Jianfu
Panaitiu, Alexandra E.
Grigoryan, Gevorg
author_facet Zhou, Jianfu
Panaitiu, Alexandra E.
Grigoryan, Gevorg
author_sort Zhou, Jianfu
collection PubMed
description Current state-of-the-art approaches to computational protein design (CPD) aim to capture the determinants of structure from physical principles. While this has led to many successful designs, it does have strong limitations associated with inaccuracies in physical modeling, such that a reliable general solution to CPD has yet to be found. Here, we propose a design framework—one based on identifying and applying patterns of sequence–structure compatibility found in known proteins, rather than approximating them from models of interatomic interactions. We carry out extensive computational analyses and an experimental validation for our method. Our results strongly argue that the Protein Data Bank is now sufficiently large to enable proteins to be designed by using only examples of structural motifs from unrelated proteins. Because our method is likely to have orthogonal strengths relative to existing techniques, it could represent an important step toward removing remaining barriers to robust CPD.
format Online
Article
Text
id pubmed-6969538
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-69695382020-01-27 A general-purpose protein design framework based on mining sequence–structure relationships in known protein structures Zhou, Jianfu Panaitiu, Alexandra E. Grigoryan, Gevorg Proc Natl Acad Sci U S A PNAS Plus Current state-of-the-art approaches to computational protein design (CPD) aim to capture the determinants of structure from physical principles. While this has led to many successful designs, it does have strong limitations associated with inaccuracies in physical modeling, such that a reliable general solution to CPD has yet to be found. Here, we propose a design framework—one based on identifying and applying patterns of sequence–structure compatibility found in known proteins, rather than approximating them from models of interatomic interactions. We carry out extensive computational analyses and an experimental validation for our method. Our results strongly argue that the Protein Data Bank is now sufficiently large to enable proteins to be designed by using only examples of structural motifs from unrelated proteins. Because our method is likely to have orthogonal strengths relative to existing techniques, it could represent an important step toward removing remaining barriers to robust CPD. National Academy of Sciences 2020-01-14 2019-12-31 /pmc/articles/PMC6969538/ /pubmed/31892539 http://dx.doi.org/10.1073/pnas.1908723117 Text en Copyright © 2020 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle PNAS Plus
Zhou, Jianfu
Panaitiu, Alexandra E.
Grigoryan, Gevorg
A general-purpose protein design framework based on mining sequence–structure relationships in known protein structures
title A general-purpose protein design framework based on mining sequence–structure relationships in known protein structures
title_full A general-purpose protein design framework based on mining sequence–structure relationships in known protein structures
title_fullStr A general-purpose protein design framework based on mining sequence–structure relationships in known protein structures
title_full_unstemmed A general-purpose protein design framework based on mining sequence–structure relationships in known protein structures
title_short A general-purpose protein design framework based on mining sequence–structure relationships in known protein structures
title_sort general-purpose protein design framework based on mining sequence–structure relationships in known protein structures
topic PNAS Plus
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6969538/
https://www.ncbi.nlm.nih.gov/pubmed/31892539
http://dx.doi.org/10.1073/pnas.1908723117
work_keys_str_mv AT zhoujianfu ageneralpurposeproteindesignframeworkbasedonminingsequencestructurerelationshipsinknownproteinstructures
AT panaitiualexandrae ageneralpurposeproteindesignframeworkbasedonminingsequencestructurerelationshipsinknownproteinstructures
AT grigoryangevorg ageneralpurposeproteindesignframeworkbasedonminingsequencestructurerelationshipsinknownproteinstructures
AT zhoujianfu generalpurposeproteindesignframeworkbasedonminingsequencestructurerelationshipsinknownproteinstructures
AT panaitiualexandrae generalpurposeproteindesignframeworkbasedonminingsequencestructurerelationshipsinknownproteinstructures
AT grigoryangevorg generalpurposeproteindesignframeworkbasedonminingsequencestructurerelationshipsinknownproteinstructures