Cargando…

Data driven flexible backbone protein design

Protein design remains an important problem in computational structural biology. Current computational protein design methods largely use physics-based methods, which make use of information from a single protein structure. This is despite the fact that multiple structures of many protein folds are...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Mark G. F., Kim, Philip M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5587332/
https://www.ncbi.nlm.nih.gov/pubmed/28837553
http://dx.doi.org/10.1371/journal.pcbi.1005722
_version_ 1783261978279542784
author Sun, Mark G. F.
Kim, Philip M.
author_facet Sun, Mark G. F.
Kim, Philip M.
author_sort Sun, Mark G. F.
collection PubMed
description Protein design remains an important problem in computational structural biology. Current computational protein design methods largely use physics-based methods, which make use of information from a single protein structure. This is despite the fact that multiple structures of many protein folds are now readily available in the PDB. While ensemble protein design methods can use multiple protein structures, they treat each structure independently. Here, we introduce a flexible backbone strategy, FlexiBaL-GP, which learns global protein backbone movements directly from multiple protein structures. FlexiBaL-GP uses the machine learning method of Gaussian Process Latent Variable Models to learn a lower dimensional representation of the protein coordinates that best represent backbone movements. These learned backbone movements are used to explore alternative protein backbones, while engineering a protein within a parallel tempered MCMC framework. Using the human ubiquitin–USP21 complex as a model we demonstrate that our design strategy outperforms current strategies for the interface design task of identifying tight binding ubiquitin variants for USP21.
format Online
Article
Text
id pubmed-5587332
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55873322017-09-22 Data driven flexible backbone protein design Sun, Mark G. F. Kim, Philip M. PLoS Comput Biol Research Article Protein design remains an important problem in computational structural biology. Current computational protein design methods largely use physics-based methods, which make use of information from a single protein structure. This is despite the fact that multiple structures of many protein folds are now readily available in the PDB. While ensemble protein design methods can use multiple protein structures, they treat each structure independently. Here, we introduce a flexible backbone strategy, FlexiBaL-GP, which learns global protein backbone movements directly from multiple protein structures. FlexiBaL-GP uses the machine learning method of Gaussian Process Latent Variable Models to learn a lower dimensional representation of the protein coordinates that best represent backbone movements. These learned backbone movements are used to explore alternative protein backbones, while engineering a protein within a parallel tempered MCMC framework. Using the human ubiquitin–USP21 complex as a model we demonstrate that our design strategy outperforms current strategies for the interface design task of identifying tight binding ubiquitin variants for USP21. Public Library of Science 2017-08-24 /pmc/articles/PMC5587332/ /pubmed/28837553 http://dx.doi.org/10.1371/journal.pcbi.1005722 Text en © 2017 Sun, Kim http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sun, Mark G. F.
Kim, Philip M.
Data driven flexible backbone protein design
title Data driven flexible backbone protein design
title_full Data driven flexible backbone protein design
title_fullStr Data driven flexible backbone protein design
title_full_unstemmed Data driven flexible backbone protein design
title_short Data driven flexible backbone protein design
title_sort data driven flexible backbone protein design
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5587332/
https://www.ncbi.nlm.nih.gov/pubmed/28837553
http://dx.doi.org/10.1371/journal.pcbi.1005722
work_keys_str_mv AT sunmarkgf datadrivenflexiblebackboneproteindesign
AT kimphilipm datadrivenflexiblebackboneproteindesign