Cargando…

Amino-acid site variability among natural and designed proteins

Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural p...

Descripción completa

Detalles Bibliográficos
Autores principales: Jackson, Eleisha L., Ollikainen, Noah, Covert, Arthur W., Kortemme, Tanja, Wilke, Claus O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3828621/
https://www.ncbi.nlm.nih.gov/pubmed/24255821
http://dx.doi.org/10.7717/peerj.211
_version_ 1782291270620676096
author Jackson, Eleisha L.
Ollikainen, Noah
Covert, Arthur W.
Kortemme, Tanja
Wilke, Claus O.
author_facet Jackson, Eleisha L.
Ollikainen, Noah
Covert, Arthur W.
Kortemme, Tanja
Wilke, Claus O.
author_sort Jackson, Eleisha L.
collection PubMed
description Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability.
format Online
Article
Text
id pubmed-3828621
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-38286212013-11-19 Amino-acid site variability among natural and designed proteins Jackson, Eleisha L. Ollikainen, Noah Covert, Arthur W. Kortemme, Tanja Wilke, Claus O. PeerJ Biochemistry Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability. PeerJ Inc. 2013-11-12 /pmc/articles/PMC3828621/ /pubmed/24255821 http://dx.doi.org/10.7717/peerj.211 Text en © 2013 Jackson et al. http://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Biochemistry
Jackson, Eleisha L.
Ollikainen, Noah
Covert, Arthur W.
Kortemme, Tanja
Wilke, Claus O.
Amino-acid site variability among natural and designed proteins
title Amino-acid site variability among natural and designed proteins
title_full Amino-acid site variability among natural and designed proteins
title_fullStr Amino-acid site variability among natural and designed proteins
title_full_unstemmed Amino-acid site variability among natural and designed proteins
title_short Amino-acid site variability among natural and designed proteins
title_sort amino-acid site variability among natural and designed proteins
topic Biochemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3828621/
https://www.ncbi.nlm.nih.gov/pubmed/24255821
http://dx.doi.org/10.7717/peerj.211
work_keys_str_mv AT jacksoneleishal aminoacidsitevariabilityamongnaturalanddesignedproteins
AT ollikainennoah aminoacidsitevariabilityamongnaturalanddesignedproteins
AT covertarthurw aminoacidsitevariabilityamongnaturalanddesignedproteins
AT kortemmetanja aminoacidsitevariabilityamongnaturalanddesignedproteins
AT wilkeclauso aminoacidsitevariabilityamongnaturalanddesignedproteins