Cargando…

Emergence of Protein Fold Families through Rational Design

Diverse proteins with similar structures are grouped into families of homologs and analogs, if their sequence similarity is higher or lower, respectively, than 20%–30%. It was suggested that protein homologs and analogs originate from a common ancestor and diverge in their distinct evolutionary time...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, Feng, Dokholyan, Nikolay V
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1487181/
https://www.ncbi.nlm.nih.gov/pubmed/16839198
http://dx.doi.org/10.1371/journal.pcbi.0020085
_version_ 1782128340820295680
author Ding, Feng
Dokholyan, Nikolay V
author_facet Ding, Feng
Dokholyan, Nikolay V
author_sort Ding, Feng
collection PubMed
description Diverse proteins with similar structures are grouped into families of homologs and analogs, if their sequence similarity is higher or lower, respectively, than 20%–30%. It was suggested that protein homologs and analogs originate from a common ancestor and diverge in their distinct evolutionary time scales, emerging as a consequence of the physical properties of the protein sequence space. Although a number of studies have determined key signatures of protein family organization, the sequence-structure factors that differentiate the two evolution-related protein families remain unknown. Here, we stipulate that subtle structural changes, which appear due to accumulating mutations in the homologous families, lead to distinct packing of the protein core and, thus, novel compositions of core residues. The latter process leads to the formation of distinct families of homologs. We propose that such differentiation results in the formation of analogous families. To test our postulate, we developed a molecular modeling and design toolkit, Medusa, to computationally design protein sequences that correspond to the same fold family. We find that analogous proteins emerge when a backbone structure deviates only 1–2 Å root-mean-square deviation from the original structure. For close homologs, core residues are highly conserved. However, when the overall sequence similarity drops to ~25%–30%, the composition of core residues starts to diverge, thereby forming novel families of protein homologs. This direct observation of the formation of protein homologs within a specific fold family supports our hypothesis. The conservation of amino acids in designed sequences recapitulates that of the naturally occurring sequences, thereby validating our computational design methodology.
format Text
id pubmed-1487181
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-14871812006-07-07 Emergence of Protein Fold Families through Rational Design Ding, Feng Dokholyan, Nikolay V PLoS Comput Biol Research Article Diverse proteins with similar structures are grouped into families of homologs and analogs, if their sequence similarity is higher or lower, respectively, than 20%–30%. It was suggested that protein homologs and analogs originate from a common ancestor and diverge in their distinct evolutionary time scales, emerging as a consequence of the physical properties of the protein sequence space. Although a number of studies have determined key signatures of protein family organization, the sequence-structure factors that differentiate the two evolution-related protein families remain unknown. Here, we stipulate that subtle structural changes, which appear due to accumulating mutations in the homologous families, lead to distinct packing of the protein core and, thus, novel compositions of core residues. The latter process leads to the formation of distinct families of homologs. We propose that such differentiation results in the formation of analogous families. To test our postulate, we developed a molecular modeling and design toolkit, Medusa, to computationally design protein sequences that correspond to the same fold family. We find that analogous proteins emerge when a backbone structure deviates only 1–2 Å root-mean-square deviation from the original structure. For close homologs, core residues are highly conserved. However, when the overall sequence similarity drops to ~25%–30%, the composition of core residues starts to diverge, thereby forming novel families of protein homologs. This direct observation of the formation of protein homologs within a specific fold family supports our hypothesis. The conservation of amino acids in designed sequences recapitulates that of the naturally occurring sequences, thereby validating our computational design methodology. Public Library of Science 2006-07 2006-07-07 /pmc/articles/PMC1487181/ /pubmed/16839198 http://dx.doi.org/10.1371/journal.pcbi.0020085 Text en © 2006 Ding and Dokholyan. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ding, Feng
Dokholyan, Nikolay V
Emergence of Protein Fold Families through Rational Design
title Emergence of Protein Fold Families through Rational Design
title_full Emergence of Protein Fold Families through Rational Design
title_fullStr Emergence of Protein Fold Families through Rational Design
title_full_unstemmed Emergence of Protein Fold Families through Rational Design
title_short Emergence of Protein Fold Families through Rational Design
title_sort emergence of protein fold families through rational design
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1487181/
https://www.ncbi.nlm.nih.gov/pubmed/16839198
http://dx.doi.org/10.1371/journal.pcbi.0020085
work_keys_str_mv AT dingfeng emergenceofproteinfoldfamiliesthroughrationaldesign
AT dokholyannikolayv emergenceofproteinfoldfamiliesthroughrationaldesign