Cargando…

Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths

Proteins share similar segments with one another. Such “reused parts”—which have been successfully incorporated into other proteins—are likely to offer an evolutionary advantage over de novo evolved segments, as most of the latter will not even have the capacity to fold. To systematically explore th...

Descripción completa

Detalles Bibliográficos
Autores principales: Nepomnyachiy, Sergey, Ben-Tal, Nir, Kolodny, Rachel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5676897/
https://www.ncbi.nlm.nih.gov/pubmed/29078314
http://dx.doi.org/10.1073/pnas.1707642114
_version_ 1783277147114176512
author Nepomnyachiy, Sergey
Ben-Tal, Nir
Kolodny, Rachel
author_facet Nepomnyachiy, Sergey
Ben-Tal, Nir
Kolodny, Rachel
author_sort Nepomnyachiy, Sergey
collection PubMed
description Proteins share similar segments with one another. Such “reused parts”—which have been successfully incorporated into other proteins—are likely to offer an evolutionary advantage over de novo evolved segments, as most of the latter will not even have the capacity to fold. To systematically explore the evolutionary traces of segment “reuse” across proteins, we developed an automated methodology that identifies reused segments from protein alignments. We search for “themes”—segments of at least 35 residues of similar sequence and structure—reused within representative sets of 15,016 domains [Evolutionary Classification of Protein Domains (ECOD) database] or 20,398 chains [Protein Data Bank (PDB)]. We observe that theme reuse is highly prevalent and that reuse is more extensive when the length threshold for identifying a theme is lower. Structural domains, the best characterized form of reuse in proteins, are just one of many complex and intertwined evolutionary traces. Others include long themes shared among a few proteins, which encompass and overlap with shorter themes that recur in numerous proteins. The observed complexity is consistent with evolution by duplication and divergence, and some of the themes might include descendants of ancestral segments. The observed recursive footprints, where the same amino acid can simultaneously participate in several intertwined themes, could be a useful concept for protein design. Data are available at http://trachel-srv.cs.haifa.ac.il/rachel/ppi/themes/.
format Online
Article
Text
id pubmed-5676897
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-56768972017-11-15 Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths Nepomnyachiy, Sergey Ben-Tal, Nir Kolodny, Rachel Proc Natl Acad Sci U S A Biological Sciences Proteins share similar segments with one another. Such “reused parts”—which have been successfully incorporated into other proteins—are likely to offer an evolutionary advantage over de novo evolved segments, as most of the latter will not even have the capacity to fold. To systematically explore the evolutionary traces of segment “reuse” across proteins, we developed an automated methodology that identifies reused segments from protein alignments. We search for “themes”—segments of at least 35 residues of similar sequence and structure—reused within representative sets of 15,016 domains [Evolutionary Classification of Protein Domains (ECOD) database] or 20,398 chains [Protein Data Bank (PDB)]. We observe that theme reuse is highly prevalent and that reuse is more extensive when the length threshold for identifying a theme is lower. Structural domains, the best characterized form of reuse in proteins, are just one of many complex and intertwined evolutionary traces. Others include long themes shared among a few proteins, which encompass and overlap with shorter themes that recur in numerous proteins. The observed complexity is consistent with evolution by duplication and divergence, and some of the themes might include descendants of ancestral segments. The observed recursive footprints, where the same amino acid can simultaneously participate in several intertwined themes, could be a useful concept for protein design. Data are available at http://trachel-srv.cs.haifa.ac.il/rachel/ppi/themes/. National Academy of Sciences 2017-10-31 2017-10-19 /pmc/articles/PMC5676897/ /pubmed/29078314 http://dx.doi.org/10.1073/pnas.1707642114 Text en Copyright © 2017 the Author(s). Published by PNAS. This is an open access article distributed under the PNAS license (http://www.pnas.org/site/aboutpnas/licenses.xhtml) .http://www.pnas.org/site/aboutpnas/licenses.xhtml
spellingShingle Biological Sciences
Nepomnyachiy, Sergey
Ben-Tal, Nir
Kolodny, Rachel
Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
title Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
title_full Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
title_fullStr Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
title_full_unstemmed Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
title_short Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
title_sort complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5676897/
https://www.ncbi.nlm.nih.gov/pubmed/29078314
http://dx.doi.org/10.1073/pnas.1707642114
work_keys_str_mv AT nepomnyachiysergey complexevolutionaryfootprintsrevealedinananalysisofreusedproteinsegmentsofdiverselengths
AT bentalnir complexevolutionaryfootprintsrevealedinananalysisofreusedproteinsegmentsofdiverselengths
AT kolodnyrachel complexevolutionaryfootprintsrevealedinananalysisofreusedproteinsegmentsofdiverselengths