Cargando…

Exploring the Universe of Protein Structures beyond the Protein Data Bank

It is currently believed that the atlas of existing protein structures is faithfully represented in the Protein Data Bank. However, whether this atlas covers the full universe of all possible protein structures is still a highly debated issue. By using a sophisticated numerical approach, we performe...

Descripción completa

Detalles Bibliográficos
Autores principales: Cossio, Pilar, Trovato, Antonio, Pietrucci, Fabio, Seno, Flavio, Maritan, Amos, Laio, Alessandro
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2973819/
https://www.ncbi.nlm.nih.gov/pubmed/21079678
http://dx.doi.org/10.1371/journal.pcbi.1000957
_version_ 1782190839798169600
author Cossio, Pilar
Trovato, Antonio
Pietrucci, Fabio
Seno, Flavio
Maritan, Amos
Laio, Alessandro
author_facet Cossio, Pilar
Trovato, Antonio
Pietrucci, Fabio
Seno, Flavio
Maritan, Amos
Laio, Alessandro
author_sort Cossio, Pilar
collection PubMed
description It is currently believed that the atlas of existing protein structures is faithfully represented in the Protein Data Bank. However, whether this atlas covers the full universe of all possible protein structures is still a highly debated issue. By using a sophisticated numerical approach, we performed an exhaustive exploration of the conformational space of a 60 amino acid polypeptide chain described with an accurate all-atom interaction potential. We generated a database of around 30,000 compact folds with at least [Image: see text] of secondary structure corresponding to local minima of the potential energy. This ensemble plausibly represents the universe of protein folds of similar length; indeed, all the known folds are represented in the set with good accuracy. However, we discover that the known folds form a rather small subset, which cannot be reproduced by choosing random structures in the database. Rather, natural and possible folds differ by the contact order, on average significantly smaller in the former. This suggests the presence of an evolutionary bias, possibly related to kinetic accessibility, towards structures with shorter loops between contacting residues. Beside their conceptual relevance, the new structures open a range of practical applications such as the development of accurate structure prediction strategies, the optimization of force fields, and the identification and design of novel folds.
format Text
id pubmed-2973819
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-29738192010-11-15 Exploring the Universe of Protein Structures beyond the Protein Data Bank Cossio, Pilar Trovato, Antonio Pietrucci, Fabio Seno, Flavio Maritan, Amos Laio, Alessandro PLoS Comput Biol Research Article It is currently believed that the atlas of existing protein structures is faithfully represented in the Protein Data Bank. However, whether this atlas covers the full universe of all possible protein structures is still a highly debated issue. By using a sophisticated numerical approach, we performed an exhaustive exploration of the conformational space of a 60 amino acid polypeptide chain described with an accurate all-atom interaction potential. We generated a database of around 30,000 compact folds with at least [Image: see text] of secondary structure corresponding to local minima of the potential energy. This ensemble plausibly represents the universe of protein folds of similar length; indeed, all the known folds are represented in the set with good accuracy. However, we discover that the known folds form a rather small subset, which cannot be reproduced by choosing random structures in the database. Rather, natural and possible folds differ by the contact order, on average significantly smaller in the former. This suggests the presence of an evolutionary bias, possibly related to kinetic accessibility, towards structures with shorter loops between contacting residues. Beside their conceptual relevance, the new structures open a range of practical applications such as the development of accurate structure prediction strategies, the optimization of force fields, and the identification and design of novel folds. Public Library of Science 2010-11-04 /pmc/articles/PMC2973819/ /pubmed/21079678 http://dx.doi.org/10.1371/journal.pcbi.1000957 Text en Cossio et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Cossio, Pilar
Trovato, Antonio
Pietrucci, Fabio
Seno, Flavio
Maritan, Amos
Laio, Alessandro
Exploring the Universe of Protein Structures beyond the Protein Data Bank
title Exploring the Universe of Protein Structures beyond the Protein Data Bank
title_full Exploring the Universe of Protein Structures beyond the Protein Data Bank
title_fullStr Exploring the Universe of Protein Structures beyond the Protein Data Bank
title_full_unstemmed Exploring the Universe of Protein Structures beyond the Protein Data Bank
title_short Exploring the Universe of Protein Structures beyond the Protein Data Bank
title_sort exploring the universe of protein structures beyond the protein data bank
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2973819/
https://www.ncbi.nlm.nih.gov/pubmed/21079678
http://dx.doi.org/10.1371/journal.pcbi.1000957
work_keys_str_mv AT cossiopilar exploringtheuniverseofproteinstructuresbeyondtheproteindatabank
AT trovatoantonio exploringtheuniverseofproteinstructuresbeyondtheproteindatabank
AT pietruccifabio exploringtheuniverseofproteinstructuresbeyondtheproteindatabank
AT senoflavio exploringtheuniverseofproteinstructuresbeyondtheproteindatabank
AT maritanamos exploringtheuniverseofproteinstructuresbeyondtheproteindatabank
AT laioalessandro exploringtheuniverseofproteinstructuresbeyondtheproteindatabank