Cargando…

Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device

The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Infor...

Descripción completa

Detalles Bibliográficos
Autor principal: Solis, Armando D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4045576/
https://www.ncbi.nlm.nih.gov/pubmed/24896099
http://dx.doi.org/10.1371/journal.pone.0094334
_version_ 1782319343852322816
author Solis, Armando D.
author_facet Solis, Armando D.
author_sort Solis, Armando D.
collection PubMed
description The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Information Maximization Device (IMD) is established, based on fundamental information-theoretic concepts, and then applied specifically to derive highly resolved phi-psi maps for all 20 single amino acid and all 8000 triplet sequences at an optimal resolution determined by the volume of current data. The paper shows that utilizing the latent information contained in all viable high-resolution crystal structures found in the Protein Data Bank (PDB), totaling more than 77,000 chains, permits the derivation of a large number of optimized sequence-dependent PDFs. This work demonstrates the effectiveness of the IMD and the superiority of the resulting PDFs by extensive fold recognition experiments and rigorous comparisons with previously published triplet PDFs. Because it automatically optimizes PDFs, IMD results in improved performance of knowledge-based potentials, which rely on such PDFs. Furthermore, it provides an easy computational recipe for empirically deriving other kinds of sequence-dependent structural PDFs with greater detail and precision. The high-resolution phi-psi maps derived in this work are available for download.
format Online
Article
Text
id pubmed-4045576
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40455762014-06-09 Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device Solis, Armando D. PLoS One Research Article The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Information Maximization Device (IMD) is established, based on fundamental information-theoretic concepts, and then applied specifically to derive highly resolved phi-psi maps for all 20 single amino acid and all 8000 triplet sequences at an optimal resolution determined by the volume of current data. The paper shows that utilizing the latent information contained in all viable high-resolution crystal structures found in the Protein Data Bank (PDB), totaling more than 77,000 chains, permits the derivation of a large number of optimized sequence-dependent PDFs. This work demonstrates the effectiveness of the IMD and the superiority of the resulting PDFs by extensive fold recognition experiments and rigorous comparisons with previously published triplet PDFs. Because it automatically optimizes PDFs, IMD results in improved performance of knowledge-based potentials, which rely on such PDFs. Furthermore, it provides an easy computational recipe for empirically deriving other kinds of sequence-dependent structural PDFs with greater detail and precision. The high-resolution phi-psi maps derived in this work are available for download. Public Library of Science 2014-06-04 /pmc/articles/PMC4045576/ /pubmed/24896099 http://dx.doi.org/10.1371/journal.pone.0094334 Text en © 2014 Armando D http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Solis, Armando D.
Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device
title Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device
title_full Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device
title_fullStr Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device
title_full_unstemmed Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device
title_short Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device
title_sort deriving high-resolution protein backbone structure propensities from all crystal data using the information maximization device
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4045576/
https://www.ncbi.nlm.nih.gov/pubmed/24896099
http://dx.doi.org/10.1371/journal.pone.0094334
work_keys_str_mv AT solisarmandod derivinghighresolutionproteinbackbonestructurepropensitiesfromallcrystaldatausingtheinformationmaximizationdevice