Cargando…
Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device
The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Infor...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4045576/ https://www.ncbi.nlm.nih.gov/pubmed/24896099 http://dx.doi.org/10.1371/journal.pone.0094334 |
_version_ | 1782319343852322816 |
---|---|
author | Solis, Armando D. |
author_facet | Solis, Armando D. |
author_sort | Solis, Armando D. |
collection | PubMed |
description | The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Information Maximization Device (IMD) is established, based on fundamental information-theoretic concepts, and then applied specifically to derive highly resolved phi-psi maps for all 20 single amino acid and all 8000 triplet sequences at an optimal resolution determined by the volume of current data. The paper shows that utilizing the latent information contained in all viable high-resolution crystal structures found in the Protein Data Bank (PDB), totaling more than 77,000 chains, permits the derivation of a large number of optimized sequence-dependent PDFs. This work demonstrates the effectiveness of the IMD and the superiority of the resulting PDFs by extensive fold recognition experiments and rigorous comparisons with previously published triplet PDFs. Because it automatically optimizes PDFs, IMD results in improved performance of knowledge-based potentials, which rely on such PDFs. Furthermore, it provides an easy computational recipe for empirically deriving other kinds of sequence-dependent structural PDFs with greater detail and precision. The high-resolution phi-psi maps derived in this work are available for download. |
format | Online Article Text |
id | pubmed-4045576 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-40455762014-06-09 Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device Solis, Armando D. PLoS One Research Article The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Information Maximization Device (IMD) is established, based on fundamental information-theoretic concepts, and then applied specifically to derive highly resolved phi-psi maps for all 20 single amino acid and all 8000 triplet sequences at an optimal resolution determined by the volume of current data. The paper shows that utilizing the latent information contained in all viable high-resolution crystal structures found in the Protein Data Bank (PDB), totaling more than 77,000 chains, permits the derivation of a large number of optimized sequence-dependent PDFs. This work demonstrates the effectiveness of the IMD and the superiority of the resulting PDFs by extensive fold recognition experiments and rigorous comparisons with previously published triplet PDFs. Because it automatically optimizes PDFs, IMD results in improved performance of knowledge-based potentials, which rely on such PDFs. Furthermore, it provides an easy computational recipe for empirically deriving other kinds of sequence-dependent structural PDFs with greater detail and precision. The high-resolution phi-psi maps derived in this work are available for download. Public Library of Science 2014-06-04 /pmc/articles/PMC4045576/ /pubmed/24896099 http://dx.doi.org/10.1371/journal.pone.0094334 Text en © 2014 Armando D http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Solis, Armando D. Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device |
title | Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device |
title_full | Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device |
title_fullStr | Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device |
title_full_unstemmed | Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device |
title_short | Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device |
title_sort | deriving high-resolution protein backbone structure propensities from all crystal data using the information maximization device |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4045576/ https://www.ncbi.nlm.nih.gov/pubmed/24896099 http://dx.doi.org/10.1371/journal.pone.0094334 |
work_keys_str_mv | AT solisarmandod derivinghighresolutionproteinbackbonestructurepropensitiesfromallcrystaldatausingtheinformationmaximizationdevice |