Cargando…
Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins
Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3235099/ https://www.ncbi.nlm.nih.gov/pubmed/22174733 http://dx.doi.org/10.1371/journal.pone.0025638 |
_version_ | 1782218561563918336 |
---|---|
author | Henriksen, Svend B. Mortensen, Rasmus J. Geertz-Hansen, Henrik M. Neves-Petersen, Maria Teresa Arnason, Omar Söring, Jón Petersen, Steffen B. |
author_facet | Henriksen, Svend B. Mortensen, Rasmus J. Geertz-Hansen, Henrik M. Neves-Petersen, Maria Teresa Arnason, Omar Söring, Jón Petersen, Steffen B. |
author_sort | Henriksen, Svend B. |
collection | PubMed |
description | Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis. |
format | Online Article Text |
id | pubmed-3235099 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-32350992011-12-15 Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins Henriksen, Svend B. Mortensen, Rasmus J. Geertz-Hansen, Henrik M. Neves-Petersen, Maria Teresa Arnason, Omar Söring, Jón Petersen, Steffen B. PLoS One Research Article Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis. Public Library of Science 2011-12-09 /pmc/articles/PMC3235099/ /pubmed/22174733 http://dx.doi.org/10.1371/journal.pone.0025638 Text en Henriksen et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Henriksen, Svend B. Mortensen, Rasmus J. Geertz-Hansen, Henrik M. Neves-Petersen, Maria Teresa Arnason, Omar Söring, Jón Petersen, Steffen B. Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins |
title | Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins |
title_full | Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins |
title_fullStr | Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins |
title_full_unstemmed | Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins |
title_short | Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins |
title_sort | hyperdimensional analysis of amino acid pair distributions in proteins |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3235099/ https://www.ncbi.nlm.nih.gov/pubmed/22174733 http://dx.doi.org/10.1371/journal.pone.0025638 |
work_keys_str_mv | AT henriksensvendb hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT mortensenrasmusj hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT geertzhansenhenrikm hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT nevespetersenmariateresa hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT arnasonomar hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT soringjon hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT petersensteffenb hyperdimensionalanalysisofaminoacidpairdistributionsinproteins |