Cargando…

Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins

Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains...

Descripción completa

Detalles Bibliográficos
Autores principales: Henriksen, Svend B., Mortensen, Rasmus J., Geertz-Hansen, Henrik M., Neves-Petersen, Maria Teresa, Arnason, Omar, Söring, Jón, Petersen, Steffen B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3235099/
https://www.ncbi.nlm.nih.gov/pubmed/22174733
http://dx.doi.org/10.1371/journal.pone.0025638
_version_ 1782218561563918336
author Henriksen, Svend B.
Mortensen, Rasmus J.
Geertz-Hansen, Henrik M.
Neves-Petersen, Maria Teresa
Arnason, Omar
Söring, Jón
Petersen, Steffen B.
author_facet Henriksen, Svend B.
Mortensen, Rasmus J.
Geertz-Hansen, Henrik M.
Neves-Petersen, Maria Teresa
Arnason, Omar
Söring, Jón
Petersen, Steffen B.
author_sort Henriksen, Svend B.
collection PubMed
description Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis.
format Online
Article
Text
id pubmed-3235099
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32350992011-12-15 Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins Henriksen, Svend B. Mortensen, Rasmus J. Geertz-Hansen, Henrik M. Neves-Petersen, Maria Teresa Arnason, Omar Söring, Jón Petersen, Steffen B. PLoS One Research Article Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis. Public Library of Science 2011-12-09 /pmc/articles/PMC3235099/ /pubmed/22174733 http://dx.doi.org/10.1371/journal.pone.0025638 Text en Henriksen et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Henriksen, Svend B.
Mortensen, Rasmus J.
Geertz-Hansen, Henrik M.
Neves-Petersen, Maria Teresa
Arnason, Omar
Söring, Jón
Petersen, Steffen B.
Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins
title Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins
title_full Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins
title_fullStr Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins
title_full_unstemmed Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins
title_short Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins
title_sort hyperdimensional analysis of amino acid pair distributions in proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3235099/
https://www.ncbi.nlm.nih.gov/pubmed/22174733
http://dx.doi.org/10.1371/journal.pone.0025638
work_keys_str_mv AT henriksensvendb hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT mortensenrasmusj hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT geertzhansenhenrikm hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT nevespetersenmariateresa hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT arnasonomar hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT soringjon hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT petersensteffenb hyperdimensionalanalysisofaminoacidpairdistributionsinproteins