Cargando…

Using First Passage Statistics to Extract Environmentally Dependent Amino Acid Correlations

In this work, we study the first passage statistics of amino acid primary sequences, that is the probability of observing an amino acid for the first time at a certain number of residues away from a fixed amino acid. By using this rich mathematical framework, we are able to capture the background di...

Descripción completa

Detalles Bibliográficos
Autores principales: Greenbaum, Benjamin D., Kumar, Pradeep, Libchaber, Albert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4084998/
https://www.ncbi.nlm.nih.gov/pubmed/25000191
http://dx.doi.org/10.1371/journal.pone.0101665
_version_ 1782324588620808192
author Greenbaum, Benjamin D.
Kumar, Pradeep
Libchaber, Albert
author_facet Greenbaum, Benjamin D.
Kumar, Pradeep
Libchaber, Albert
author_sort Greenbaum, Benjamin D.
collection PubMed
description In this work, we study the first passage statistics of amino acid primary sequences, that is the probability of observing an amino acid for the first time at a certain number of residues away from a fixed amino acid. By using this rich mathematical framework, we are able to capture the background distribution for an organism, and infer lengths at which the first passage has a probability that differs from what is expected. While many features of an organism's genome are due to natural selection, others are related to amino acid chemistry and the environment in which an organism lives, constraining the randomness of genomes upon which selection can further act. We therefore use this approach to infer amino acid correlations, and then study how these correlations vary across a wide range of organisms under a wide range of optimal growth temperatures. We find a nearly universal exponential background distribution, consistent with the idea that most amino acids are globally uncorrelated from other amino acids in genomes. When we are able to extract significant correlations, these correlations are reliably dependent on optimal growth temperature, across phylogenetic boundaries. Some of the correlations we extract, such as the enhanced probability of finding, for the first time, a cysteine three residues away from a cysteine or glutamic acid two residues away from an arginine, likely relate to thermal stability. However, other correlations, likely appearing on alpha helical surfaces, have a less clear physiochemical interpretation and may relate to thermal stability or unusual metabolic properties of organisms that live in a high temperature environment.
format Online
Article
Text
id pubmed-4084998
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40849982014-07-09 Using First Passage Statistics to Extract Environmentally Dependent Amino Acid Correlations Greenbaum, Benjamin D. Kumar, Pradeep Libchaber, Albert PLoS One Research Article In this work, we study the first passage statistics of amino acid primary sequences, that is the probability of observing an amino acid for the first time at a certain number of residues away from a fixed amino acid. By using this rich mathematical framework, we are able to capture the background distribution for an organism, and infer lengths at which the first passage has a probability that differs from what is expected. While many features of an organism's genome are due to natural selection, others are related to amino acid chemistry and the environment in which an organism lives, constraining the randomness of genomes upon which selection can further act. We therefore use this approach to infer amino acid correlations, and then study how these correlations vary across a wide range of organisms under a wide range of optimal growth temperatures. We find a nearly universal exponential background distribution, consistent with the idea that most amino acids are globally uncorrelated from other amino acids in genomes. When we are able to extract significant correlations, these correlations are reliably dependent on optimal growth temperature, across phylogenetic boundaries. Some of the correlations we extract, such as the enhanced probability of finding, for the first time, a cysteine three residues away from a cysteine or glutamic acid two residues away from an arginine, likely relate to thermal stability. However, other correlations, likely appearing on alpha helical surfaces, have a less clear physiochemical interpretation and may relate to thermal stability or unusual metabolic properties of organisms that live in a high temperature environment. Public Library of Science 2014-07-07 /pmc/articles/PMC4084998/ /pubmed/25000191 http://dx.doi.org/10.1371/journal.pone.0101665 Text en © 2014 Greenbaum et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Greenbaum, Benjamin D.
Kumar, Pradeep
Libchaber, Albert
Using First Passage Statistics to Extract Environmentally Dependent Amino Acid Correlations
title Using First Passage Statistics to Extract Environmentally Dependent Amino Acid Correlations
title_full Using First Passage Statistics to Extract Environmentally Dependent Amino Acid Correlations
title_fullStr Using First Passage Statistics to Extract Environmentally Dependent Amino Acid Correlations
title_full_unstemmed Using First Passage Statistics to Extract Environmentally Dependent Amino Acid Correlations
title_short Using First Passage Statistics to Extract Environmentally Dependent Amino Acid Correlations
title_sort using first passage statistics to extract environmentally dependent amino acid correlations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4084998/
https://www.ncbi.nlm.nih.gov/pubmed/25000191
http://dx.doi.org/10.1371/journal.pone.0101665
work_keys_str_mv AT greenbaumbenjamind usingfirstpassagestatisticstoextractenvironmentallydependentaminoacidcorrelations
AT kumarpradeep usingfirstpassagestatisticstoextractenvironmentallydependentaminoacidcorrelations
AT libchaberalbert usingfirstpassagestatisticstoextractenvironmentallydependentaminoacidcorrelations