Cargando…

A Computational Framework for Analyzing Stochasticity in Gene Expression

Stochastic fluctuations in gene expression give rise to distributions of protein levels across cell populations. Despite a mounting number of theoretical models explaining stochasticity in protein expression, we lack a robust, efficient, assumption-free approach for inferring the molecular mechanism...

Descripción completa

Detalles Bibliográficos
Autores principales: Sherman, Marc S., Cohen, Barak A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4014403/
https://www.ncbi.nlm.nih.gov/pubmed/24811315
http://dx.doi.org/10.1371/journal.pcbi.1003596
_version_ 1782315165480386560
author Sherman, Marc S.
Cohen, Barak A.
author_facet Sherman, Marc S.
Cohen, Barak A.
author_sort Sherman, Marc S.
collection PubMed
description Stochastic fluctuations in gene expression give rise to distributions of protein levels across cell populations. Despite a mounting number of theoretical models explaining stochasticity in protein expression, we lack a robust, efficient, assumption-free approach for inferring the molecular mechanisms that underlie the shape of protein distributions. Here we propose a method for inferring sets of biochemical rate constants that govern chromatin modification, transcription, translation, and RNA and protein degradation from stochasticity in protein expression. We asked whether the rates of these underlying processes can be estimated accurately from protein expression distributions, in the absence of any limiting assumptions. To do this, we (1) derived analytical solutions for the first four moments of the protein distribution, (2) found that these four moments completely capture the shape of protein distributions, and (3) developed an efficient algorithm for inferring gene expression rate constants from the moments of protein distributions. Using this algorithm we find that most protein distributions are consistent with a large number of different biochemical rate constant sets. Despite this degeneracy, the solution space of rate constants almost always informs on underlying mechanism. For example, we distinguish between regimes where transcriptional bursting occurs from regimes reflecting constitutive transcript production. Our method agrees with the current standard approach, and in the restrictive regime where the standard method operates, also identifies rate constants not previously obtainable. Even without making any assumptions we obtain estimates of individual biochemical rate constants, or meaningful ratios of rate constants, in 91% of tested cases. In some cases our method identified all of the underlying rate constants. The framework developed here will be a powerful tool for deducing the contributions of particular molecular mechanisms to specific patterns of gene expression.
format Online
Article
Text
id pubmed-4014403
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40144032014-05-14 A Computational Framework for Analyzing Stochasticity in Gene Expression Sherman, Marc S. Cohen, Barak A. PLoS Comput Biol Research Article Stochastic fluctuations in gene expression give rise to distributions of protein levels across cell populations. Despite a mounting number of theoretical models explaining stochasticity in protein expression, we lack a robust, efficient, assumption-free approach for inferring the molecular mechanisms that underlie the shape of protein distributions. Here we propose a method for inferring sets of biochemical rate constants that govern chromatin modification, transcription, translation, and RNA and protein degradation from stochasticity in protein expression. We asked whether the rates of these underlying processes can be estimated accurately from protein expression distributions, in the absence of any limiting assumptions. To do this, we (1) derived analytical solutions for the first four moments of the protein distribution, (2) found that these four moments completely capture the shape of protein distributions, and (3) developed an efficient algorithm for inferring gene expression rate constants from the moments of protein distributions. Using this algorithm we find that most protein distributions are consistent with a large number of different biochemical rate constant sets. Despite this degeneracy, the solution space of rate constants almost always informs on underlying mechanism. For example, we distinguish between regimes where transcriptional bursting occurs from regimes reflecting constitutive transcript production. Our method agrees with the current standard approach, and in the restrictive regime where the standard method operates, also identifies rate constants not previously obtainable. Even without making any assumptions we obtain estimates of individual biochemical rate constants, or meaningful ratios of rate constants, in 91% of tested cases. In some cases our method identified all of the underlying rate constants. The framework developed here will be a powerful tool for deducing the contributions of particular molecular mechanisms to specific patterns of gene expression. Public Library of Science 2014-05-08 /pmc/articles/PMC4014403/ /pubmed/24811315 http://dx.doi.org/10.1371/journal.pcbi.1003596 Text en © 2014 Sherman, Cohen http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Sherman, Marc S.
Cohen, Barak A.
A Computational Framework for Analyzing Stochasticity in Gene Expression
title A Computational Framework for Analyzing Stochasticity in Gene Expression
title_full A Computational Framework for Analyzing Stochasticity in Gene Expression
title_fullStr A Computational Framework for Analyzing Stochasticity in Gene Expression
title_full_unstemmed A Computational Framework for Analyzing Stochasticity in Gene Expression
title_short A Computational Framework for Analyzing Stochasticity in Gene Expression
title_sort computational framework for analyzing stochasticity in gene expression
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4014403/
https://www.ncbi.nlm.nih.gov/pubmed/24811315
http://dx.doi.org/10.1371/journal.pcbi.1003596
work_keys_str_mv AT shermanmarcs acomputationalframeworkforanalyzingstochasticityingeneexpression
AT cohenbaraka acomputationalframeworkforanalyzingstochasticityingeneexpression
AT shermanmarcs computationalframeworkforanalyzingstochasticityingeneexpression
AT cohenbaraka computationalframeworkforanalyzingstochasticityingeneexpression