Cargando…
Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation
BACKGROUND: Gene expression is known to be an intrinsically stochastic process which can involve single-digit numbers of mRNA molecules in a cell at any given time. The modelling of such processes calls for the use of exact stochastic simulation methods, most notably the Gillespie algorithm. However...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4994381/ https://www.ncbi.nlm.nih.gov/pubmed/27549182 http://dx.doi.org/10.1186/s12918-016-0324-x |
_version_ | 1782449319555629056 |
---|---|
author | Lenive, Oleg W. Kirk, Paul D. H. Stumpf, Michael P. |
author_facet | Lenive, Oleg W. Kirk, Paul D. H. Stumpf, Michael P. |
author_sort | Lenive, Oleg |
collection | PubMed |
description | BACKGROUND: Gene expression is known to be an intrinsically stochastic process which can involve single-digit numbers of mRNA molecules in a cell at any given time. The modelling of such processes calls for the use of exact stochastic simulation methods, most notably the Gillespie algorithm. However, this stochasticity, also termed “intrinsic noise”, does not account for all the variability between genetically identical cells growing in a homogeneous environment. Despite substantial experimental efforts, determining appropriate model parameters continues to be a challenge. Methods based on approximate Bayesian computation can be used to obtain posterior parameter distributions given the observed data. However, such inference procedures require large numbers of simulations of the model and exact stochastic simulation is computationally costly. In this work we focus on the specific case of trying to infer model parameters describing reaction rates and extrinsic noise on the basis of measurements of molecule numbers in individual cells at a given time point. RESULTS: To make the problem computationally tractable we develop an exact, model-specific, stochastic simulation algorithm for the commonly used two-state model of gene expression. This algorithm relies on certain assumptions and favourable properties of the model to forgo the simulation of the whole temporal trajectory of protein numbers in the system, instead returning only the number of protein and mRNA molecules present in the system at a specified time point. The computational gain is proportional to the number of protein molecules created in the system and becomes significant for systems involving hundreds or thousands of protein molecules. CONCLUSIONS: We employ this simulation algorithm with approximate Bayesian computation to jointly infer the model’s rate and noise parameters from published gene expression data. Our analysis indicates that for most genes the extrinsic contributions to noise will be small to moderate but certainly are non-negligible. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-016-0324-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4994381 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49943812016-08-24 Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation Lenive, Oleg W. Kirk, Paul D. H. Stumpf, Michael P. BMC Syst Biol Research Article BACKGROUND: Gene expression is known to be an intrinsically stochastic process which can involve single-digit numbers of mRNA molecules in a cell at any given time. The modelling of such processes calls for the use of exact stochastic simulation methods, most notably the Gillespie algorithm. However, this stochasticity, also termed “intrinsic noise”, does not account for all the variability between genetically identical cells growing in a homogeneous environment. Despite substantial experimental efforts, determining appropriate model parameters continues to be a challenge. Methods based on approximate Bayesian computation can be used to obtain posterior parameter distributions given the observed data. However, such inference procedures require large numbers of simulations of the model and exact stochastic simulation is computationally costly. In this work we focus on the specific case of trying to infer model parameters describing reaction rates and extrinsic noise on the basis of measurements of molecule numbers in individual cells at a given time point. RESULTS: To make the problem computationally tractable we develop an exact, model-specific, stochastic simulation algorithm for the commonly used two-state model of gene expression. This algorithm relies on certain assumptions and favourable properties of the model to forgo the simulation of the whole temporal trajectory of protein numbers in the system, instead returning only the number of protein and mRNA molecules present in the system at a specified time point. The computational gain is proportional to the number of protein molecules created in the system and becomes significant for systems involving hundreds or thousands of protein molecules. CONCLUSIONS: We employ this simulation algorithm with approximate Bayesian computation to jointly infer the model’s rate and noise parameters from published gene expression data. Our analysis indicates that for most genes the extrinsic contributions to noise will be small to moderate but certainly are non-negligible. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-016-0324-x) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-22 /pmc/articles/PMC4994381/ /pubmed/27549182 http://dx.doi.org/10.1186/s12918-016-0324-x Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Lenive, Oleg W. Kirk, Paul D. H. Stumpf, Michael P. Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation |
title | Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation |
title_full | Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation |
title_fullStr | Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation |
title_full_unstemmed | Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation |
title_short | Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation |
title_sort | inferring extrinsic noise from single-cell gene expression data using approximate bayesian computation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4994381/ https://www.ncbi.nlm.nih.gov/pubmed/27549182 http://dx.doi.org/10.1186/s12918-016-0324-x |
work_keys_str_mv | AT leniveoleg inferringextrinsicnoisefromsinglecellgeneexpressiondatausingapproximatebayesiancomputation AT wkirkpauld inferringextrinsicnoisefromsinglecellgeneexpressiondatausingapproximatebayesiancomputation AT hstumpfmichaelp inferringextrinsicnoisefromsinglecellgeneexpressiondatausingapproximatebayesiancomputation |