Cargando…

Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation

BACKGROUND: Gene expression is known to be an intrinsically stochastic process which can involve single-digit numbers of mRNA molecules in a cell at any given time. The modelling of such processes calls for the use of exact stochastic simulation methods, most notably the Gillespie algorithm. However...

Descripción completa

Detalles Bibliográficos
Autores principales: Lenive, Oleg, W. Kirk, Paul D., H. Stumpf, Michael P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4994381/
https://www.ncbi.nlm.nih.gov/pubmed/27549182
http://dx.doi.org/10.1186/s12918-016-0324-x
_version_ 1782449319555629056
author Lenive, Oleg
W. Kirk, Paul D.
H. Stumpf, Michael P.
author_facet Lenive, Oleg
W. Kirk, Paul D.
H. Stumpf, Michael P.
author_sort Lenive, Oleg
collection PubMed
description BACKGROUND: Gene expression is known to be an intrinsically stochastic process which can involve single-digit numbers of mRNA molecules in a cell at any given time. The modelling of such processes calls for the use of exact stochastic simulation methods, most notably the Gillespie algorithm. However, this stochasticity, also termed “intrinsic noise”, does not account for all the variability between genetically identical cells growing in a homogeneous environment. Despite substantial experimental efforts, determining appropriate model parameters continues to be a challenge. Methods based on approximate Bayesian computation can be used to obtain posterior parameter distributions given the observed data. However, such inference procedures require large numbers of simulations of the model and exact stochastic simulation is computationally costly. In this work we focus on the specific case of trying to infer model parameters describing reaction rates and extrinsic noise on the basis of measurements of molecule numbers in individual cells at a given time point. RESULTS: To make the problem computationally tractable we develop an exact, model-specific, stochastic simulation algorithm for the commonly used two-state model of gene expression. This algorithm relies on certain assumptions and favourable properties of the model to forgo the simulation of the whole temporal trajectory of protein numbers in the system, instead returning only the number of protein and mRNA molecules present in the system at a specified time point. The computational gain is proportional to the number of protein molecules created in the system and becomes significant for systems involving hundreds or thousands of protein molecules. CONCLUSIONS: We employ this simulation algorithm with approximate Bayesian computation to jointly infer the model’s rate and noise parameters from published gene expression data. Our analysis indicates that for most genes the extrinsic contributions to noise will be small to moderate but certainly are non-negligible. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-016-0324-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4994381
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49943812016-08-24 Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation Lenive, Oleg W. Kirk, Paul D. H. Stumpf, Michael P. BMC Syst Biol Research Article BACKGROUND: Gene expression is known to be an intrinsically stochastic process which can involve single-digit numbers of mRNA molecules in a cell at any given time. The modelling of such processes calls for the use of exact stochastic simulation methods, most notably the Gillespie algorithm. However, this stochasticity, also termed “intrinsic noise”, does not account for all the variability between genetically identical cells growing in a homogeneous environment. Despite substantial experimental efforts, determining appropriate model parameters continues to be a challenge. Methods based on approximate Bayesian computation can be used to obtain posterior parameter distributions given the observed data. However, such inference procedures require large numbers of simulations of the model and exact stochastic simulation is computationally costly. In this work we focus on the specific case of trying to infer model parameters describing reaction rates and extrinsic noise on the basis of measurements of molecule numbers in individual cells at a given time point. RESULTS: To make the problem computationally tractable we develop an exact, model-specific, stochastic simulation algorithm for the commonly used two-state model of gene expression. This algorithm relies on certain assumptions and favourable properties of the model to forgo the simulation of the whole temporal trajectory of protein numbers in the system, instead returning only the number of protein and mRNA molecules present in the system at a specified time point. The computational gain is proportional to the number of protein molecules created in the system and becomes significant for systems involving hundreds or thousands of protein molecules. CONCLUSIONS: We employ this simulation algorithm with approximate Bayesian computation to jointly infer the model’s rate and noise parameters from published gene expression data. Our analysis indicates that for most genes the extrinsic contributions to noise will be small to moderate but certainly are non-negligible. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-016-0324-x) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-22 /pmc/articles/PMC4994381/ /pubmed/27549182 http://dx.doi.org/10.1186/s12918-016-0324-x Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Lenive, Oleg
W. Kirk, Paul D.
H. Stumpf, Michael P.
Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation
title Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation
title_full Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation
title_fullStr Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation
title_full_unstemmed Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation
title_short Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation
title_sort inferring extrinsic noise from single-cell gene expression data using approximate bayesian computation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4994381/
https://www.ncbi.nlm.nih.gov/pubmed/27549182
http://dx.doi.org/10.1186/s12918-016-0324-x
work_keys_str_mv AT leniveoleg inferringextrinsicnoisefromsinglecellgeneexpressiondatausingapproximatebayesiancomputation
AT wkirkpauld inferringextrinsicnoisefromsinglecellgeneexpressiondatausingapproximatebayesiancomputation
AT hstumpfmichaelp inferringextrinsicnoisefromsinglecellgeneexpressiondatausingapproximatebayesiancomputation