Cargando…

Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts

Single cells exhibit a significant amount of variability in transcript levels, which arises from slow, stochastic transitions between gene expression states. Elucidating the nature of these states and understanding how transition rates are affected by different regulatory mechanisms require state-of...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Yen Ting, Buchler, Nicolas E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AIP Publishing LLC 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6615996/
https://www.ncbi.nlm.nih.gov/pubmed/31301707
http://dx.doi.org/10.1063/1.5110503
_version_ 1783433428848345088
author Lin, Yen Ting
Buchler, Nicolas E.
author_facet Lin, Yen Ting
Buchler, Nicolas E.
author_sort Lin, Yen Ting
collection PubMed
description Single cells exhibit a significant amount of variability in transcript levels, which arises from slow, stochastic transitions between gene expression states. Elucidating the nature of these states and understanding how transition rates are affected by different regulatory mechanisms require state-of-the-art methods to infer underlying models of gene expression from single cell data. A Bayesian approach to statistical inference is the most suitable method for model selection and uncertainty quantification of kinetic parameters using small data sets. However, this approach is impractical because current algorithms are too slow to handle typical models of gene expression. To solve this problem, we first show that time-dependent mRNA distributions of discrete-state models of gene expression are dynamic Poisson mixtures, whose mixing kernels are characterized by a piecewise deterministic Markov process. We combined this analytical result with a kinetic Monte Carlo algorithm to create a hybrid numerical method that accelerates the calculation of time-dependent mRNA distributions by 1000-fold compared to current methods. We then integrated the hybrid algorithm into an existing Monte Carlo sampler to estimate the Bayesian posterior distribution of many different, competing models in a reasonable amount of time. We demonstrate that kinetic parameters can be reasonably constrained for modestly sampled data sets if the model is known a priori. If there are many competing models, Bayesian evidence can rigorously quantify the likelihood of a model relative to other models from the data. We demonstrate that Bayesian evidence selects the true model and outperforms approximate metrics typically used for model selection.
format Online
Article
Text
id pubmed-6615996
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher AIP Publishing LLC
record_format MEDLINE/PubMed
spelling pubmed-66159962019-07-16 Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts Lin, Yen Ting Buchler, Nicolas E. J Chem Phys ARTICLES Single cells exhibit a significant amount of variability in transcript levels, which arises from slow, stochastic transitions between gene expression states. Elucidating the nature of these states and understanding how transition rates are affected by different regulatory mechanisms require state-of-the-art methods to infer underlying models of gene expression from single cell data. A Bayesian approach to statistical inference is the most suitable method for model selection and uncertainty quantification of kinetic parameters using small data sets. However, this approach is impractical because current algorithms are too slow to handle typical models of gene expression. To solve this problem, we first show that time-dependent mRNA distributions of discrete-state models of gene expression are dynamic Poisson mixtures, whose mixing kernels are characterized by a piecewise deterministic Markov process. We combined this analytical result with a kinetic Monte Carlo algorithm to create a hybrid numerical method that accelerates the calculation of time-dependent mRNA distributions by 1000-fold compared to current methods. We then integrated the hybrid algorithm into an existing Monte Carlo sampler to estimate the Bayesian posterior distribution of many different, competing models in a reasonable amount of time. We demonstrate that kinetic parameters can be reasonably constrained for modestly sampled data sets if the model is known a priori. If there are many competing models, Bayesian evidence can rigorously quantify the likelihood of a model relative to other models from the data. We demonstrate that Bayesian evidence selects the true model and outperforms approximate metrics typically used for model selection. AIP Publishing LLC 2019-07-14 2019-07-09 /pmc/articles/PMC6615996/ /pubmed/31301707 http://dx.doi.org/10.1063/1.5110503 Text en © 2019 Author(s). 0021-9606/2019/151(2)/024106/15/$0.00 All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle ARTICLES
Lin, Yen Ting
Buchler, Nicolas E.
Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts
title Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts
title_full Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts
title_fullStr Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts
title_full_unstemmed Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts
title_short Exact and efficient hybrid Monte Carlo algorithm for accelerated Bayesian inference of gene expression models from snapshots of single-cell transcripts
title_sort exact and efficient hybrid monte carlo algorithm for accelerated bayesian inference of gene expression models from snapshots of single-cell transcripts
topic ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6615996/
https://www.ncbi.nlm.nih.gov/pubmed/31301707
http://dx.doi.org/10.1063/1.5110503
work_keys_str_mv AT linyenting exactandefficienthybridmontecarloalgorithmforacceleratedbayesianinferenceofgeneexpressionmodelsfromsnapshotsofsinglecelltranscripts
AT buchlernicolase exactandefficienthybridmontecarloalgorithmforacceleratedbayesianinferenceofgeneexpressionmodelsfromsnapshotsofsinglecelltranscripts