Cargando…
Approximate inference of gene regulatory network models from RNA-Seq time series data
BACKGROUND: Inference of gene regulatory network structures from RNA-Seq data is challenging due to the nature of the data, as measurements take the form of counts of reads mapped to a given gene. Here we present a model for RNA-Seq time series data that applies a negative binomial distribution for...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896118/ https://www.ncbi.nlm.nih.gov/pubmed/29642837 http://dx.doi.org/10.1186/s12859-018-2125-2 |
_version_ | 1783313779382026240 |
---|---|
author | Thorne, Thomas |
author_facet | Thorne, Thomas |
author_sort | Thorne, Thomas |
collection | PubMed |
description | BACKGROUND: Inference of gene regulatory network structures from RNA-Seq data is challenging due to the nature of the data, as measurements take the form of counts of reads mapped to a given gene. Here we present a model for RNA-Seq time series data that applies a negative binomial distribution for the observations, and uses sparse regression with a horseshoe prior to learn a dynamic Bayesian network of interactions between genes. We use a variational inference scheme to learn approximate posterior distributions for the model parameters. RESULTS: The methodology is benchmarked on synthetic data designed to replicate the distribution of real world RNA-Seq data. We compare our method to other sparse regression approaches and find improved performance in learning directed networks. We demonstrate an application of our method to a publicly available human neuronal stem cell differentiation RNA-Seq time series data set to infer the underlying network structure. CONCLUSIONS: Our method is able to improve performance on synthetic data by explicitly modelling the statistical distribution of the data when learning networks from RNA-Seq time series. Applying approximate inference techniques we can learn network structures quickly with only moderate computing resources. |
format | Online Article Text |
id | pubmed-5896118 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58961182018-04-20 Approximate inference of gene regulatory network models from RNA-Seq time series data Thorne, Thomas BMC Bioinformatics Methodology Article BACKGROUND: Inference of gene regulatory network structures from RNA-Seq data is challenging due to the nature of the data, as measurements take the form of counts of reads mapped to a given gene. Here we present a model for RNA-Seq time series data that applies a negative binomial distribution for the observations, and uses sparse regression with a horseshoe prior to learn a dynamic Bayesian network of interactions between genes. We use a variational inference scheme to learn approximate posterior distributions for the model parameters. RESULTS: The methodology is benchmarked on synthetic data designed to replicate the distribution of real world RNA-Seq data. We compare our method to other sparse regression approaches and find improved performance in learning directed networks. We demonstrate an application of our method to a publicly available human neuronal stem cell differentiation RNA-Seq time series data set to infer the underlying network structure. CONCLUSIONS: Our method is able to improve performance on synthetic data by explicitly modelling the statistical distribution of the data when learning networks from RNA-Seq time series. Applying approximate inference techniques we can learn network structures quickly with only moderate computing resources. BioMed Central 2018-04-11 /pmc/articles/PMC5896118/ /pubmed/29642837 http://dx.doi.org/10.1186/s12859-018-2125-2 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Thorne, Thomas Approximate inference of gene regulatory network models from RNA-Seq time series data |
title | Approximate inference of gene regulatory network models from RNA-Seq time series data |
title_full | Approximate inference of gene regulatory network models from RNA-Seq time series data |
title_fullStr | Approximate inference of gene regulatory network models from RNA-Seq time series data |
title_full_unstemmed | Approximate inference of gene regulatory network models from RNA-Seq time series data |
title_short | Approximate inference of gene regulatory network models from RNA-Seq time series data |
title_sort | approximate inference of gene regulatory network models from rna-seq time series data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896118/ https://www.ncbi.nlm.nih.gov/pubmed/29642837 http://dx.doi.org/10.1186/s12859-018-2125-2 |
work_keys_str_mv | AT thornethomas approximateinferenceofgeneregulatorynetworkmodelsfromrnaseqtimeseriesdata |