Cargando…

Approximate inference of gene regulatory network models from RNA-Seq time series data

BACKGROUND: Inference of gene regulatory network structures from RNA-Seq data is challenging due to the nature of the data, as measurements take the form of counts of reads mapped to a given gene. Here we present a model for RNA-Seq time series data that applies a negative binomial distribution for...

Descripción completa

Detalles Bibliográficos
Autor principal: Thorne, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896118/
https://www.ncbi.nlm.nih.gov/pubmed/29642837
http://dx.doi.org/10.1186/s12859-018-2125-2
_version_ 1783313779382026240
author Thorne, Thomas
author_facet Thorne, Thomas
author_sort Thorne, Thomas
collection PubMed
description BACKGROUND: Inference of gene regulatory network structures from RNA-Seq data is challenging due to the nature of the data, as measurements take the form of counts of reads mapped to a given gene. Here we present a model for RNA-Seq time series data that applies a negative binomial distribution for the observations, and uses sparse regression with a horseshoe prior to learn a dynamic Bayesian network of interactions between genes. We use a variational inference scheme to learn approximate posterior distributions for the model parameters. RESULTS: The methodology is benchmarked on synthetic data designed to replicate the distribution of real world RNA-Seq data. We compare our method to other sparse regression approaches and find improved performance in learning directed networks. We demonstrate an application of our method to a publicly available human neuronal stem cell differentiation RNA-Seq time series data set to infer the underlying network structure. CONCLUSIONS: Our method is able to improve performance on synthetic data by explicitly modelling the statistical distribution of the data when learning networks from RNA-Seq time series. Applying approximate inference techniques we can learn network structures quickly with only moderate computing resources.
format Online
Article
Text
id pubmed-5896118
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58961182018-04-20 Approximate inference of gene regulatory network models from RNA-Seq time series data Thorne, Thomas BMC Bioinformatics Methodology Article BACKGROUND: Inference of gene regulatory network structures from RNA-Seq data is challenging due to the nature of the data, as measurements take the form of counts of reads mapped to a given gene. Here we present a model for RNA-Seq time series data that applies a negative binomial distribution for the observations, and uses sparse regression with a horseshoe prior to learn a dynamic Bayesian network of interactions between genes. We use a variational inference scheme to learn approximate posterior distributions for the model parameters. RESULTS: The methodology is benchmarked on synthetic data designed to replicate the distribution of real world RNA-Seq data. We compare our method to other sparse regression approaches and find improved performance in learning directed networks. We demonstrate an application of our method to a publicly available human neuronal stem cell differentiation RNA-Seq time series data set to infer the underlying network structure. CONCLUSIONS: Our method is able to improve performance on synthetic data by explicitly modelling the statistical distribution of the data when learning networks from RNA-Seq time series. Applying approximate inference techniques we can learn network structures quickly with only moderate computing resources. BioMed Central 2018-04-11 /pmc/articles/PMC5896118/ /pubmed/29642837 http://dx.doi.org/10.1186/s12859-018-2125-2 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Thorne, Thomas
Approximate inference of gene regulatory network models from RNA-Seq time series data
title Approximate inference of gene regulatory network models from RNA-Seq time series data
title_full Approximate inference of gene regulatory network models from RNA-Seq time series data
title_fullStr Approximate inference of gene regulatory network models from RNA-Seq time series data
title_full_unstemmed Approximate inference of gene regulatory network models from RNA-Seq time series data
title_short Approximate inference of gene regulatory network models from RNA-Seq time series data
title_sort approximate inference of gene regulatory network models from rna-seq time series data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896118/
https://www.ncbi.nlm.nih.gov/pubmed/29642837
http://dx.doi.org/10.1186/s12859-018-2125-2
work_keys_str_mv AT thornethomas approximateinferenceofgeneregulatorynetworkmodelsfromrnaseqtimeseriesdata