Cargando…

Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data

Gene expression levels exhibit stochastic variations among genetically identical organisms under the same environmental conditions. In many recent transcriptome analyses based on RNA sequencing (RNA-seq), variations in gene expression levels among replicates were assumed to follow a negative binomia...

Descripción completa

Detalles Bibliográficos
Autores principales: Awazu, Akinori, Tanabe, Takahiro, Kamitani, Mari, Tezuka, Ayumi, Nagano, Atsushi J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5974282/
https://www.ncbi.nlm.nih.gov/pubmed/29844539
http://dx.doi.org/10.1038/s41598-018-26735-4
_version_ 1783326786819457024
author Awazu, Akinori
Tanabe, Takahiro
Kamitani, Mari
Tezuka, Ayumi
Nagano, Atsushi J.
author_facet Awazu, Akinori
Tanabe, Takahiro
Kamitani, Mari
Tezuka, Ayumi
Nagano, Atsushi J.
author_sort Awazu, Akinori
collection PubMed
description Gene expression levels exhibit stochastic variations among genetically identical organisms under the same environmental conditions. In many recent transcriptome analyses based on RNA sequencing (RNA-seq), variations in gene expression levels among replicates were assumed to follow a negative binomial distribution, although the physiological basis of this assumption remains unclear. In this study, RNA-seq data were obtained from Arabidopsis thaliana under eight conditions (21–27 replicates), and the characteristics of gene-dependent empirical probability density function (ePDF) profiles of gene expression levels were analyzed. For A. thaliana and Saccharomyces cerevisiae, various types of ePDF of gene expression levels were obtained that were classified as Gaussian, power law-like containing a long tail, or intermediate. These ePDF profiles were well fitted with a Gauss-power mixing distribution function derived from a simple model of a stochastic transcriptional network containing a feedback loop. The fitting function suggested that gene expression levels with long-tailed ePDFs would be strongly influenced by feedback regulation. Furthermore, the features of gene expression levels are correlated with their functions, with the levels of essential genes tending to follow a Gaussian-like ePDF while those of genes encoding nucleic acid-binding proteins and transcription factors exhibit long-tailed ePDF.
format Online
Article
Text
id pubmed-5974282
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-59742822018-05-31 Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data Awazu, Akinori Tanabe, Takahiro Kamitani, Mari Tezuka, Ayumi Nagano, Atsushi J. Sci Rep Article Gene expression levels exhibit stochastic variations among genetically identical organisms under the same environmental conditions. In many recent transcriptome analyses based on RNA sequencing (RNA-seq), variations in gene expression levels among replicates were assumed to follow a negative binomial distribution, although the physiological basis of this assumption remains unclear. In this study, RNA-seq data were obtained from Arabidopsis thaliana under eight conditions (21–27 replicates), and the characteristics of gene-dependent empirical probability density function (ePDF) profiles of gene expression levels were analyzed. For A. thaliana and Saccharomyces cerevisiae, various types of ePDF of gene expression levels were obtained that were classified as Gaussian, power law-like containing a long tail, or intermediate. These ePDF profiles were well fitted with a Gauss-power mixing distribution function derived from a simple model of a stochastic transcriptional network containing a feedback loop. The fitting function suggested that gene expression levels with long-tailed ePDFs would be strongly influenced by feedback regulation. Furthermore, the features of gene expression levels are correlated with their functions, with the levels of essential genes tending to follow a Gaussian-like ePDF while those of genes encoding nucleic acid-binding proteins and transcription factors exhibit long-tailed ePDF. Nature Publishing Group UK 2018-05-29 /pmc/articles/PMC5974282/ /pubmed/29844539 http://dx.doi.org/10.1038/s41598-018-26735-4 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Awazu, Akinori
Tanabe, Takahiro
Kamitani, Mari
Tezuka, Ayumi
Nagano, Atsushi J.
Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data
title Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data
title_full Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data
title_fullStr Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data
title_full_unstemmed Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data
title_short Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data
title_sort broad distribution spectrum from gaussian to power law appears in stochastic variations in rna-seq data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5974282/
https://www.ncbi.nlm.nih.gov/pubmed/29844539
http://dx.doi.org/10.1038/s41598-018-26735-4
work_keys_str_mv AT awazuakinori broaddistributionspectrumfromgaussiantopowerlawappearsinstochasticvariationsinrnaseqdata
AT tanabetakahiro broaddistributionspectrumfromgaussiantopowerlawappearsinstochasticvariationsinrnaseqdata
AT kamitanimari broaddistributionspectrumfromgaussiantopowerlawappearsinstochasticvariationsinrnaseqdata
AT tezukaayumi broaddistributionspectrumfromgaussiantopowerlawappearsinstochasticvariationsinrnaseqdata
AT naganoatsushij broaddistributionspectrumfromgaussiantopowerlawappearsinstochasticvariationsinrnaseqdata