Cargando…

Differential expression analysis for paired RNA-seq data

BACKGROUND: RNA-Seq technology measures the transcript abundance by generating sequence reads and counting their frequencies across different biological conditions. To identify differentially expressed genes between two conditions, it is important to consider the experimental design as well as the d...

Descripción completa

Detalles Bibliográficos
Autores principales: Chung, Lisa M, Ferguson, John P, Zheng, Wei, Qian, Feng, Bruno, Vincent, Montgomery, Ruth R, Zhao, Hongyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3663822/
https://www.ncbi.nlm.nih.gov/pubmed/23530607
http://dx.doi.org/10.1186/1471-2105-14-110
_version_ 1782271050438934528
author Chung, Lisa M
Ferguson, John P
Zheng, Wei
Qian, Feng
Bruno, Vincent
Montgomery, Ruth R
Zhao, Hongyu
author_facet Chung, Lisa M
Ferguson, John P
Zheng, Wei
Qian, Feng
Bruno, Vincent
Montgomery, Ruth R
Zhao, Hongyu
author_sort Chung, Lisa M
collection PubMed
description BACKGROUND: RNA-Seq technology measures the transcript abundance by generating sequence reads and counting their frequencies across different biological conditions. To identify differentially expressed genes between two conditions, it is important to consider the experimental design as well as the distributional property of the data. In many RNA-Seq studies, the expression data are obtained as multiple pairs, e.g., pre- vs. post-treatment samples from the same individual. We seek to incorporate paired structure into analysis. RESULTS: We present a Bayesian hierarchical mixture model for RNA-Seq data to separately account for the variability within and between individuals from a paired data structure. The method assumes a Poisson distribution for the data mixed with a gamma distribution to account variability between pairs. The effect of differential expression is modeled by two-component mixture model. The performance of this approach is examined by simulated and real data. CONCLUSIONS: In this setting, our proposed model provides higher sensitivity than existing methods to detect differential expression. Application to real RNA-Seq data demonstrates the usefulness of this method for detecting expression alteration for genes with low average expression levels or shorter transcript length.
format Online
Article
Text
id pubmed-3663822
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36638222013-05-31 Differential expression analysis for paired RNA-seq data Chung, Lisa M Ferguson, John P Zheng, Wei Qian, Feng Bruno, Vincent Montgomery, Ruth R Zhao, Hongyu BMC Bioinformatics Methodology Article BACKGROUND: RNA-Seq technology measures the transcript abundance by generating sequence reads and counting their frequencies across different biological conditions. To identify differentially expressed genes between two conditions, it is important to consider the experimental design as well as the distributional property of the data. In many RNA-Seq studies, the expression data are obtained as multiple pairs, e.g., pre- vs. post-treatment samples from the same individual. We seek to incorporate paired structure into analysis. RESULTS: We present a Bayesian hierarchical mixture model for RNA-Seq data to separately account for the variability within and between individuals from a paired data structure. The method assumes a Poisson distribution for the data mixed with a gamma distribution to account variability between pairs. The effect of differential expression is modeled by two-component mixture model. The performance of this approach is examined by simulated and real data. CONCLUSIONS: In this setting, our proposed model provides higher sensitivity than existing methods to detect differential expression. Application to real RNA-Seq data demonstrates the usefulness of this method for detecting expression alteration for genes with low average expression levels or shorter transcript length. BioMed Central 2013-03-27 /pmc/articles/PMC3663822/ /pubmed/23530607 http://dx.doi.org/10.1186/1471-2105-14-110 Text en Copyright © 2013 Chung et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Chung, Lisa M
Ferguson, John P
Zheng, Wei
Qian, Feng
Bruno, Vincent
Montgomery, Ruth R
Zhao, Hongyu
Differential expression analysis for paired RNA-seq data
title Differential expression analysis for paired RNA-seq data
title_full Differential expression analysis for paired RNA-seq data
title_fullStr Differential expression analysis for paired RNA-seq data
title_full_unstemmed Differential expression analysis for paired RNA-seq data
title_short Differential expression analysis for paired RNA-seq data
title_sort differential expression analysis for paired rna-seq data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3663822/
https://www.ncbi.nlm.nih.gov/pubmed/23530607
http://dx.doi.org/10.1186/1471-2105-14-110
work_keys_str_mv AT chunglisam differentialexpressionanalysisforpairedrnaseqdata
AT fergusonjohnp differentialexpressionanalysisforpairedrnaseqdata
AT zhengwei differentialexpressionanalysisforpairedrnaseqdata
AT qianfeng differentialexpressionanalysisforpairedrnaseqdata
AT brunovincent differentialexpressionanalysisforpairedrnaseqdata
AT montgomeryruthr differentialexpressionanalysisforpairedrnaseqdata
AT zhaohongyu differentialexpressionanalysisforpairedrnaseqdata