Cargando…
LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
BACKGROUND: With the advances in high-throughput DNA sequencing technologies, RNA-seq has rapidly emerged as a powerful tool for the quantitative analysis of gene expression and transcript variant discovery. In comparative experiments, differential expression analysis is commonly performed on RNA-se...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4304217/ https://www.ncbi.nlm.nih.gov/pubmed/25560842 http://dx.doi.org/10.1186/1471-2164-15-S10-S7 |
_version_ | 1782354057916055552 |
---|---|
author | Lin, Bingqing Zhang, Li-Feng Chen, Xin |
author_facet | Lin, Bingqing Zhang, Li-Feng Chen, Xin |
author_sort | Lin, Bingqing |
collection | PubMed |
description | BACKGROUND: With the advances in high-throughput DNA sequencing technologies, RNA-seq has rapidly emerged as a powerful tool for the quantitative analysis of gene expression and transcript variant discovery. In comparative experiments, differential expression analysis is commonly performed on RNA-seq data to identify genes/features that are differentially expressed between biological conditions. Most existing statistical methods for differential expression analysis are parametric and assume either Poisson distribution or negative binomial distribution on gene read counts. However, violation of distributional assumptions or a poor estimation of parameters often leads to unreliable results. RESULTS: In this paper, we introduce a new nonparametric approach called LFCseq that uses log fold changes as a differential expression test statistic. To test each gene for differential expression, LFCseq estimates a null probability distribution of count changes from a selected set of genes with similar expression strength. In contrast, the nonparametric NOISeq approach relies on a null distribution estimated from all genes within an experimental condition regardless of their expression levels. CONCLUSION: Through extensive simulation study and RNA-seq real data analysis, we demonstrate that the proposed approach could well rank the differentially expressed genes ahead of non-differentially expressed genes, thereby achieving a much improved overall performance for differential expression analysis. |
format | Online Article Text |
id | pubmed-4304217 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43042172015-02-09 LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data Lin, Bingqing Zhang, Li-Feng Chen, Xin BMC Genomics Research BACKGROUND: With the advances in high-throughput DNA sequencing technologies, RNA-seq has rapidly emerged as a powerful tool for the quantitative analysis of gene expression and transcript variant discovery. In comparative experiments, differential expression analysis is commonly performed on RNA-seq data to identify genes/features that are differentially expressed between biological conditions. Most existing statistical methods for differential expression analysis are parametric and assume either Poisson distribution or negative binomial distribution on gene read counts. However, violation of distributional assumptions or a poor estimation of parameters often leads to unreliable results. RESULTS: In this paper, we introduce a new nonparametric approach called LFCseq that uses log fold changes as a differential expression test statistic. To test each gene for differential expression, LFCseq estimates a null probability distribution of count changes from a selected set of genes with similar expression strength. In contrast, the nonparametric NOISeq approach relies on a null distribution estimated from all genes within an experimental condition regardless of their expression levels. CONCLUSION: Through extensive simulation study and RNA-seq real data analysis, we demonstrate that the proposed approach could well rank the differentially expressed genes ahead of non-differentially expressed genes, thereby achieving a much improved overall performance for differential expression analysis. BioMed Central 2014-12-12 /pmc/articles/PMC4304217/ /pubmed/25560842 http://dx.doi.org/10.1186/1471-2164-15-S10-S7 Text en Copyright © 2014 Lin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Lin, Bingqing Zhang, Li-Feng Chen, Xin LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data |
title | LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data |
title_full | LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data |
title_fullStr | LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data |
title_full_unstemmed | LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data |
title_short | LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data |
title_sort | lfcseq: a nonparametric approach for differential expression analysis of rna-seq data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4304217/ https://www.ncbi.nlm.nih.gov/pubmed/25560842 http://dx.doi.org/10.1186/1471-2164-15-S10-S7 |
work_keys_str_mv | AT linbingqing lfcseqanonparametricapproachfordifferentialexpressionanalysisofrnaseqdata AT zhanglifeng lfcseqanonparametricapproachfordifferentialexpressionanalysisofrnaseqdata AT chenxin lfcseqanonparametricapproachfordifferentialexpressionanalysisofrnaseqdata |