Cargando…

Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing

BACKGROUND: RNA-Seq has become a key technology in transcriptome studies because it can quantify overall expression levels and the degree of alternative splicing for each gene simultaneously. To interpret high-throughout transcriptome profiling data, functional enrichment analysis is critical. Howev...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xi, Cairns, Murray J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3622641/
https://www.ncbi.nlm.nih.gov/pubmed/23734663
http://dx.doi.org/10.1186/1471-2105-14-S5-S16
_version_ 1782265859631218688
author Wang, Xi
Cairns, Murray J
author_facet Wang, Xi
Cairns, Murray J
author_sort Wang, Xi
collection PubMed
description BACKGROUND: RNA-Seq has become a key technology in transcriptome studies because it can quantify overall expression levels and the degree of alternative splicing for each gene simultaneously. To interpret high-throughout transcriptome profiling data, functional enrichment analysis is critical. However, existing functional analysis methods can only account for differential expression, leaving differential splicing out altogether. RESULTS: In this work, we present a novel approach to derive biological insight by integrating differential expression and splicing from RNA-Seq data with functional gene set analysis. This approach designated SeqGSEA, uses count data modelling with negative binomial distributions to first score differential expression and splicing in each gene, respectively, followed by two strategies to combine the two scores for integrated gene set enrichment analysis. Method comparison results and biological insight analysis on an artificial data set and three real RNA-Seq data sets indicate that our approach outperforms alternative analysis pipelines and can detect biological meaningful gene sets with high confidence, and that it has the ability to determine if transcription or splicing is their predominant regulatory mechanism. CONCLUSIONS: By integrating differential expression and splicing, the proposed method SeqGSEA is particularly useful for efficiently translating RNA-Seq data to biological discoveries.
format Online
Article
Text
id pubmed-3622641
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36226412013-04-15 Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing Wang, Xi Cairns, Murray J BMC Bioinformatics Proceedings BACKGROUND: RNA-Seq has become a key technology in transcriptome studies because it can quantify overall expression levels and the degree of alternative splicing for each gene simultaneously. To interpret high-throughout transcriptome profiling data, functional enrichment analysis is critical. However, existing functional analysis methods can only account for differential expression, leaving differential splicing out altogether. RESULTS: In this work, we present a novel approach to derive biological insight by integrating differential expression and splicing from RNA-Seq data with functional gene set analysis. This approach designated SeqGSEA, uses count data modelling with negative binomial distributions to first score differential expression and splicing in each gene, respectively, followed by two strategies to combine the two scores for integrated gene set enrichment analysis. Method comparison results and biological insight analysis on an artificial data set and three real RNA-Seq data sets indicate that our approach outperforms alternative analysis pipelines and can detect biological meaningful gene sets with high confidence, and that it has the ability to determine if transcription or splicing is their predominant regulatory mechanism. CONCLUSIONS: By integrating differential expression and splicing, the proposed method SeqGSEA is particularly useful for efficiently translating RNA-Seq data to biological discoveries. BioMed Central 2013-04-10 /pmc/articles/PMC3622641/ /pubmed/23734663 http://dx.doi.org/10.1186/1471-2105-14-S5-S16 Text en Copyright © 2013 Wang and Cairns.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Wang, Xi
Cairns, Murray J
Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing
title Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing
title_full Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing
title_fullStr Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing
title_full_unstemmed Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing
title_short Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing
title_sort gene set enrichment analysis of rna-seq data: integrating differential expression and splicing
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3622641/
https://www.ncbi.nlm.nih.gov/pubmed/23734663
http://dx.doi.org/10.1186/1471-2105-14-S5-S16
work_keys_str_mv AT wangxi genesetenrichmentanalysisofrnaseqdataintegratingdifferentialexpressionandsplicing
AT cairnsmurrayj genesetenrichmentanalysisofrnaseqdataintegratingdifferentialexpressionandsplicing