Cargando…
Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates
Deregulated pathways identified from transcriptome data of two sample groups have played a key role in many genomic studies. Gene-set enrichment analysis (GSEA) has been commonly used for pathway or functional analysis of microarray data, and it is also being applied to RNA-seq data. However, most R...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5102490/ https://www.ncbi.nlm.nih.gov/pubmed/27829002 http://dx.doi.org/10.1371/journal.pone.0165919 |
_version_ | 1782466440756985856 |
---|---|
author | Yoon, Sora Kim, Seon-Young Nam, Dougu |
author_facet | Yoon, Sora Kim, Seon-Young Nam, Dougu |
author_sort | Yoon, Sora |
collection | PubMed |
description | Deregulated pathways identified from transcriptome data of two sample groups have played a key role in many genomic studies. Gene-set enrichment analysis (GSEA) has been commonly used for pathway or functional analysis of microarray data, and it is also being applied to RNA-seq data. However, most RNA-seq data so far have only small replicates. This enforces to apply the gene-permuting GSEA method (or preranked GSEA) which results in a great number of false positives due to the inter-gene correlation in each gene-set. We demonstrate that incorporating the absolute gene statistic in one-tailed GSEA considerably improves the false-positive control and the overall discriminatory ability of the gene-permuting GSEA methods for RNA-seq data. To test the performance, a simulation method to generate correlated read counts within a gene-set was newly developed, and a dozen of currently available RNA-seq enrichment analysis methods were compared, where the proposed methods outperformed others that do not account for the inter-gene correlation. Analysis of real RNA-seq data also supported the proposed methods in terms of false positive control, ranks of true positives and biological relevance. An efficient R package (AbsFilterGSEA) coded with C++ (Rcpp) is available from CRAN. |
format | Online Article Text |
id | pubmed-5102490 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-51024902016-11-18 Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates Yoon, Sora Kim, Seon-Young Nam, Dougu PLoS One Research Article Deregulated pathways identified from transcriptome data of two sample groups have played a key role in many genomic studies. Gene-set enrichment analysis (GSEA) has been commonly used for pathway or functional analysis of microarray data, and it is also being applied to RNA-seq data. However, most RNA-seq data so far have only small replicates. This enforces to apply the gene-permuting GSEA method (or preranked GSEA) which results in a great number of false positives due to the inter-gene correlation in each gene-set. We demonstrate that incorporating the absolute gene statistic in one-tailed GSEA considerably improves the false-positive control and the overall discriminatory ability of the gene-permuting GSEA methods for RNA-seq data. To test the performance, a simulation method to generate correlated read counts within a gene-set was newly developed, and a dozen of currently available RNA-seq enrichment analysis methods were compared, where the proposed methods outperformed others that do not account for the inter-gene correlation. Analysis of real RNA-seq data also supported the proposed methods in terms of false positive control, ranks of true positives and biological relevance. An efficient R package (AbsFilterGSEA) coded with C++ (Rcpp) is available from CRAN. Public Library of Science 2016-11-09 /pmc/articles/PMC5102490/ /pubmed/27829002 http://dx.doi.org/10.1371/journal.pone.0165919 Text en © 2016 Yoon et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Yoon, Sora Kim, Seon-Young Nam, Dougu Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates |
title | Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates |
title_full | Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates |
title_fullStr | Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates |
title_full_unstemmed | Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates |
title_short | Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates |
title_sort | improving gene-set enrichment analysis of rna-seq data with small replicates |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5102490/ https://www.ncbi.nlm.nih.gov/pubmed/27829002 http://dx.doi.org/10.1371/journal.pone.0165919 |
work_keys_str_mv | AT yoonsora improvinggenesetenrichmentanalysisofrnaseqdatawithsmallreplicates AT kimseonyoung improvinggenesetenrichmentanalysisofrnaseqdatawithsmallreplicates AT namdougu improvinggenesetenrichmentanalysisofrnaseqdatawithsmallreplicates |