Cargando…

GSVA: gene set variation analysis for microarray and RNA-Seq data

BACKGROUND: Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpreta...

Descripción completa

Detalles Bibliográficos
Autores principales: Hänzelmann, Sonja, Castelo, Robert, Guinney, Justin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3618321/
https://www.ncbi.nlm.nih.gov/pubmed/23323831
http://dx.doi.org/10.1186/1471-2105-14-7
_version_ 1782265400250073088
author Hänzelmann, Sonja
Castelo, Robert
Guinney, Justin
author_facet Hänzelmann, Sonja
Castelo, Robert
Guinney, Justin
author_sort Hänzelmann, Sonja
collection PubMed
description BACKGROUND: Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. RESULTS: To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. CONCLUSIONS: GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org.
format Online
Article
Text
id pubmed-3618321
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36183212013-04-09 GSVA: gene set variation analysis for microarray and RNA-Seq data Hänzelmann, Sonja Castelo, Robert Guinney, Justin BMC Bioinformatics Software BACKGROUND: Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. RESULTS: To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. CONCLUSIONS: GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org. BioMed Central 2013-01-16 /pmc/articles/PMC3618321/ /pubmed/23323831 http://dx.doi.org/10.1186/1471-2105-14-7 Text en Copyright © 2013 Hänzelmann et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Hänzelmann, Sonja
Castelo, Robert
Guinney, Justin
GSVA: gene set variation analysis for microarray and RNA-Seq data
title GSVA: gene set variation analysis for microarray and RNA-Seq data
title_full GSVA: gene set variation analysis for microarray and RNA-Seq data
title_fullStr GSVA: gene set variation analysis for microarray and RNA-Seq data
title_full_unstemmed GSVA: gene set variation analysis for microarray and RNA-Seq data
title_short GSVA: gene set variation analysis for microarray and RNA-Seq data
title_sort gsva: gene set variation analysis for microarray and rna-seq data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3618321/
https://www.ncbi.nlm.nih.gov/pubmed/23323831
http://dx.doi.org/10.1186/1471-2105-14-7
work_keys_str_mv AT hanzelmannsonja gsvagenesetvariationanalysisformicroarrayandrnaseqdata
AT castelorobert gsvagenesetvariationanalysisformicroarrayandrnaseqdata
AT guinneyjustin gsvagenesetvariationanalysisformicroarrayandrnaseqdata