Cargando…
GSAR: Bioconductor package for Gene Set analysis in R
BACKGROUND: Gene set analysis (in a form of functionally related genes or pathways) has become the method of choice for analyzing omics data in general and gene expression data in particular. There are many statistical methods that either summarize gene-level statistics for a gene set or apply a mul...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259853/ https://www.ncbi.nlm.nih.gov/pubmed/28118818 http://dx.doi.org/10.1186/s12859-017-1482-6 |
_version_ | 1782499287459954688 |
---|---|
author | Rahmatallah, Yasir Zybailov, Boris Emmert-Streib, Frank Glazko, Galina |
author_facet | Rahmatallah, Yasir Zybailov, Boris Emmert-Streib, Frank Glazko, Galina |
author_sort | Rahmatallah, Yasir |
collection | PubMed |
description | BACKGROUND: Gene set analysis (in a form of functionally related genes or pathways) has become the method of choice for analyzing omics data in general and gene expression data in particular. There are many statistical methods that either summarize gene-level statistics for a gene set or apply a multivariate statistic that accounts for intergene correlations. Most available methods detect complex departures from the null hypothesis but lack the ability to identify the specific alternative hypothesis that rejects the null. RESULTS: GSAR (Gene Set Analysis in R) is an open-source R/Bioconductor software package for gene set analysis (GSA). It implements self-contained multivariate non-parametric statistical methods testing a complex null hypothesis against specific alternatives, such as differences in mean (shift), variance (scale), or net correlation structure. The package also provides a graphical visualization tool, based on the union of two minimum spanning trees, for correlation networks to examine the change in the correlation structures of a gene set between two conditions and highlight influential genes (hubs). CONCLUSIONS: Package GSAR provides a set of multivariate non-parametric statistical methods that test a complex null hypothesis against specific alternatives. The methods in package GSAR are applicable to any type of omics data that can be represented in a matrix format. The package, with detailed instructions and examples, is freely available under the GPL (> = 2) license from the Bioconductor web site. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1482-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5259853 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52598532017-01-26 GSAR: Bioconductor package for Gene Set analysis in R Rahmatallah, Yasir Zybailov, Boris Emmert-Streib, Frank Glazko, Galina BMC Bioinformatics Software BACKGROUND: Gene set analysis (in a form of functionally related genes or pathways) has become the method of choice for analyzing omics data in general and gene expression data in particular. There are many statistical methods that either summarize gene-level statistics for a gene set or apply a multivariate statistic that accounts for intergene correlations. Most available methods detect complex departures from the null hypothesis but lack the ability to identify the specific alternative hypothesis that rejects the null. RESULTS: GSAR (Gene Set Analysis in R) is an open-source R/Bioconductor software package for gene set analysis (GSA). It implements self-contained multivariate non-parametric statistical methods testing a complex null hypothesis against specific alternatives, such as differences in mean (shift), variance (scale), or net correlation structure. The package also provides a graphical visualization tool, based on the union of two minimum spanning trees, for correlation networks to examine the change in the correlation structures of a gene set between two conditions and highlight influential genes (hubs). CONCLUSIONS: Package GSAR provides a set of multivariate non-parametric statistical methods that test a complex null hypothesis against specific alternatives. The methods in package GSAR are applicable to any type of omics data that can be represented in a matrix format. The package, with detailed instructions and examples, is freely available under the GPL (> = 2) license from the Bioconductor web site. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1482-6) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-24 /pmc/articles/PMC5259853/ /pubmed/28118818 http://dx.doi.org/10.1186/s12859-017-1482-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Rahmatallah, Yasir Zybailov, Boris Emmert-Streib, Frank Glazko, Galina GSAR: Bioconductor package for Gene Set analysis in R |
title | GSAR: Bioconductor package for Gene Set analysis in R |
title_full | GSAR: Bioconductor package for Gene Set analysis in R |
title_fullStr | GSAR: Bioconductor package for Gene Set analysis in R |
title_full_unstemmed | GSAR: Bioconductor package for Gene Set analysis in R |
title_short | GSAR: Bioconductor package for Gene Set analysis in R |
title_sort | gsar: bioconductor package for gene set analysis in r |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259853/ https://www.ncbi.nlm.nih.gov/pubmed/28118818 http://dx.doi.org/10.1186/s12859-017-1482-6 |
work_keys_str_mv | AT rahmatallahyasir gsarbioconductorpackageforgenesetanalysisinr AT zybailovboris gsarbioconductorpackageforgenesetanalysisinr AT emmertstreibfrank gsarbioconductorpackageforgenesetanalysisinr AT glazkogalina gsarbioconductorpackageforgenesetanalysisinr |