Cargando…
Combining multiple tools outperforms individual methods in gene set enrichment analyses
MOTIVATION: Gene set enrichment (GSE) analysis allows researchers to efficiently extract biological insight from long lists of differentially expressed genes by interrogating them at a systems level. In recent years, there has been a proliferation of GSE analysis methods and hence it has become incr...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5408797/ https://www.ncbi.nlm.nih.gov/pubmed/27694195 http://dx.doi.org/10.1093/bioinformatics/btw623 |
_version_ | 1783232366601306112 |
---|---|
author | Alhamdoosh, Monther Ng, Milica Wilson, Nicholas J Sheridan, Julie M Huynh, Huy Wilson, Michael J Ritchie, Matthew E |
author_facet | Alhamdoosh, Monther Ng, Milica Wilson, Nicholas J Sheridan, Julie M Huynh, Huy Wilson, Michael J Ritchie, Matthew E |
author_sort | Alhamdoosh, Monther |
collection | PubMed |
description | MOTIVATION: Gene set enrichment (GSE) analysis allows researchers to efficiently extract biological insight from long lists of differentially expressed genes by interrogating them at a systems level. In recent years, there has been a proliferation of GSE analysis methods and hence it has become increasingly difficult for researchers to select an optimal GSE tool based on their particular dataset. Moreover, the majority of GSE analysis methods do not allow researchers to simultaneously compare gene set level results between multiple experimental conditions. RESULTS: The ensemble of genes set enrichment analyses (EGSEA) is a method developed for RNA-sequencing data that combines results from twelve algorithms and calculates collective gene set scores to improve the biological relevance of the highest ranked gene sets. EGSEA’s gene set database contains around 25 000 gene sets from sixteen collections. It has multiple visualization capabilities that allow researchers to view gene sets at various levels of granularity. EGSEA has been tested on simulated data and on a number of human and mouse datasets and, based on biologists’ feedback, consistently outperforms the individual tools that have been combined. Our evaluation demonstrates the superiority of the ensemble approach for GSE analysis, and its utility to effectively and efficiently extrapolate biological functions and potential involvement in disease processes from lists of differentially regulated genes. AVAILABILITY AND IMPLEMENTATION: EGSEA is available as an R package at http://www.bioconductor.org/packages/EGSEA/. The gene sets collections are available in the R package EGSEAdata from http://www.bioconductor.org/packages/EGSEAdata/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5408797 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54087972017-05-03 Combining multiple tools outperforms individual methods in gene set enrichment analyses Alhamdoosh, Monther Ng, Milica Wilson, Nicholas J Sheridan, Julie M Huynh, Huy Wilson, Michael J Ritchie, Matthew E Bioinformatics Original Papers MOTIVATION: Gene set enrichment (GSE) analysis allows researchers to efficiently extract biological insight from long lists of differentially expressed genes by interrogating them at a systems level. In recent years, there has been a proliferation of GSE analysis methods and hence it has become increasingly difficult for researchers to select an optimal GSE tool based on their particular dataset. Moreover, the majority of GSE analysis methods do not allow researchers to simultaneously compare gene set level results between multiple experimental conditions. RESULTS: The ensemble of genes set enrichment analyses (EGSEA) is a method developed for RNA-sequencing data that combines results from twelve algorithms and calculates collective gene set scores to improve the biological relevance of the highest ranked gene sets. EGSEA’s gene set database contains around 25 000 gene sets from sixteen collections. It has multiple visualization capabilities that allow researchers to view gene sets at various levels of granularity. EGSEA has been tested on simulated data and on a number of human and mouse datasets and, based on biologists’ feedback, consistently outperforms the individual tools that have been combined. Our evaluation demonstrates the superiority of the ensemble approach for GSE analysis, and its utility to effectively and efficiently extrapolate biological functions and potential involvement in disease processes from lists of differentially regulated genes. AVAILABILITY AND IMPLEMENTATION: EGSEA is available as an R package at http://www.bioconductor.org/packages/EGSEA/. The gene sets collections are available in the R package EGSEAdata from http://www.bioconductor.org/packages/EGSEAdata/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-02-01 2016-09-30 /pmc/articles/PMC5408797/ /pubmed/27694195 http://dx.doi.org/10.1093/bioinformatics/btw623 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Alhamdoosh, Monther Ng, Milica Wilson, Nicholas J Sheridan, Julie M Huynh, Huy Wilson, Michael J Ritchie, Matthew E Combining multiple tools outperforms individual methods in gene set enrichment analyses |
title | Combining multiple tools outperforms individual methods in gene set enrichment analyses |
title_full | Combining multiple tools outperforms individual methods in gene set enrichment analyses |
title_fullStr | Combining multiple tools outperforms individual methods in gene set enrichment analyses |
title_full_unstemmed | Combining multiple tools outperforms individual methods in gene set enrichment analyses |
title_short | Combining multiple tools outperforms individual methods in gene set enrichment analyses |
title_sort | combining multiple tools outperforms individual methods in gene set enrichment analyses |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5408797/ https://www.ncbi.nlm.nih.gov/pubmed/27694195 http://dx.doi.org/10.1093/bioinformatics/btw623 |
work_keys_str_mv | AT alhamdooshmonther combiningmultipletoolsoutperformsindividualmethodsingenesetenrichmentanalyses AT ngmilica combiningmultipletoolsoutperformsindividualmethodsingenesetenrichmentanalyses AT wilsonnicholasj combiningmultipletoolsoutperformsindividualmethodsingenesetenrichmentanalyses AT sheridanjuliem combiningmultipletoolsoutperformsindividualmethodsingenesetenrichmentanalyses AT huynhhuy combiningmultipletoolsoutperformsindividualmethodsingenesetenrichmentanalyses AT wilsonmichaelj combiningmultipletoolsoutperformsindividualmethodsingenesetenrichmentanalyses AT ritchiematthewe combiningmultipletoolsoutperformsindividualmethodsingenesetenrichmentanalyses |