Cargando…
Easy and efficient ensemble gene set testing with EGSEA
Gene set enrichment analysis is a popular approach for prioritising the biological processes perturbed in genomic datasets. The Bioconductor project hosts over 80 software packages capable of gene set analysis. Most of these packages search for enriched signatures amongst differentially regulated ge...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5747338/ https://www.ncbi.nlm.nih.gov/pubmed/29333246 http://dx.doi.org/10.12688/f1000research.12544.1 |
_version_ | 1783289258774102016 |
---|---|
author | Alhamdoosh, Monther Law, Charity W. Tian, Luyi Sheridan, Julie M. Ng, Milica Ritchie, Matthew E. |
author_facet | Alhamdoosh, Monther Law, Charity W. Tian, Luyi Sheridan, Julie M. Ng, Milica Ritchie, Matthew E. |
author_sort | Alhamdoosh, Monther |
collection | PubMed |
description | Gene set enrichment analysis is a popular approach for prioritising the biological processes perturbed in genomic datasets. The Bioconductor project hosts over 80 software packages capable of gene set analysis. Most of these packages search for enriched signatures amongst differentially regulated genes to reveal higher level biological themes that may be missed when focusing only on evidence from individual genes. With so many different methods on offer, choosing the best algorithm and visualization approach can be challenging. The EGSEA package solves this problem by combining results from up to 12 prominent gene set testing algorithms to obtain a consensus ranking of biologically relevant results.This workflow demonstrates how EGSEA can extend limma-based differential expression analyses for RNA-seq and microarray data using experiments that profile 3 distinct cell populations important for studying the origins of breast cancer. Following data normalization and set-up of an appropriate linear model for differential expression analysis, EGSEA builds gene signature specific indexes that link a wide range of mouse or human gene set collections obtained from MSigDB, GeneSetDB and KEGG to the gene expression data being investigated. EGSEA is then configured and the ensemble enrichment analysis run, returning an object that can be queried using several S4 methods for ranking gene sets and visualizing results via heatmaps, KEGG pathway views, GO graphs, scatter plots and bar plots. Finally, an HTML report that combines these displays can fast-track the sharing of results with collaborators, and thus expedite downstream biological validation. EGSEA is simple to use and can be easily integrated with existing gene expression analysis pipelines for both human and mouse data. |
format | Online Article Text |
id | pubmed-5747338 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-57473382018-01-11 Easy and efficient ensemble gene set testing with EGSEA Alhamdoosh, Monther Law, Charity W. Tian, Luyi Sheridan, Julie M. Ng, Milica Ritchie, Matthew E. F1000Res Software Tool Article Gene set enrichment analysis is a popular approach for prioritising the biological processes perturbed in genomic datasets. The Bioconductor project hosts over 80 software packages capable of gene set analysis. Most of these packages search for enriched signatures amongst differentially regulated genes to reveal higher level biological themes that may be missed when focusing only on evidence from individual genes. With so many different methods on offer, choosing the best algorithm and visualization approach can be challenging. The EGSEA package solves this problem by combining results from up to 12 prominent gene set testing algorithms to obtain a consensus ranking of biologically relevant results.This workflow demonstrates how EGSEA can extend limma-based differential expression analyses for RNA-seq and microarray data using experiments that profile 3 distinct cell populations important for studying the origins of breast cancer. Following data normalization and set-up of an appropriate linear model for differential expression analysis, EGSEA builds gene signature specific indexes that link a wide range of mouse or human gene set collections obtained from MSigDB, GeneSetDB and KEGG to the gene expression data being investigated. EGSEA is then configured and the ensemble enrichment analysis run, returning an object that can be queried using several S4 methods for ranking gene sets and visualizing results via heatmaps, KEGG pathway views, GO graphs, scatter plots and bar plots. Finally, an HTML report that combines these displays can fast-track the sharing of results with collaborators, and thus expedite downstream biological validation. EGSEA is simple to use and can be easily integrated with existing gene expression analysis pipelines for both human and mouse data. F1000 Research Limited 2017-11-14 /pmc/articles/PMC5747338/ /pubmed/29333246 http://dx.doi.org/10.12688/f1000research.12544.1 Text en Copyright: © 2017 Alhamdoosh M et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Tool Article Alhamdoosh, Monther Law, Charity W. Tian, Luyi Sheridan, Julie M. Ng, Milica Ritchie, Matthew E. Easy and efficient ensemble gene set testing with EGSEA |
title | Easy and efficient ensemble gene set testing with EGSEA |
title_full | Easy and efficient ensemble gene set testing with EGSEA |
title_fullStr | Easy and efficient ensemble gene set testing with EGSEA |
title_full_unstemmed | Easy and efficient ensemble gene set testing with EGSEA |
title_short | Easy and efficient ensemble gene set testing with EGSEA |
title_sort | easy and efficient ensemble gene set testing with egsea |
topic | Software Tool Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5747338/ https://www.ncbi.nlm.nih.gov/pubmed/29333246 http://dx.doi.org/10.12688/f1000research.12544.1 |
work_keys_str_mv | AT alhamdooshmonther easyandefficientensemblegenesettestingwithegsea AT lawcharityw easyandefficientensemblegenesettestingwithegsea AT tianluyi easyandefficientensemblegenesettestingwithegsea AT sheridanjuliem easyandefficientensemblegenesettestingwithegsea AT ngmilica easyandefficientensemblegenesettestingwithegsea AT ritchiematthewe easyandefficientensemblegenesettestingwithegsea |