Cargando…

Easy and efficient ensemble gene set testing with EGSEA

Gene set enrichment analysis is a popular approach for prioritising the biological processes perturbed in genomic datasets. The Bioconductor project hosts over 80 software packages capable of gene set analysis. Most of these packages search for enriched signatures amongst differentially regulated ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Alhamdoosh, Monther, Law, Charity W., Tian, Luyi, Sheridan, Julie M., Ng, Milica, Ritchie, Matthew E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5747338/
https://www.ncbi.nlm.nih.gov/pubmed/29333246
http://dx.doi.org/10.12688/f1000research.12544.1
_version_ 1783289258774102016
author Alhamdoosh, Monther
Law, Charity W.
Tian, Luyi
Sheridan, Julie M.
Ng, Milica
Ritchie, Matthew E.
author_facet Alhamdoosh, Monther
Law, Charity W.
Tian, Luyi
Sheridan, Julie M.
Ng, Milica
Ritchie, Matthew E.
author_sort Alhamdoosh, Monther
collection PubMed
description Gene set enrichment analysis is a popular approach for prioritising the biological processes perturbed in genomic datasets. The Bioconductor project hosts over 80 software packages capable of gene set analysis. Most of these packages search for enriched signatures amongst differentially regulated genes to reveal higher level biological themes that may be missed when focusing only on evidence from individual genes. With so many different methods on offer, choosing the best algorithm and visualization approach can be challenging. The EGSEA package solves this problem by combining results from up to 12 prominent gene set testing algorithms to obtain a consensus ranking of biologically relevant results.This workflow demonstrates how EGSEA can extend limma-based differential expression analyses for RNA-seq and microarray data using experiments that profile 3 distinct cell populations important for studying the origins of breast cancer. Following data normalization and set-up of an appropriate linear model for differential expression analysis, EGSEA builds gene signature specific indexes that link a wide range of mouse or human gene set collections obtained from MSigDB, GeneSetDB and KEGG to the gene expression data being investigated. EGSEA is then configured and the ensemble enrichment analysis run, returning an object that can be queried using several S4 methods for ranking gene sets and visualizing results via heatmaps, KEGG pathway views, GO graphs, scatter plots and bar plots. Finally, an HTML report that combines these displays can fast-track the sharing of results with collaborators, and thus expedite downstream biological validation. EGSEA is simple to use and can be easily integrated with existing gene expression analysis pipelines for both human and mouse data.
format Online
Article
Text
id pubmed-5747338
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-57473382018-01-11 Easy and efficient ensemble gene set testing with EGSEA Alhamdoosh, Monther Law, Charity W. Tian, Luyi Sheridan, Julie M. Ng, Milica Ritchie, Matthew E. F1000Res Software Tool Article Gene set enrichment analysis is a popular approach for prioritising the biological processes perturbed in genomic datasets. The Bioconductor project hosts over 80 software packages capable of gene set analysis. Most of these packages search for enriched signatures amongst differentially regulated genes to reveal higher level biological themes that may be missed when focusing only on evidence from individual genes. With so many different methods on offer, choosing the best algorithm and visualization approach can be challenging. The EGSEA package solves this problem by combining results from up to 12 prominent gene set testing algorithms to obtain a consensus ranking of biologically relevant results.This workflow demonstrates how EGSEA can extend limma-based differential expression analyses for RNA-seq and microarray data using experiments that profile 3 distinct cell populations important for studying the origins of breast cancer. Following data normalization and set-up of an appropriate linear model for differential expression analysis, EGSEA builds gene signature specific indexes that link a wide range of mouse or human gene set collections obtained from MSigDB, GeneSetDB and KEGG to the gene expression data being investigated. EGSEA is then configured and the ensemble enrichment analysis run, returning an object that can be queried using several S4 methods for ranking gene sets and visualizing results via heatmaps, KEGG pathway views, GO graphs, scatter plots and bar plots. Finally, an HTML report that combines these displays can fast-track the sharing of results with collaborators, and thus expedite downstream biological validation. EGSEA is simple to use and can be easily integrated with existing gene expression analysis pipelines for both human and mouse data. F1000 Research Limited 2017-11-14 /pmc/articles/PMC5747338/ /pubmed/29333246 http://dx.doi.org/10.12688/f1000research.12544.1 Text en Copyright: © 2017 Alhamdoosh M et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software Tool Article
Alhamdoosh, Monther
Law, Charity W.
Tian, Luyi
Sheridan, Julie M.
Ng, Milica
Ritchie, Matthew E.
Easy and efficient ensemble gene set testing with EGSEA
title Easy and efficient ensemble gene set testing with EGSEA
title_full Easy and efficient ensemble gene set testing with EGSEA
title_fullStr Easy and efficient ensemble gene set testing with EGSEA
title_full_unstemmed Easy and efficient ensemble gene set testing with EGSEA
title_short Easy and efficient ensemble gene set testing with EGSEA
title_sort easy and efficient ensemble gene set testing with egsea
topic Software Tool Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5747338/
https://www.ncbi.nlm.nih.gov/pubmed/29333246
http://dx.doi.org/10.12688/f1000research.12544.1
work_keys_str_mv AT alhamdooshmonther easyandefficientensemblegenesettestingwithegsea
AT lawcharityw easyandefficientensemblegenesettestingwithegsea
AT tianluyi easyandefficientensemblegenesettestingwithegsea
AT sheridanjuliem easyandefficientensemblegenesettestingwithegsea
AT ngmilica easyandefficientensemblegenesettestingwithegsea
AT ritchiematthewe easyandefficientensemblegenesettestingwithegsea