Cargando…
BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses
BACKGROUND: Recent advances in the analysis of high-throughput expression data have led to the development of tools that scaled-up their focus from single-gene to gene set level. For example, the popular Gene Set Enrichment Analysis (GSEA) algorithm can detect moderate but coordinated expression cha...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4617899/ https://www.ncbi.nlm.nih.gov/pubmed/26481321 http://dx.doi.org/10.1186/s12864-015-2012-4 |
_version_ | 1782396858552811520 |
---|---|
author | Spinelli, Lionel Carpentier, Sabrina Montañana Sanchis, Frédéric Dalod, Marc Vu Manh, Thien-Phong |
author_facet | Spinelli, Lionel Carpentier, Sabrina Montañana Sanchis, Frédéric Dalod, Marc Vu Manh, Thien-Phong |
author_sort | Spinelli, Lionel |
collection | PubMed |
description | BACKGROUND: Recent advances in the analysis of high-throughput expression data have led to the development of tools that scaled-up their focus from single-gene to gene set level. For example, the popular Gene Set Enrichment Analysis (GSEA) algorithm can detect moderate but coordinated expression changes of groups of presumably related genes between pairs of experimental conditions. This considerably improves extraction of information from high-throughput gene expression data. However, although many gene sets covering a large panel of biological fields are available in public databases, the ability to generate home-made gene sets relevant to one’s biological question is crucial but remains a substantial challenge to most biologists lacking statistic or bioinformatic expertise. This is all the more the case when attempting to define a gene set specific of one condition compared to many other ones. Thus, there is a crucial need for an easy-to-use software for generation of relevant home-made gene sets from complex datasets, their use in GSEA, and the correction of the results when applied to multiple comparisons of many experimental conditions. RESULT: We developed BubbleGUM (GSEA Unlimited Map), a tool that allows to automatically extract molecular signatures from transcriptomic data and perform exhaustive GSEA with multiple testing correction. One original feature of BubbleGUM notably resides in its capacity to integrate and compare numerous GSEA results into an easy-to-grasp graphical representation. We applied our method to generate transcriptomic fingerprints for murine cell types and to assess their enrichments in human cell types. This analysis allowed us to confirm homologies between mouse and human immunocytes. CONCLUSIONS: BubbleGUM is an open-source software that allows to automatically generate molecular signatures out of complex expression datasets and to assess directly their enrichment by GSEA on independent datasets. Enrichments are displayed in a graphical output that helps interpreting the results. This innovative methodology has recently been used to answer important questions in functional genomics, such as the degree of similarities between microarray datasets from different laboratories or with different experimental models or clinical cohorts. BubbleGUM is executable through an intuitive interface so that both bioinformaticians and biologists can use it. It is available at http://www.ciml.univ-mrs.fr/applications/BubbleGUM/index.html. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2012-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4617899 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46178992015-10-25 BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses Spinelli, Lionel Carpentier, Sabrina Montañana Sanchis, Frédéric Dalod, Marc Vu Manh, Thien-Phong BMC Genomics Research Article BACKGROUND: Recent advances in the analysis of high-throughput expression data have led to the development of tools that scaled-up their focus from single-gene to gene set level. For example, the popular Gene Set Enrichment Analysis (GSEA) algorithm can detect moderate but coordinated expression changes of groups of presumably related genes between pairs of experimental conditions. This considerably improves extraction of information from high-throughput gene expression data. However, although many gene sets covering a large panel of biological fields are available in public databases, the ability to generate home-made gene sets relevant to one’s biological question is crucial but remains a substantial challenge to most biologists lacking statistic or bioinformatic expertise. This is all the more the case when attempting to define a gene set specific of one condition compared to many other ones. Thus, there is a crucial need for an easy-to-use software for generation of relevant home-made gene sets from complex datasets, their use in GSEA, and the correction of the results when applied to multiple comparisons of many experimental conditions. RESULT: We developed BubbleGUM (GSEA Unlimited Map), a tool that allows to automatically extract molecular signatures from transcriptomic data and perform exhaustive GSEA with multiple testing correction. One original feature of BubbleGUM notably resides in its capacity to integrate and compare numerous GSEA results into an easy-to-grasp graphical representation. We applied our method to generate transcriptomic fingerprints for murine cell types and to assess their enrichments in human cell types. This analysis allowed us to confirm homologies between mouse and human immunocytes. CONCLUSIONS: BubbleGUM is an open-source software that allows to automatically generate molecular signatures out of complex expression datasets and to assess directly their enrichment by GSEA on independent datasets. Enrichments are displayed in a graphical output that helps interpreting the results. This innovative methodology has recently been used to answer important questions in functional genomics, such as the degree of similarities between microarray datasets from different laboratories or with different experimental models or clinical cohorts. BubbleGUM is executable through an intuitive interface so that both bioinformaticians and biologists can use it. It is available at http://www.ciml.univ-mrs.fr/applications/BubbleGUM/index.html. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2012-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-10-19 /pmc/articles/PMC4617899/ /pubmed/26481321 http://dx.doi.org/10.1186/s12864-015-2012-4 Text en © Spinelli et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Spinelli, Lionel Carpentier, Sabrina Montañana Sanchis, Frédéric Dalod, Marc Vu Manh, Thien-Phong BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses |
title | BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses |
title_full | BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses |
title_fullStr | BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses |
title_full_unstemmed | BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses |
title_short | BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses |
title_sort | bubblegum: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple gene set enrichment analyses |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4617899/ https://www.ncbi.nlm.nih.gov/pubmed/26481321 http://dx.doi.org/10.1186/s12864-015-2012-4 |
work_keys_str_mv | AT spinellilionel bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses AT carpentiersabrina bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses AT montananasanchisfrederic bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses AT dalodmarc bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses AT vumanhthienphong bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses |