Cargando…

BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses

BACKGROUND: Recent advances in the analysis of high-throughput expression data have led to the development of tools that scaled-up their focus from single-gene to gene set level. For example, the popular Gene Set Enrichment Analysis (GSEA) algorithm can detect moderate but coordinated expression cha...

Descripción completa

Detalles Bibliográficos
Autores principales: Spinelli, Lionel, Carpentier, Sabrina, Montañana Sanchis, Frédéric, Dalod, Marc, Vu Manh, Thien-Phong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4617899/
https://www.ncbi.nlm.nih.gov/pubmed/26481321
http://dx.doi.org/10.1186/s12864-015-2012-4
_version_ 1782396858552811520
author Spinelli, Lionel
Carpentier, Sabrina
Montañana Sanchis, Frédéric
Dalod, Marc
Vu Manh, Thien-Phong
author_facet Spinelli, Lionel
Carpentier, Sabrina
Montañana Sanchis, Frédéric
Dalod, Marc
Vu Manh, Thien-Phong
author_sort Spinelli, Lionel
collection PubMed
description BACKGROUND: Recent advances in the analysis of high-throughput expression data have led to the development of tools that scaled-up their focus from single-gene to gene set level. For example, the popular Gene Set Enrichment Analysis (GSEA) algorithm can detect moderate but coordinated expression changes of groups of presumably related genes between pairs of experimental conditions. This considerably improves extraction of information from high-throughput gene expression data. However, although many gene sets covering a large panel of biological fields are available in public databases, the ability to generate home-made gene sets relevant to one’s biological question is crucial but remains a substantial challenge to most biologists lacking statistic or bioinformatic expertise. This is all the more the case when attempting to define a gene set specific of one condition compared to many other ones. Thus, there is a crucial need for an easy-to-use software for generation of relevant home-made gene sets from complex datasets, their use in GSEA, and the correction of the results when applied to multiple comparisons of many experimental conditions. RESULT: We developed BubbleGUM (GSEA Unlimited Map), a tool that allows to automatically extract molecular signatures from transcriptomic data and perform exhaustive GSEA with multiple testing correction. One original feature of BubbleGUM notably resides in its capacity to integrate and compare numerous GSEA results into an easy-to-grasp graphical representation. We applied our method to generate transcriptomic fingerprints for murine cell types and to assess their enrichments in human cell types. This analysis allowed us to confirm homologies between mouse and human immunocytes. CONCLUSIONS: BubbleGUM is an open-source software that allows to automatically generate molecular signatures out of complex expression datasets and to assess directly their enrichment by GSEA on independent datasets. Enrichments are displayed in a graphical output that helps interpreting the results. This innovative methodology has recently been used to answer important questions in functional genomics, such as the degree of similarities between microarray datasets from different laboratories or with different experimental models or clinical cohorts. BubbleGUM is executable through an intuitive interface so that both bioinformaticians and biologists can use it. It is available at http://www.ciml.univ-mrs.fr/applications/BubbleGUM/index.html. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2012-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4617899
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46178992015-10-25 BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses Spinelli, Lionel Carpentier, Sabrina Montañana Sanchis, Frédéric Dalod, Marc Vu Manh, Thien-Phong BMC Genomics Research Article BACKGROUND: Recent advances in the analysis of high-throughput expression data have led to the development of tools that scaled-up their focus from single-gene to gene set level. For example, the popular Gene Set Enrichment Analysis (GSEA) algorithm can detect moderate but coordinated expression changes of groups of presumably related genes between pairs of experimental conditions. This considerably improves extraction of information from high-throughput gene expression data. However, although many gene sets covering a large panel of biological fields are available in public databases, the ability to generate home-made gene sets relevant to one’s biological question is crucial but remains a substantial challenge to most biologists lacking statistic or bioinformatic expertise. This is all the more the case when attempting to define a gene set specific of one condition compared to many other ones. Thus, there is a crucial need for an easy-to-use software for generation of relevant home-made gene sets from complex datasets, their use in GSEA, and the correction of the results when applied to multiple comparisons of many experimental conditions. RESULT: We developed BubbleGUM (GSEA Unlimited Map), a tool that allows to automatically extract molecular signatures from transcriptomic data and perform exhaustive GSEA with multiple testing correction. One original feature of BubbleGUM notably resides in its capacity to integrate and compare numerous GSEA results into an easy-to-grasp graphical representation. We applied our method to generate transcriptomic fingerprints for murine cell types and to assess their enrichments in human cell types. This analysis allowed us to confirm homologies between mouse and human immunocytes. CONCLUSIONS: BubbleGUM is an open-source software that allows to automatically generate molecular signatures out of complex expression datasets and to assess directly their enrichment by GSEA on independent datasets. Enrichments are displayed in a graphical output that helps interpreting the results. This innovative methodology has recently been used to answer important questions in functional genomics, such as the degree of similarities between microarray datasets from different laboratories or with different experimental models or clinical cohorts. BubbleGUM is executable through an intuitive interface so that both bioinformaticians and biologists can use it. It is available at http://www.ciml.univ-mrs.fr/applications/BubbleGUM/index.html. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2012-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-10-19 /pmc/articles/PMC4617899/ /pubmed/26481321 http://dx.doi.org/10.1186/s12864-015-2012-4 Text en © Spinelli et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Spinelli, Lionel
Carpentier, Sabrina
Montañana Sanchis, Frédéric
Dalod, Marc
Vu Manh, Thien-Phong
BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses
title BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses
title_full BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses
title_fullStr BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses
title_full_unstemmed BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses
title_short BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses
title_sort bubblegum: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple gene set enrichment analyses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4617899/
https://www.ncbi.nlm.nih.gov/pubmed/26481321
http://dx.doi.org/10.1186/s12864-015-2012-4
work_keys_str_mv AT spinellilionel bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses
AT carpentiersabrina bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses
AT montananasanchisfrederic bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses
AT dalodmarc bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses
AT vumanhthienphong bubblegumautomaticextractionofphenotypemolecularsignaturesandcomprehensivevisualizationofmultiplegenesetenrichmentanalyses