Cargando…

Discovering biological connections between experimental conditions based on common patterns of differential gene expression

BACKGROUND: Identifying similarities between patterns of differential gene expression provides an opportunity to identify similarities between the experimental and biological conditions that give rise to these gene expression alterations. The growing volume of gene expression data in open data repos...

Descripción completa

Detalles Bibliográficos
Autores principales: Gower, Adam C, Spira, Avrum, Lenburg, Marc E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3203354/
https://www.ncbi.nlm.nih.gov/pubmed/21951600
http://dx.doi.org/10.1186/1471-2105-12-381
_version_ 1782215110188597248
author Gower, Adam C
Spira, Avrum
Lenburg, Marc E
author_facet Gower, Adam C
Spira, Avrum
Lenburg, Marc E
author_sort Gower, Adam C
collection PubMed
description BACKGROUND: Identifying similarities between patterns of differential gene expression provides an opportunity to identify similarities between the experimental and biological conditions that give rise to these gene expression alterations. The growing volume of gene expression data in open data repositories such as the NCBI Gene Expression Omnibus (GEO) presents an opportunity to identify these gene expression similarities on a large scale across a diverse collection of datasets. We have developed a fast, pattern-based computational approach, named openSESAME (Search of Expression Signatures Across Many Experiments), that identifies datasets enriched in samples that display coordinate differential expression of a query signature. Importantly, openSESAME performs this search without prior knowledge of the phenotypic or experimental groups in the datasets being searched. This allows openSESAME to identify perturbations of gene expression that are due to phenotypic attributes that may not have been described in the sample annotation included in the repository. To demonstrate the utility of openSESAME, we used gene expression signatures of two biological perturbations to query a set of 75,164 human expression profiles that were generated using Affymetrix microarrays and deposited in GEO. The first query, using a signature of estradiol treatment, identified experiments in which estrogen signaling was perturbed and also identified differences in estrogen signaling between estrogen receptor-positive and -negative breast cancers. The second query, which used a signature of silencing of the transcription factor p63 (a key regulator of epidermal differentiation), identified datasets related to stratified squamous epithelia or epidermal diseases such as melanoma. CONCLUSIONS: openSESAME is a tool for leveraging the growing body of publicly available microarray data to discover relationships between different biological states based on common patterns of differential gene expression. These relationships may serve to generate hypotheses about the causes and consequences of specific patterns of observed differential gene expression. To encourage others to explore the utility of this approach, we have made a website for performing openSESAME queries freely available at http://opensesame.bu.edu.
format Online
Article
Text
id pubmed-3203354
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32033542011-10-31 Discovering biological connections between experimental conditions based on common patterns of differential gene expression Gower, Adam C Spira, Avrum Lenburg, Marc E BMC Bioinformatics Methodology Article BACKGROUND: Identifying similarities between patterns of differential gene expression provides an opportunity to identify similarities between the experimental and biological conditions that give rise to these gene expression alterations. The growing volume of gene expression data in open data repositories such as the NCBI Gene Expression Omnibus (GEO) presents an opportunity to identify these gene expression similarities on a large scale across a diverse collection of datasets. We have developed a fast, pattern-based computational approach, named openSESAME (Search of Expression Signatures Across Many Experiments), that identifies datasets enriched in samples that display coordinate differential expression of a query signature. Importantly, openSESAME performs this search without prior knowledge of the phenotypic or experimental groups in the datasets being searched. This allows openSESAME to identify perturbations of gene expression that are due to phenotypic attributes that may not have been described in the sample annotation included in the repository. To demonstrate the utility of openSESAME, we used gene expression signatures of two biological perturbations to query a set of 75,164 human expression profiles that were generated using Affymetrix microarrays and deposited in GEO. The first query, using a signature of estradiol treatment, identified experiments in which estrogen signaling was perturbed and also identified differences in estrogen signaling between estrogen receptor-positive and -negative breast cancers. The second query, which used a signature of silencing of the transcription factor p63 (a key regulator of epidermal differentiation), identified datasets related to stratified squamous epithelia or epidermal diseases such as melanoma. CONCLUSIONS: openSESAME is a tool for leveraging the growing body of publicly available microarray data to discover relationships between different biological states based on common patterns of differential gene expression. These relationships may serve to generate hypotheses about the causes and consequences of specific patterns of observed differential gene expression. To encourage others to explore the utility of this approach, we have made a website for performing openSESAME queries freely available at http://opensesame.bu.edu. BioMed Central 2011-09-27 /pmc/articles/PMC3203354/ /pubmed/21951600 http://dx.doi.org/10.1186/1471-2105-12-381 Text en Copyright ©2011 Gower et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Gower, Adam C
Spira, Avrum
Lenburg, Marc E
Discovering biological connections between experimental conditions based on common patterns of differential gene expression
title Discovering biological connections between experimental conditions based on common patterns of differential gene expression
title_full Discovering biological connections between experimental conditions based on common patterns of differential gene expression
title_fullStr Discovering biological connections between experimental conditions based on common patterns of differential gene expression
title_full_unstemmed Discovering biological connections between experimental conditions based on common patterns of differential gene expression
title_short Discovering biological connections between experimental conditions based on common patterns of differential gene expression
title_sort discovering biological connections between experimental conditions based on common patterns of differential gene expression
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3203354/
https://www.ncbi.nlm.nih.gov/pubmed/21951600
http://dx.doi.org/10.1186/1471-2105-12-381
work_keys_str_mv AT goweradamc discoveringbiologicalconnectionsbetweenexperimentalconditionsbasedoncommonpatternsofdifferentialgeneexpression
AT spiraavrum discoveringbiologicalconnectionsbetweenexperimentalconditionsbasedoncommonpatternsofdifferentialgeneexpression
AT lenburgmarce discoveringbiologicalconnectionsbetweenexperimentalconditionsbasedoncommonpatternsofdifferentialgeneexpression