Cargando…
Content-based microarray search using differential expression profiles
BACKGROUND: With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, m...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3022631/ https://www.ncbi.nlm.nih.gov/pubmed/21172034 http://dx.doi.org/10.1186/1471-2105-11-603 |
_version_ | 1782196537426706432 |
---|---|
author | Engreitz, Jesse M Morgan, Alexander A Dudley, Joel T Chen, Rong Thathoo, Rahul Altman, Russ B Butte, Atul J |
author_facet | Engreitz, Jesse M Morgan, Alexander A Dudley, Joel T Chen, Rong Thathoo, Rahul Altman, Russ B Butte, Atul J |
author_sort | Engreitz, Jesse M |
collection | PubMed |
description | BACKGROUND: With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations. RESULTS: We develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and p-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3. CONCLUSIONS: Content-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets. |
format | Text |
id | pubmed-3022631 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30226312011-01-20 Content-based microarray search using differential expression profiles Engreitz, Jesse M Morgan, Alexander A Dudley, Joel T Chen, Rong Thathoo, Rahul Altman, Russ B Butte, Atul J BMC Bioinformatics Research Article BACKGROUND: With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations. RESULTS: We develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and p-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3. CONCLUSIONS: Content-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets. BioMed Central 2010-12-21 /pmc/articles/PMC3022631/ /pubmed/21172034 http://dx.doi.org/10.1186/1471-2105-11-603 Text en Copyright ©2010 Engreitz et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Engreitz, Jesse M Morgan, Alexander A Dudley, Joel T Chen, Rong Thathoo, Rahul Altman, Russ B Butte, Atul J Content-based microarray search using differential expression profiles |
title | Content-based microarray search using differential expression profiles |
title_full | Content-based microarray search using differential expression profiles |
title_fullStr | Content-based microarray search using differential expression profiles |
title_full_unstemmed | Content-based microarray search using differential expression profiles |
title_short | Content-based microarray search using differential expression profiles |
title_sort | content-based microarray search using differential expression profiles |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3022631/ https://www.ncbi.nlm.nih.gov/pubmed/21172034 http://dx.doi.org/10.1186/1471-2105-11-603 |
work_keys_str_mv | AT engreitzjessem contentbasedmicroarraysearchusingdifferentialexpressionprofiles AT morganalexandera contentbasedmicroarraysearchusingdifferentialexpressionprofiles AT dudleyjoelt contentbasedmicroarraysearchusingdifferentialexpressionprofiles AT chenrong contentbasedmicroarraysearchusingdifferentialexpressionprofiles AT thathoorahul contentbasedmicroarraysearchusingdifferentialexpressionprofiles AT altmanrussb contentbasedmicroarraysearchusingdifferentialexpressionprofiles AT butteatulj contentbasedmicroarraysearchusingdifferentialexpressionprofiles |