Cargando…
Weighted set enrichment of gene expression data
BACKGROUND: Sets of genes that are known to be associated with each other can be used to interpret microarray data. This gene set approach to microarray data analysis can illustrate patterns of gene expression which may be more informative than analyzing the expression of individual genes. Various s...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854649/ https://www.ncbi.nlm.nih.gov/pubmed/24565001 http://dx.doi.org/10.1186/1752-0509-7-S4-S10 |
_version_ | 1782294839934582784 |
---|---|
author | Qureshi, Rehman Sacan, Ahmet |
author_facet | Qureshi, Rehman Sacan, Ahmet |
author_sort | Qureshi, Rehman |
collection | PubMed |
description | BACKGROUND: Sets of genes that are known to be associated with each other can be used to interpret microarray data. This gene set approach to microarray data analysis can illustrate patterns of gene expression which may be more informative than analyzing the expression of individual genes. Various statistical approaches exist for the analysis of gene sets. There are three main classes of these methods: over-representation analysis, functional class scoring, and pathway topology based methods. METHODS: We propose weighted hypergeometric and weighted chi-squared methods in order to assign a rank to the degree to which each gene participates in the enrichment. Each gene is assigned a weight determined by the absolute value of its log fold change, which is then raised to a certain power. The power value can be adjusted as needed. Datasets from the Gene Expression Omnibus are used to test the method. The significantly enriched pathways are validated through searching the literature in order to determine their relevance to the dataset. RESULTS: Although these methods detect fewer significantly enriched pathways, they can potentially produce more relevant results. Furthermore, we compare the results of different enrichment methods on a set of microarray studies all containing data from various rodent neuropathic pain models. DISCUSSION: Our method is able to produce more consistent results than other methods when evaluated on similar datasets. It can also potentially detect relevant pathways that are not identified by the standard methods. However, the lack of biological ground truth makes validating the method difficult. |
format | Online Article Text |
id | pubmed-3854649 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38546492013-12-16 Weighted set enrichment of gene expression data Qureshi, Rehman Sacan, Ahmet BMC Syst Biol Research BACKGROUND: Sets of genes that are known to be associated with each other can be used to interpret microarray data. This gene set approach to microarray data analysis can illustrate patterns of gene expression which may be more informative than analyzing the expression of individual genes. Various statistical approaches exist for the analysis of gene sets. There are three main classes of these methods: over-representation analysis, functional class scoring, and pathway topology based methods. METHODS: We propose weighted hypergeometric and weighted chi-squared methods in order to assign a rank to the degree to which each gene participates in the enrichment. Each gene is assigned a weight determined by the absolute value of its log fold change, which is then raised to a certain power. The power value can be adjusted as needed. Datasets from the Gene Expression Omnibus are used to test the method. The significantly enriched pathways are validated through searching the literature in order to determine their relevance to the dataset. RESULTS: Although these methods detect fewer significantly enriched pathways, they can potentially produce more relevant results. Furthermore, we compare the results of different enrichment methods on a set of microarray studies all containing data from various rodent neuropathic pain models. DISCUSSION: Our method is able to produce more consistent results than other methods when evaluated on similar datasets. It can also potentially detect relevant pathways that are not identified by the standard methods. However, the lack of biological ground truth makes validating the method difficult. BioMed Central 2013-10-23 /pmc/articles/PMC3854649/ /pubmed/24565001 http://dx.doi.org/10.1186/1752-0509-7-S4-S10 Text en Copyright © 2013 Qureshi and Sacan; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Qureshi, Rehman Sacan, Ahmet Weighted set enrichment of gene expression data |
title | Weighted set enrichment of gene expression data |
title_full | Weighted set enrichment of gene expression data |
title_fullStr | Weighted set enrichment of gene expression data |
title_full_unstemmed | Weighted set enrichment of gene expression data |
title_short | Weighted set enrichment of gene expression data |
title_sort | weighted set enrichment of gene expression data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854649/ https://www.ncbi.nlm.nih.gov/pubmed/24565001 http://dx.doi.org/10.1186/1752-0509-7-S4-S10 |
work_keys_str_mv | AT qureshirehman weightedsetenrichmentofgeneexpressiondata AT sacanahmet weightedsetenrichmentofgeneexpressiondata |