Cargando…

Computation of significance scores of unweighted Gene Set Enrichment Analyses

BACKGROUND: Gene Set Enrichment Analysis (GSEA) is a computational method for the statistical evaluation of sorted lists of genes or proteins. Originally GSEA was developed for interpreting microarray gene expression data, but it can be applied to any sorted list of genes. Given the gene list and an...

Descripción completa

Detalles Bibliográficos
Autores principales:	Keller, Andreas, Backes, Christina, Lenhof, Hans-Peter
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2007
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1994690/ https://www.ncbi.nlm.nih.gov/pubmed/17683603 http://dx.doi.org/10.1186/1471-2105-8-290

_version_	1782135493198086144
author	Keller, Andreas Backes, Christina Lenhof, Hans-Peter
author_facet	Keller, Andreas Backes, Christina Lenhof, Hans-Peter
author_sort	Keller, Andreas
collection	PubMed
description	BACKGROUND: Gene Set Enrichment Analysis (GSEA) is a computational method for the statistical evaluation of sorted lists of genes or proteins. Originally GSEA was developed for interpreting microarray gene expression data, but it can be applied to any sorted list of genes. Given the gene list and an arbitrary biological category, GSEA evaluates whether the genes of the considered category are randomly distributed or accumulated on top or bottom of the list. Usually, significance scores (p-values) of GSEA are computed by nonparametric permutation tests, a time consuming procedure that yields only estimates of the p-values. RESULTS: We present a novel dynamic programming algorithm for calculating exact significance values of unweighted Gene Set Enrichment Analyses. Our algorithm avoids typical problems of nonparametric permutation tests, as varying findings in different runs caused by the random sampling procedure. Another advantage of the presented dynamic programming algorithm is its runtime and memory efficiency. To test our algorithm, we applied it not only to simulated data sets, but additionally evaluated expression profiles of squamous cell lung cancer tissue and autologous unaffected tissue.
format	Text
id	pubmed-1994690
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-19946902007-09-27 Computation of significance scores of unweighted Gene Set Enrichment Analyses Keller, Andreas Backes, Christina Lenhof, Hans-Peter BMC Bioinformatics Methodology Article BACKGROUND: Gene Set Enrichment Analysis (GSEA) is a computational method for the statistical evaluation of sorted lists of genes or proteins. Originally GSEA was developed for interpreting microarray gene expression data, but it can be applied to any sorted list of genes. Given the gene list and an arbitrary biological category, GSEA evaluates whether the genes of the considered category are randomly distributed or accumulated on top or bottom of the list. Usually, significance scores (p-values) of GSEA are computed by nonparametric permutation tests, a time consuming procedure that yields only estimates of the p-values. RESULTS: We present a novel dynamic programming algorithm for calculating exact significance values of unweighted Gene Set Enrichment Analyses. Our algorithm avoids typical problems of nonparametric permutation tests, as varying findings in different runs caused by the random sampling procedure. Another advantage of the presented dynamic programming algorithm is its runtime and memory efficiency. To test our algorithm, we applied it not only to simulated data sets, but additionally evaluated expression profiles of squamous cell lung cancer tissue and autologous unaffected tissue. BioMed Central 2007-08-06 /pmc/articles/PMC1994690/ /pubmed/17683603 http://dx.doi.org/10.1186/1471-2105-8-290 Text en Copyright © 2007 Keller et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Keller, Andreas Backes, Christina Lenhof, Hans-Peter Computation of significance scores of unweighted Gene Set Enrichment Analyses
title	Computation of significance scores of unweighted Gene Set Enrichment Analyses
title_full	Computation of significance scores of unweighted Gene Set Enrichment Analyses
title_fullStr	Computation of significance scores of unweighted Gene Set Enrichment Analyses
title_full_unstemmed	Computation of significance scores of unweighted Gene Set Enrichment Analyses
title_short	Computation of significance scores of unweighted Gene Set Enrichment Analyses
title_sort	computation of significance scores of unweighted gene set enrichment analyses
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1994690/ https://www.ncbi.nlm.nih.gov/pubmed/17683603 http://dx.doi.org/10.1186/1471-2105-8-290
work_keys_str_mv	AT kellerandreas computationofsignificancescoresofunweightedgenesetenrichmentanalyses AT backeschristina computationofsignificancescoresofunweightedgenesetenrichmentanalyses AT lenhofhanspeter computationofsignificancescoresofunweightedgenesetenrichmentanalyses

Computation of significance scores of unweighted Gene Set Enrichment Analyses

Ejemplares similares