Cargando…

FDR-FET: an optimizing gene set enrichment analysis method

Gene set enrichment analysis for analyzing large profiling and screening experiments can reveal unifying biological schemes based on previously accumulated knowledge represented as “gene sets”. Most of the existing implementations use a fixed fold-change or P value cutoff to generate regulated gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Ji, Rui-Ru, Ott, Karl-Heinz, Yordanova, Roumyana, Bruccoleri, Robert E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove Medical Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3169954/
https://www.ncbi.nlm.nih.gov/pubmed/21918636
http://dx.doi.org/10.2147/AABC.S15840
_version_ 1782211555113304064
author Ji, Rui-Ru
Ott, Karl-Heinz
Yordanova, Roumyana
Bruccoleri, Robert E
author_facet Ji, Rui-Ru
Ott, Karl-Heinz
Yordanova, Roumyana
Bruccoleri, Robert E
author_sort Ji, Rui-Ru
collection PubMed
description Gene set enrichment analysis for analyzing large profiling and screening experiments can reveal unifying biological schemes based on previously accumulated knowledge represented as “gene sets”. Most of the existing implementations use a fixed fold-change or P value cutoff to generate regulated gene lists. However, the threshold selection in most cases is arbitrary, and has a significant effect on the test outcome and interpretation of the experiment. We developed a new gene set enrichment analysis method, ie, FDR-FET, which dynamically optimizes the threshold choice and improves the sensitivity and selectivity of gene set enrichment analysis. The procedure translates experimental results into a series of regulated gene lists at multiple false discovery rate (FDR) cutoffs, and computes the P value of the overrepresentation of a gene set using a Fisher’s exact test (FET) in each of these gene lists. The lowest P value is retained to represent the significance of the gene set. We also implemented improved methods to define a more relevant global reference set for the FET. We demonstrate the validity of the method using a published microarray study of three protease inhibitors of the human immunodeficiency virus and compare the results with those from other popular gene set enrichment analysis algorithms. Our results show that combining FDR with multiple cutoffs allows us to control the error while retaining genes that increase information content. We conclude that FDR-FET can selectively identify significant affected biological processes. Our method can be used for any user-generated gene list in the area of transcriptome, proteome, and other biological and scientific applications.
format Online
Article
Text
id pubmed-3169954
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Dove Medical Press
record_format MEDLINE/PubMed
spelling pubmed-31699542011-09-14 FDR-FET: an optimizing gene set enrichment analysis method Ji, Rui-Ru Ott, Karl-Heinz Yordanova, Roumyana Bruccoleri, Robert E Adv Appl Bioinforma Chem Methodology Gene set enrichment analysis for analyzing large profiling and screening experiments can reveal unifying biological schemes based on previously accumulated knowledge represented as “gene sets”. Most of the existing implementations use a fixed fold-change or P value cutoff to generate regulated gene lists. However, the threshold selection in most cases is arbitrary, and has a significant effect on the test outcome and interpretation of the experiment. We developed a new gene set enrichment analysis method, ie, FDR-FET, which dynamically optimizes the threshold choice and improves the sensitivity and selectivity of gene set enrichment analysis. The procedure translates experimental results into a series of regulated gene lists at multiple false discovery rate (FDR) cutoffs, and computes the P value of the overrepresentation of a gene set using a Fisher’s exact test (FET) in each of these gene lists. The lowest P value is retained to represent the significance of the gene set. We also implemented improved methods to define a more relevant global reference set for the FET. We demonstrate the validity of the method using a published microarray study of three protease inhibitors of the human immunodeficiency virus and compare the results with those from other popular gene set enrichment analysis algorithms. Our results show that combining FDR with multiple cutoffs allows us to control the error while retaining genes that increase information content. We conclude that FDR-FET can selectively identify significant affected biological processes. Our method can be used for any user-generated gene list in the area of transcriptome, proteome, and other biological and scientific applications. Dove Medical Press 2011-03-15 /pmc/articles/PMC3169954/ /pubmed/21918636 http://dx.doi.org/10.2147/AABC.S15840 Text en © 2011 Ji et al, publisher and licensee Dove Medical Press Ltd. This is an Open Access article which permits unrestricted noncommercial use, provided the original work is properly cited.
spellingShingle Methodology
Ji, Rui-Ru
Ott, Karl-Heinz
Yordanova, Roumyana
Bruccoleri, Robert E
FDR-FET: an optimizing gene set enrichment analysis method
title FDR-FET: an optimizing gene set enrichment analysis method
title_full FDR-FET: an optimizing gene set enrichment analysis method
title_fullStr FDR-FET: an optimizing gene set enrichment analysis method
title_full_unstemmed FDR-FET: an optimizing gene set enrichment analysis method
title_short FDR-FET: an optimizing gene set enrichment analysis method
title_sort fdr-fet: an optimizing gene set enrichment analysis method
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3169954/
https://www.ncbi.nlm.nih.gov/pubmed/21918636
http://dx.doi.org/10.2147/AABC.S15840
work_keys_str_mv AT jiruiru fdrfetanoptimizinggenesetenrichmentanalysismethod
AT ottkarlheinz fdrfetanoptimizinggenesetenrichmentanalysismethod
AT yordanovaroumyana fdrfetanoptimizinggenesetenrichmentanalysismethod
AT bruccoleriroberte fdrfetanoptimizinggenesetenrichmentanalysismethod