Cargando…

A unified approach to false discovery rate estimation

BACKGROUND: False discovery rate (FDR) methods play an important role in analyzing high-dimensional data. There are two types of FDR, tail area-based FDR and local FDR, as well as numerous statistical algorithms for estimating or controlling FDR. These differ in terms of underlying test statistics a...

Descripción completa

Detalles Bibliográficos
Autor principal: Strimmer, Korbinian
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2475539/
https://www.ncbi.nlm.nih.gov/pubmed/18613966
http://dx.doi.org/10.1186/1471-2105-9-303
_version_ 1782157557469544448
author Strimmer, Korbinian
author_facet Strimmer, Korbinian
author_sort Strimmer, Korbinian
collection PubMed
description BACKGROUND: False discovery rate (FDR) methods play an important role in analyzing high-dimensional data. There are two types of FDR, tail area-based FDR and local FDR, as well as numerous statistical algorithms for estimating or controlling FDR. These differ in terms of underlying test statistics and procedures employed for statistical learning. RESULTS: A unifying algorithm for simultaneous estimation of both local FDR and tail area-based FDR is presented that can be applied to a diverse range of test statistics, including p-values, correlations, z- and t-scores. This approach is semipararametric and is based on a modified Grenander density estimator. For test statistics other than p-values it allows for empirical null modeling, so that dependencies among tests can be taken into account. The inference of the underlying model employs truncated maximum-likelihood estimation, with the cut-off point chosen according to the false non-discovery rate. CONCLUSION: The proposed procedure generalizes a number of more specialized algorithms and thus offers a common framework for FDR estimation consistent across test statistics and types of FDR. In comparative study the unified approach performs on par with the best competing yet more specialized alternatives. The algorithm is implemented in R in the "fdrtool" package, available under the GNU GPL from and from the R package archive CRAN.
format Text
id pubmed-2475539
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24755392008-07-21 A unified approach to false discovery rate estimation Strimmer, Korbinian BMC Bioinformatics Methodology Article BACKGROUND: False discovery rate (FDR) methods play an important role in analyzing high-dimensional data. There are two types of FDR, tail area-based FDR and local FDR, as well as numerous statistical algorithms for estimating or controlling FDR. These differ in terms of underlying test statistics and procedures employed for statistical learning. RESULTS: A unifying algorithm for simultaneous estimation of both local FDR and tail area-based FDR is presented that can be applied to a diverse range of test statistics, including p-values, correlations, z- and t-scores. This approach is semipararametric and is based on a modified Grenander density estimator. For test statistics other than p-values it allows for empirical null modeling, so that dependencies among tests can be taken into account. The inference of the underlying model employs truncated maximum-likelihood estimation, with the cut-off point chosen according to the false non-discovery rate. CONCLUSION: The proposed procedure generalizes a number of more specialized algorithms and thus offers a common framework for FDR estimation consistent across test statistics and types of FDR. In comparative study the unified approach performs on par with the best competing yet more specialized alternatives. The algorithm is implemented in R in the "fdrtool" package, available under the GNU GPL from and from the R package archive CRAN. BioMed Central 2008-07-09 /pmc/articles/PMC2475539/ /pubmed/18613966 http://dx.doi.org/10.1186/1471-2105-9-303 Text en Copyright © 2008 Strimmer; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Strimmer, Korbinian
A unified approach to false discovery rate estimation
title A unified approach to false discovery rate estimation
title_full A unified approach to false discovery rate estimation
title_fullStr A unified approach to false discovery rate estimation
title_full_unstemmed A unified approach to false discovery rate estimation
title_short A unified approach to false discovery rate estimation
title_sort unified approach to false discovery rate estimation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2475539/
https://www.ncbi.nlm.nih.gov/pubmed/18613966
http://dx.doi.org/10.1186/1471-2105-9-303
work_keys_str_mv AT strimmerkorbinian aunifiedapproachtofalsediscoveryrateestimation
AT strimmerkorbinian unifiedapproachtofalsediscoveryrateestimation