Cargando…

A direct approach to estimating false discovery rates conditional on covariates

Modern scientific studies from many diverse areas of research abound with multiple hypothesis testing concerns. The false discovery rate (FDR) is one of the most commonly used approaches for measuring and controlling error rates when performing multiple tests. Adaptive FDRs rely on an estimate of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Boca, Simina M., Leek, Jeffrey T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6292380/
https://www.ncbi.nlm.nih.gov/pubmed/30581661
http://dx.doi.org/10.7717/peerj.6035
_version_ 1783380387830956032
author Boca, Simina M.
Leek, Jeffrey T.
author_facet Boca, Simina M.
Leek, Jeffrey T.
author_sort Boca, Simina M.
collection PubMed
description Modern scientific studies from many diverse areas of research abound with multiple hypothesis testing concerns. The false discovery rate (FDR) is one of the most commonly used approaches for measuring and controlling error rates when performing multiple tests. Adaptive FDRs rely on an estimate of the proportion of null hypotheses among all the hypotheses being tested. This proportion is typically estimated once for each collection of hypotheses. Here, we propose a regression framework to estimate the proportion of null hypotheses conditional on observed covariates. This may then be used as a multiplication factor with the Benjamini–Hochberg adjusted p-values, leading to a plug-in FDR estimator. We apply our method to a genome-wise association meta-analysis for body mass index. In our framework, we are able to use the sample sizes for the individual genomic loci and the minor allele frequencies as covariates. We further evaluate our approach via a number of simulation scenarios. We provide an implementation of this novel method for estimating the proportion of null hypotheses in a regression framework as part of the Bioconductor package swfdr.
format Online
Article
Text
id pubmed-6292380
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-62923802018-12-21 A direct approach to estimating false discovery rates conditional on covariates Boca, Simina M. Leek, Jeffrey T. PeerJ Bioinformatics Modern scientific studies from many diverse areas of research abound with multiple hypothesis testing concerns. The false discovery rate (FDR) is one of the most commonly used approaches for measuring and controlling error rates when performing multiple tests. Adaptive FDRs rely on an estimate of the proportion of null hypotheses among all the hypotheses being tested. This proportion is typically estimated once for each collection of hypotheses. Here, we propose a regression framework to estimate the proportion of null hypotheses conditional on observed covariates. This may then be used as a multiplication factor with the Benjamini–Hochberg adjusted p-values, leading to a plug-in FDR estimator. We apply our method to a genome-wise association meta-analysis for body mass index. In our framework, we are able to use the sample sizes for the individual genomic loci and the minor allele frequencies as covariates. We further evaluate our approach via a number of simulation scenarios. We provide an implementation of this novel method for estimating the proportion of null hypotheses in a regression framework as part of the Bioconductor package swfdr. PeerJ Inc. 2018-12-10 /pmc/articles/PMC6292380/ /pubmed/30581661 http://dx.doi.org/10.7717/peerj.6035 Text en © 2018 Boca and Leek http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Boca, Simina M.
Leek, Jeffrey T.
A direct approach to estimating false discovery rates conditional on covariates
title A direct approach to estimating false discovery rates conditional on covariates
title_full A direct approach to estimating false discovery rates conditional on covariates
title_fullStr A direct approach to estimating false discovery rates conditional on covariates
title_full_unstemmed A direct approach to estimating false discovery rates conditional on covariates
title_short A direct approach to estimating false discovery rates conditional on covariates
title_sort direct approach to estimating false discovery rates conditional on covariates
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6292380/
https://www.ncbi.nlm.nih.gov/pubmed/30581661
http://dx.doi.org/10.7717/peerj.6035
work_keys_str_mv AT bocasiminam adirectapproachtoestimatingfalsediscoveryratesconditionaloncovariates
AT leekjeffreyt adirectapproachtoestimatingfalsediscoveryratesconditionaloncovariates
AT bocasiminam directapproachtoestimatingfalsediscoveryratesconditionaloncovariates
AT leekjeffreyt directapproachtoestimatingfalsediscoveryratesconditionaloncovariates