Cargando…
A direct approach to estimating false discovery rates conditional on covariates
Modern scientific studies from many diverse areas of research abound with multiple hypothesis testing concerns. The false discovery rate (FDR) is one of the most commonly used approaches for measuring and controlling error rates when performing multiple tests. Adaptive FDRs rely on an estimate of th...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6292380/ https://www.ncbi.nlm.nih.gov/pubmed/30581661 http://dx.doi.org/10.7717/peerj.6035 |
_version_ | 1783380387830956032 |
---|---|
author | Boca, Simina M. Leek, Jeffrey T. |
author_facet | Boca, Simina M. Leek, Jeffrey T. |
author_sort | Boca, Simina M. |
collection | PubMed |
description | Modern scientific studies from many diverse areas of research abound with multiple hypothesis testing concerns. The false discovery rate (FDR) is one of the most commonly used approaches for measuring and controlling error rates when performing multiple tests. Adaptive FDRs rely on an estimate of the proportion of null hypotheses among all the hypotheses being tested. This proportion is typically estimated once for each collection of hypotheses. Here, we propose a regression framework to estimate the proportion of null hypotheses conditional on observed covariates. This may then be used as a multiplication factor with the Benjamini–Hochberg adjusted p-values, leading to a plug-in FDR estimator. We apply our method to a genome-wise association meta-analysis for body mass index. In our framework, we are able to use the sample sizes for the individual genomic loci and the minor allele frequencies as covariates. We further evaluate our approach via a number of simulation scenarios. We provide an implementation of this novel method for estimating the proportion of null hypotheses in a regression framework as part of the Bioconductor package swfdr. |
format | Online Article Text |
id | pubmed-6292380 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-62923802018-12-21 A direct approach to estimating false discovery rates conditional on covariates Boca, Simina M. Leek, Jeffrey T. PeerJ Bioinformatics Modern scientific studies from many diverse areas of research abound with multiple hypothesis testing concerns. The false discovery rate (FDR) is one of the most commonly used approaches for measuring and controlling error rates when performing multiple tests. Adaptive FDRs rely on an estimate of the proportion of null hypotheses among all the hypotheses being tested. This proportion is typically estimated once for each collection of hypotheses. Here, we propose a regression framework to estimate the proportion of null hypotheses conditional on observed covariates. This may then be used as a multiplication factor with the Benjamini–Hochberg adjusted p-values, leading to a plug-in FDR estimator. We apply our method to a genome-wise association meta-analysis for body mass index. In our framework, we are able to use the sample sizes for the individual genomic loci and the minor allele frequencies as covariates. We further evaluate our approach via a number of simulation scenarios. We provide an implementation of this novel method for estimating the proportion of null hypotheses in a regression framework as part of the Bioconductor package swfdr. PeerJ Inc. 2018-12-10 /pmc/articles/PMC6292380/ /pubmed/30581661 http://dx.doi.org/10.7717/peerj.6035 Text en © 2018 Boca and Leek http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Boca, Simina M. Leek, Jeffrey T. A direct approach to estimating false discovery rates conditional on covariates |
title | A direct approach to estimating false discovery rates conditional on covariates |
title_full | A direct approach to estimating false discovery rates conditional on covariates |
title_fullStr | A direct approach to estimating false discovery rates conditional on covariates |
title_full_unstemmed | A direct approach to estimating false discovery rates conditional on covariates |
title_short | A direct approach to estimating false discovery rates conditional on covariates |
title_sort | direct approach to estimating false discovery rates conditional on covariates |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6292380/ https://www.ncbi.nlm.nih.gov/pubmed/30581661 http://dx.doi.org/10.7717/peerj.6035 |
work_keys_str_mv | AT bocasiminam adirectapproachtoestimatingfalsediscoveryratesconditionaloncovariates AT leekjeffreyt adirectapproachtoestimatingfalsediscoveryratesconditionaloncovariates AT bocasiminam directapproachtoestimatingfalsediscoveryratesconditionaloncovariates AT leekjeffreyt directapproachtoestimatingfalsediscoveryratesconditionaloncovariates |