Cargando…

The functional false discovery rate with applications to genomics

The false discovery rate (FDR) measures the proportion of false discoveries among a set of hypothesis tests called significant. This quantity is typically estimated based on p-values or test statistics. In some scenarios, there is additional information available that may be used to more accurately...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xiongzhi, Robinson, David G, Storey, John D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7846131/
https://www.ncbi.nlm.nih.gov/pubmed/31135886
http://dx.doi.org/10.1093/biostatistics/kxz010
_version_ 1783644680908439552
author Chen, Xiongzhi
Robinson, David G
Storey, John D
author_facet Chen, Xiongzhi
Robinson, David G
Storey, John D
author_sort Chen, Xiongzhi
collection PubMed
description The false discovery rate (FDR) measures the proportion of false discoveries among a set of hypothesis tests called significant. This quantity is typically estimated based on p-values or test statistics. In some scenarios, there is additional information available that may be used to more accurately estimate the FDR. We develop a new framework for formulating and estimating FDRs and q-values when an additional piece of information, which we call an “informative variable”, is available. For a given test, the informative variable provides information about the prior probability a null hypothesis is true or the power of that particular test. The FDR is then treated as a function of this informative variable. We consider two applications in genomics. Our first application is a genetics of gene expression (eQTL) experiment in yeast where every genetic marker and gene expression trait pair are tested for associations. The informative variable in this case is the distance between each genetic marker and gene. Our second application is to detect differentially expressed genes in an RNA-seq study carried out in mice. The informative variable in this study is the per-gene read depth. The framework we develop is quite general, and it should be useful in a broad range of scientific applications.
format Online
Article
Text
id pubmed-7846131
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-78461312021-02-03 The functional false discovery rate with applications to genomics Chen, Xiongzhi Robinson, David G Storey, John D Biostatistics Articles The false discovery rate (FDR) measures the proportion of false discoveries among a set of hypothesis tests called significant. This quantity is typically estimated based on p-values or test statistics. In some scenarios, there is additional information available that may be used to more accurately estimate the FDR. We develop a new framework for formulating and estimating FDRs and q-values when an additional piece of information, which we call an “informative variable”, is available. For a given test, the informative variable provides information about the prior probability a null hypothesis is true or the power of that particular test. The FDR is then treated as a function of this informative variable. We consider two applications in genomics. Our first application is a genetics of gene expression (eQTL) experiment in yeast where every genetic marker and gene expression trait pair are tested for associations. The informative variable in this case is the distance between each genetic marker and gene. Our second application is to detect differentially expressed genes in an RNA-seq study carried out in mice. The informative variable in this study is the per-gene read depth. The framework we develop is quite general, and it should be useful in a broad range of scientific applications. Oxford University Press 2019-05-28 /pmc/articles/PMC7846131/ /pubmed/31135886 http://dx.doi.org/10.1093/biostatistics/kxz010 Text en © The Author 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Chen, Xiongzhi
Robinson, David G
Storey, John D
The functional false discovery rate with applications to genomics
title The functional false discovery rate with applications to genomics
title_full The functional false discovery rate with applications to genomics
title_fullStr The functional false discovery rate with applications to genomics
title_full_unstemmed The functional false discovery rate with applications to genomics
title_short The functional false discovery rate with applications to genomics
title_sort functional false discovery rate with applications to genomics
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7846131/
https://www.ncbi.nlm.nih.gov/pubmed/31135886
http://dx.doi.org/10.1093/biostatistics/kxz010
work_keys_str_mv AT chenxiongzhi thefunctionalfalsediscoveryratewithapplicationstogenomics
AT robinsondavidg thefunctionalfalsediscoveryratewithapplicationstogenomics
AT storeyjohnd thefunctionalfalsediscoveryratewithapplicationstogenomics
AT chenxiongzhi functionalfalsediscoveryratewithapplicationstogenomics
AT robinsondavidg functionalfalsediscoveryratewithapplicationstogenomics
AT storeyjohnd functionalfalsediscoveryratewithapplicationstogenomics