Cargando…
4497 Accessible False Discovery Rate Computation
OBJECTIVES/GOALS: To improve the implementation of FDRs in translation research. Current statistical packages are hard to use and fail to adequately convey strong assumptions. We developed a software package that allows the user to decide on assumptions and choose the hey desire. We encourage wider...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cambridge University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8822935/ http://dx.doi.org/10.1017/cts.2020.164 |
_version_ | 1784646706661425152 |
---|---|
author | Hollister, Megan C Blume, Jeffrey D. |
author_facet | Hollister, Megan C Blume, Jeffrey D. |
author_sort | Hollister, Megan C |
collection | PubMed |
description | OBJECTIVES/GOALS: To improve the implementation of FDRs in translation research. Current statistical packages are hard to use and fail to adequately convey strong assumptions. We developed a software package that allows the user to decide on assumptions and choose the hey desire. We encourage wider reporting of FDRs for observed findings. METHODS/STUDY POPULATION: We developed a user-friendly R function for computing FDRs from observed p-values. A variety of methods for FDR estimation and for FDR control are included so the user can select the approach most appropriate for their setting. Options include Efron’s Empirical Bayes FDR, Benjamini-Hochberg FDR control for multiple testing, Lindsey’s method for smoothing empirical distributions, estimation of the mixing proportion, and central matching. We illustrate the important difference between estimating the FDR for a particular finding and adjusting a hypothesis test to control the false discovery propensity. RESULTS/ANTICIPATED RESULTS: We performed a comparison of the capabilities of our new p.fdr function to the popular p.adjust function from the base stats-package. Specifically, we examined multiple examples of data coming from different unknown mixture distributions to highlight the null estimation methods p.fdr includes. The base package does not provide the optimal FDR usage nor sufficient estimation options. We also compared the step-up/step-down procedure used in adjusted p-value hypothesis test and discuss when this is inappropriate. The p.adjust function is not able to report raw-adjusted values and this will be shown in the graphical results. DISCUSSION/SIGNIFICANCE OF IMPACT: FDRs reveal the propensity for an observed result to be incorrect. FDRs should accompany observed results to help contextualize the relevance and potential impact of research findings. Our results show that previous methods are not sufficient rich or precise in their calculations. Our new package allows the user to be in control of the null estimation and step-up implementation when reporting FDRs. |
format | Online Article Text |
id | pubmed-8822935 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Cambridge University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-88229352022-02-18 4497 Accessible False Discovery Rate Computation Hollister, Megan C Blume, Jeffrey D. J Clin Transl Sci Data Science/Biostatistics/Informatics OBJECTIVES/GOALS: To improve the implementation of FDRs in translation research. Current statistical packages are hard to use and fail to adequately convey strong assumptions. We developed a software package that allows the user to decide on assumptions and choose the hey desire. We encourage wider reporting of FDRs for observed findings. METHODS/STUDY POPULATION: We developed a user-friendly R function for computing FDRs from observed p-values. A variety of methods for FDR estimation and for FDR control are included so the user can select the approach most appropriate for their setting. Options include Efron’s Empirical Bayes FDR, Benjamini-Hochberg FDR control for multiple testing, Lindsey’s method for smoothing empirical distributions, estimation of the mixing proportion, and central matching. We illustrate the important difference between estimating the FDR for a particular finding and adjusting a hypothesis test to control the false discovery propensity. RESULTS/ANTICIPATED RESULTS: We performed a comparison of the capabilities of our new p.fdr function to the popular p.adjust function from the base stats-package. Specifically, we examined multiple examples of data coming from different unknown mixture distributions to highlight the null estimation methods p.fdr includes. The base package does not provide the optimal FDR usage nor sufficient estimation options. We also compared the step-up/step-down procedure used in adjusted p-value hypothesis test and discuss when this is inappropriate. The p.adjust function is not able to report raw-adjusted values and this will be shown in the graphical results. DISCUSSION/SIGNIFICANCE OF IMPACT: FDRs reveal the propensity for an observed result to be incorrect. FDRs should accompany observed results to help contextualize the relevance and potential impact of research findings. Our results show that previous methods are not sufficient rich or precise in their calculations. Our new package allows the user to be in control of the null estimation and step-up implementation when reporting FDRs. Cambridge University Press 2020-07-29 /pmc/articles/PMC8822935/ http://dx.doi.org/10.1017/cts.2020.164 Text en © The Association for Clinical and Translational Science 2020 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Data Science/Biostatistics/Informatics Hollister, Megan C Blume, Jeffrey D. 4497 Accessible False Discovery Rate Computation |
title | 4497 Accessible False Discovery Rate Computation |
title_full | 4497 Accessible False Discovery Rate Computation |
title_fullStr | 4497 Accessible False Discovery Rate Computation |
title_full_unstemmed | 4497 Accessible False Discovery Rate Computation |
title_short | 4497 Accessible False Discovery Rate Computation |
title_sort | 4497 accessible false discovery rate computation |
topic | Data Science/Biostatistics/Informatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8822935/ http://dx.doi.org/10.1017/cts.2020.164 |
work_keys_str_mv | AT hollistermeganc 4497accessiblefalsediscoveryratecomputation AT blumejeffreyd 4497accessiblefalsediscoveryratecomputation |