Cargando…

Generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis

The test statistics underpinning several methods for combining p-values are special cases of generalized mean p-value (GMP), including the minimum (Bonferroni procedure), harmonic mean and geometric mean. A key assumption influencing the practical performance of such methods concerns the dependence...

Descripción completa

Detalles Bibliográficos
Autor principal:	Wilson, Daniel J.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	F1000 Research Limited 2020
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7836035/ https://www.ncbi.nlm.nih.gov/pubmed/33537458 http://dx.doi.org/10.12688/wellcomeopenres.15761.1

_version_	1783642664710701056
author	Wilson, Daniel J.
author_facet	Wilson, Daniel J.
author_sort	Wilson, Daniel J.
collection	PubMed
description	The test statistics underpinning several methods for combining p-values are special cases of generalized mean p-value (GMP), including the minimum (Bonferroni procedure), harmonic mean and geometric mean. A key assumption influencing the practical performance of such methods concerns the dependence between p-values. Approaches that do not require specific knowledge of the dependence structure are practically convenient. Vovk and Wang derived significance thresholds for GMPs under the worst-case scenario of arbitrary dependence using results from Robust Risk Analysis (RRA). Here I calculate significance thresholds and closed testing procedures using Generalized Central Limit Theorem (GCLT). GCLT formally assumes independence, but enjoys a degree of robustness to dependence. The GCLT thresholds are less stringent than RRA thresholds, with the disparity increasing as the exponent of the GMP ( r) increases. I motivate a model of p-value dependence based on a Wishart-Multivariate-Gamma distribution for the underlying log-likelihood ratios. In simulations under this model, the RRA thresholds produced tests that were usually less powerful than Bonferroni, while the GCLT thresholds produced tests more powerful than Bonferroni, for all r> − ∞. Above r> − 1, the GCLT thresholds suffered pronounced false positive rates. Above r> − 1/2, standard central limit theorem applied and the GCLT thresholds no longer possessed any useful robustness to dependence. I consider the implications of these results in the context of various interpretations of GMPs, and conclude that the GCLT-based harmonic mean p-value procedure and Simes' (1986) test represent good compromises in power-robustness trade-off for combining dependent tests.
format	Online Article Text
id	pubmed-7836035
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	F1000 Research Limited
record_format	MEDLINE/PubMed
spelling	pubmed-78360352021-02-02 Generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis Wilson, Daniel J. Wellcome Open Res Research Article The test statistics underpinning several methods for combining p-values are special cases of generalized mean p-value (GMP), including the minimum (Bonferroni procedure), harmonic mean and geometric mean. A key assumption influencing the practical performance of such methods concerns the dependence between p-values. Approaches that do not require specific knowledge of the dependence structure are practically convenient. Vovk and Wang derived significance thresholds for GMPs under the worst-case scenario of arbitrary dependence using results from Robust Risk Analysis (RRA). Here I calculate significance thresholds and closed testing procedures using Generalized Central Limit Theorem (GCLT). GCLT formally assumes independence, but enjoys a degree of robustness to dependence. The GCLT thresholds are less stringent than RRA thresholds, with the disparity increasing as the exponent of the GMP ( r) increases. I motivate a model of p-value dependence based on a Wishart-Multivariate-Gamma distribution for the underlying log-likelihood ratios. In simulations under this model, the RRA thresholds produced tests that were usually less powerful than Bonferroni, while the GCLT thresholds produced tests more powerful than Bonferroni, for all r> − ∞. Above r> − 1, the GCLT thresholds suffered pronounced false positive rates. Above r> − 1/2, standard central limit theorem applied and the GCLT thresholds no longer possessed any useful robustness to dependence. I consider the implications of these results in the context of various interpretations of GMPs, and conclude that the GCLT-based harmonic mean p-value procedure and Simes' (1986) test represent good compromises in power-robustness trade-off for combining dependent tests. F1000 Research Limited 2020-03-31 /pmc/articles/PMC7836035/ /pubmed/33537458 http://dx.doi.org/10.12688/wellcomeopenres.15761.1 Text en Copyright: © 2020 Wilson DJ http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Wilson, Daniel J. Generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis
title	Generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis
title_full	Generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis
title_fullStr	Generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis
title_full_unstemmed	Generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis
title_short	Generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis
title_sort	generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7836035/ https://www.ncbi.nlm.nih.gov/pubmed/33537458 http://dx.doi.org/10.12688/wellcomeopenres.15761.1
work_keys_str_mv	AT wilsondanielj generalizedmeanpvaluesforcombiningdependenttestscomparisonofgeneralizedcentrallimittheoremandrobustriskanalysis

Generalized mean p-values for combining dependent tests: comparison of generalized central limit theorem and robust risk analysis

Ejemplares similares