Cargando…

Fewer permutations, more accurate P-values

Motivation: Permutation tests have become a standard tool to assess the statistical significance of an event under investigation. The statistical significance, as expressed in a P-value, is calculated as the fraction of permutation values that are at least as extreme as the original statistic, which...

Descripción completa

Detalles Bibliográficos
Autores principales:	Knijnenburg, Theo A., Wessels, Lodewyk F. A., Reinders, Marcel J. T., Shmulevich, Ilya
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2009
Materias:	Ismb/Eccb 2009 Conference Proceedings June 27 to July 2, 2009, Stockholm, Sweden
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2687965/ https://www.ncbi.nlm.nih.gov/pubmed/19477983 http://dx.doi.org/10.1093/bioinformatics/btp211

Descripción
Sumario:	Motivation: Permutation tests have become a standard tool to assess the statistical significance of an event under investigation. The statistical significance, as expressed in a P-value, is calculated as the fraction of permutation values that are at least as extreme as the original statistic, which was derived from non-permuted data. This empirical method directly couples both the minimal obtainable P-value and the resolution of the P-value to the number of permutations. Thereby, it imposes upon itself the need for a very large number of permutations when small P-values are to be accurately estimated. This is computationally expensive and often infeasible. Results: A method of computing P-values based on tail approximation is presented. The tail of the distribution of permutation values is approximated by a generalized Pareto distribution. A good fit and thus accurate P-value estimates can be obtained with a drastically reduced number of permutations when compared with the standard empirical way of computing P-values. Availability: The Matlab code can be obtained from the corresponding author on request. Contact: tknijnenburg@systemsbiology.org Supplementary information:Supplementary data are available at Bioinformatics online.

Fewer permutations, more accurate P-values

Ejemplares similares