Cargando…
Fewer permutations, more accurate P-values
Motivation: Permutation tests have become a standard tool to assess the statistical significance of an event under investigation. The statistical significance, as expressed in a P-value, is calculated as the fraction of permutation values that are at least as extreme as the original statistic, which...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2687965/ https://www.ncbi.nlm.nih.gov/pubmed/19477983 http://dx.doi.org/10.1093/bioinformatics/btp211 |
_version_ | 1782167631302754304 |
---|---|
author | Knijnenburg, Theo A. Wessels, Lodewyk F. A. Reinders, Marcel J. T. Shmulevich, Ilya |
author_facet | Knijnenburg, Theo A. Wessels, Lodewyk F. A. Reinders, Marcel J. T. Shmulevich, Ilya |
author_sort | Knijnenburg, Theo A. |
collection | PubMed |
description | Motivation: Permutation tests have become a standard tool to assess the statistical significance of an event under investigation. The statistical significance, as expressed in a P-value, is calculated as the fraction of permutation values that are at least as extreme as the original statistic, which was derived from non-permuted data. This empirical method directly couples both the minimal obtainable P-value and the resolution of the P-value to the number of permutations. Thereby, it imposes upon itself the need for a very large number of permutations when small P-values are to be accurately estimated. This is computationally expensive and often infeasible. Results: A method of computing P-values based on tail approximation is presented. The tail of the distribution of permutation values is approximated by a generalized Pareto distribution. A good fit and thus accurate P-value estimates can be obtained with a drastically reduced number of permutations when compared with the standard empirical way of computing P-values. Availability: The Matlab code can be obtained from the corresponding author on request. Contact: tknijnenburg@systemsbiology.org Supplementary information:Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-2687965 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-26879652009-06-02 Fewer permutations, more accurate P-values Knijnenburg, Theo A. Wessels, Lodewyk F. A. Reinders, Marcel J. T. Shmulevich, Ilya Bioinformatics Ismb/Eccb 2009 Conference Proceedings June 27 to July 2, 2009, Stockholm, Sweden Motivation: Permutation tests have become a standard tool to assess the statistical significance of an event under investigation. The statistical significance, as expressed in a P-value, is calculated as the fraction of permutation values that are at least as extreme as the original statistic, which was derived from non-permuted data. This empirical method directly couples both the minimal obtainable P-value and the resolution of the P-value to the number of permutations. Thereby, it imposes upon itself the need for a very large number of permutations when small P-values are to be accurately estimated. This is computationally expensive and often infeasible. Results: A method of computing P-values based on tail approximation is presented. The tail of the distribution of permutation values is approximated by a generalized Pareto distribution. A good fit and thus accurate P-value estimates can be obtained with a drastically reduced number of permutations when compared with the standard empirical way of computing P-values. Availability: The Matlab code can be obtained from the corresponding author on request. Contact: tknijnenburg@systemsbiology.org Supplementary information:Supplementary data are available at Bioinformatics online. Oxford University Press 2009-06-15 2009-05-27 /pmc/articles/PMC2687965/ /pubmed/19477983 http://dx.doi.org/10.1093/bioinformatics/btp211 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Ismb/Eccb 2009 Conference Proceedings June 27 to July 2, 2009, Stockholm, Sweden Knijnenburg, Theo A. Wessels, Lodewyk F. A. Reinders, Marcel J. T. Shmulevich, Ilya Fewer permutations, more accurate P-values |
title | Fewer permutations, more accurate P-values |
title_full | Fewer permutations, more accurate P-values |
title_fullStr | Fewer permutations, more accurate P-values |
title_full_unstemmed | Fewer permutations, more accurate P-values |
title_short | Fewer permutations, more accurate P-values |
title_sort | fewer permutations, more accurate p-values |
topic | Ismb/Eccb 2009 Conference Proceedings June 27 to July 2, 2009, Stockholm, Sweden |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2687965/ https://www.ncbi.nlm.nih.gov/pubmed/19477983 http://dx.doi.org/10.1093/bioinformatics/btp211 |
work_keys_str_mv | AT knijnenburgtheoa fewerpermutationsmoreaccuratepvalues AT wesselslodewykfa fewerpermutationsmoreaccuratepvalues AT reindersmarceljt fewerpermutationsmoreaccuratepvalues AT shmulevichilya fewerpermutationsmoreaccuratepvalues |