Cargando…
On correcting the overestimation of the permutation-based false discovery rate estimator
Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accu...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638866/ https://www.ncbi.nlm.nih.gov/pubmed/18573796 http://dx.doi.org/10.1093/bioinformatics/btn310 |
_version_ | 1782164427047436288 |
---|---|
author | Jiao, Shuo Zhang, Shunpu |
author_facet | Jiao, Shuo Zhang, Shunpu |
author_sort | Jiao, Shuo |
collection | PubMed |
description | Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accuracy of the FDR estimators will be important for correctly controlling FDR. Xie et al. found that the standard permutation method of estimating FDR is biased and proposed to delete the predicted differentially expressed (DE) genes in the estimation of FDR for one-sample comparison. However, we notice that the formula of the FDR used in their paper is incorrect. This makes the comparison results reported in their paper unconvincing. Other problems with their method include the biased estimation of FDR caused by over- or under-deletion of DE genes in the estimation of FDR and by the implicit use of an unreasonable estimator of the true proportion of equivalently expressed (EE) genes. Due to the great importance of accurate FDR estimation in microarray data analysis, it is necessary to point out such problems and propose improved methods. Results: Our results confirm that the standard permutation method overestimates the FDR. With the correct FDR formula, we show the method of Xie et al. always gives biased estimation of FDR: it overestimates when the number of claimed significant genes is small, and underestimates when the number of claimed significant genes is large. To overcome these problems, we propose two modifications. The simulation results show that our estimator gives more accurate estimation. Contact: szhang3@unl.edu |
format | Text |
id | pubmed-2638866 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-26388662009-02-25 On correcting the overestimation of the permutation-based false discovery rate estimator Jiao, Shuo Zhang, Shunpu Bioinformatics Original Papers Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accuracy of the FDR estimators will be important for correctly controlling FDR. Xie et al. found that the standard permutation method of estimating FDR is biased and proposed to delete the predicted differentially expressed (DE) genes in the estimation of FDR for one-sample comparison. However, we notice that the formula of the FDR used in their paper is incorrect. This makes the comparison results reported in their paper unconvincing. Other problems with their method include the biased estimation of FDR caused by over- or under-deletion of DE genes in the estimation of FDR and by the implicit use of an unreasonable estimator of the true proportion of equivalently expressed (EE) genes. Due to the great importance of accurate FDR estimation in microarray data analysis, it is necessary to point out such problems and propose improved methods. Results: Our results confirm that the standard permutation method overestimates the FDR. With the correct FDR formula, we show the method of Xie et al. always gives biased estimation of FDR: it overestimates when the number of claimed significant genes is small, and underestimates when the number of claimed significant genes is large. To overcome these problems, we propose two modifications. The simulation results show that our estimator gives more accurate estimation. Contact: szhang3@unl.edu Oxford University Press 2008-08-01 2008-06-23 /pmc/articles/PMC2638866/ /pubmed/18573796 http://dx.doi.org/10.1093/bioinformatics/btn310 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Jiao, Shuo Zhang, Shunpu On correcting the overestimation of the permutation-based false discovery rate estimator |
title | On correcting the overestimation of the permutation-based false discovery rate estimator |
title_full | On correcting the overestimation of the permutation-based false discovery rate estimator |
title_fullStr | On correcting the overestimation of the permutation-based false discovery rate estimator |
title_full_unstemmed | On correcting the overestimation of the permutation-based false discovery rate estimator |
title_short | On correcting the overestimation of the permutation-based false discovery rate estimator |
title_sort | on correcting the overestimation of the permutation-based false discovery rate estimator |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638866/ https://www.ncbi.nlm.nih.gov/pubmed/18573796 http://dx.doi.org/10.1093/bioinformatics/btn310 |
work_keys_str_mv | AT jiaoshuo oncorrectingtheoverestimationofthepermutationbasedfalsediscoveryrateestimator AT zhangshunpu oncorrectingtheoverestimationofthepermutationbasedfalsediscoveryrateestimator |