Cargando…

On correcting the overestimation of the permutation-based false discovery rate estimator

Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accu...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiao, Shuo, Zhang, Shunpu
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638866/
https://www.ncbi.nlm.nih.gov/pubmed/18573796
http://dx.doi.org/10.1093/bioinformatics/btn310
_version_ 1782164427047436288
author Jiao, Shuo
Zhang, Shunpu
author_facet Jiao, Shuo
Zhang, Shunpu
author_sort Jiao, Shuo
collection PubMed
description Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accuracy of the FDR estimators will be important for correctly controlling FDR. Xie et al. found that the standard permutation method of estimating FDR is biased and proposed to delete the predicted differentially expressed (DE) genes in the estimation of FDR for one-sample comparison. However, we notice that the formula of the FDR used in their paper is incorrect. This makes the comparison results reported in their paper unconvincing. Other problems with their method include the biased estimation of FDR caused by over- or under-deletion of DE genes in the estimation of FDR and by the implicit use of an unreasonable estimator of the true proportion of equivalently expressed (EE) genes. Due to the great importance of accurate FDR estimation in microarray data analysis, it is necessary to point out such problems and propose improved methods. Results: Our results confirm that the standard permutation method overestimates the FDR. With the correct FDR formula, we show the method of Xie et al. always gives biased estimation of FDR: it overestimates when the number of claimed significant genes is small, and underestimates when the number of claimed significant genes is large. To overcome these problems, we propose two modifications. The simulation results show that our estimator gives more accurate estimation. Contact: szhang3@unl.edu
format Text
id pubmed-2638866
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26388662009-02-25 On correcting the overestimation of the permutation-based false discovery rate estimator Jiao, Shuo Zhang, Shunpu Bioinformatics Original Papers Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accuracy of the FDR estimators will be important for correctly controlling FDR. Xie et al. found that the standard permutation method of estimating FDR is biased and proposed to delete the predicted differentially expressed (DE) genes in the estimation of FDR for one-sample comparison. However, we notice that the formula of the FDR used in their paper is incorrect. This makes the comparison results reported in their paper unconvincing. Other problems with their method include the biased estimation of FDR caused by over- or under-deletion of DE genes in the estimation of FDR and by the implicit use of an unreasonable estimator of the true proportion of equivalently expressed (EE) genes. Due to the great importance of accurate FDR estimation in microarray data analysis, it is necessary to point out such problems and propose improved methods. Results: Our results confirm that the standard permutation method overestimates the FDR. With the correct FDR formula, we show the method of Xie et al. always gives biased estimation of FDR: it overestimates when the number of claimed significant genes is small, and underestimates when the number of claimed significant genes is large. To overcome these problems, we propose two modifications. The simulation results show that our estimator gives more accurate estimation. Contact: szhang3@unl.edu Oxford University Press 2008-08-01 2008-06-23 /pmc/articles/PMC2638866/ /pubmed/18573796 http://dx.doi.org/10.1093/bioinformatics/btn310 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Jiao, Shuo
Zhang, Shunpu
On correcting the overestimation of the permutation-based false discovery rate estimator
title On correcting the overestimation of the permutation-based false discovery rate estimator
title_full On correcting the overestimation of the permutation-based false discovery rate estimator
title_fullStr On correcting the overestimation of the permutation-based false discovery rate estimator
title_full_unstemmed On correcting the overestimation of the permutation-based false discovery rate estimator
title_short On correcting the overestimation of the permutation-based false discovery rate estimator
title_sort on correcting the overestimation of the permutation-based false discovery rate estimator
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638866/
https://www.ncbi.nlm.nih.gov/pubmed/18573796
http://dx.doi.org/10.1093/bioinformatics/btn310
work_keys_str_mv AT jiaoshuo oncorrectingtheoverestimationofthepermutationbasedfalsediscoveryrateestimator
AT zhangshunpu oncorrectingtheoverestimationofthepermutationbasedfalsediscoveryrateestimator