Cargando…

On correcting the overestimation of the permutation-based false discovery rate estimator

Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jiao, Shuo, Zhang, Shunpu
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2008
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638866/ https://www.ncbi.nlm.nih.gov/pubmed/18573796 http://dx.doi.org/10.1093/bioinformatics/btn310

_version_	1782164427047436288
author	Jiao, Shuo Zhang, Shunpu
author_facet	Jiao, Shuo Zhang, Shunpu
author_sort	Jiao, Shuo
collection	PubMed
description	Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accuracy of the FDR estimators will be important for correctly controlling FDR. Xie et al. found that the standard permutation method of estimating FDR is biased and proposed to delete the predicted differentially expressed (DE) genes in the estimation of FDR for one-sample comparison. However, we notice that the formula of the FDR used in their paper is incorrect. This makes the comparison results reported in their paper unconvincing. Other problems with their method include the biased estimation of FDR caused by over- or under-deletion of DE genes in the estimation of FDR and by the implicit use of an unreasonable estimator of the true proportion of equivalently expressed (EE) genes. Due to the great importance of accurate FDR estimation in microarray data analysis, it is necessary to point out such problems and propose improved methods. Results: Our results confirm that the standard permutation method overestimates the FDR. With the correct FDR formula, we show the method of Xie et al. always gives biased estimation of FDR: it overestimates when the number of claimed significant genes is small, and underestimates when the number of claimed significant genes is large. To overcome these problems, we propose two modifications. The simulation results show that our estimator gives more accurate estimation. Contact: szhang3@unl.edu
format	Text
id	pubmed-2638866
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-26388662009-02-25 On correcting the overestimation of the permutation-based false discovery rate estimator Jiao, Shuo Zhang, Shunpu Bioinformatics Original Papers Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accuracy of the FDR estimators will be important for correctly controlling FDR. Xie et al. found that the standard permutation method of estimating FDR is biased and proposed to delete the predicted differentially expressed (DE) genes in the estimation of FDR for one-sample comparison. However, we notice that the formula of the FDR used in their paper is incorrect. This makes the comparison results reported in their paper unconvincing. Other problems with their method include the biased estimation of FDR caused by over- or under-deletion of DE genes in the estimation of FDR and by the implicit use of an unreasonable estimator of the true proportion of equivalently expressed (EE) genes. Due to the great importance of accurate FDR estimation in microarray data analysis, it is necessary to point out such problems and propose improved methods. Results: Our results confirm that the standard permutation method overestimates the FDR. With the correct FDR formula, we show the method of Xie et al. always gives biased estimation of FDR: it overestimates when the number of claimed significant genes is small, and underestimates when the number of claimed significant genes is large. To overcome these problems, we propose two modifications. The simulation results show that our estimator gives more accurate estimation. Contact: szhang3@unl.edu Oxford University Press 2008-08-01 2008-06-23 /pmc/articles/PMC2638866/ /pubmed/18573796 http://dx.doi.org/10.1093/bioinformatics/btn310 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Papers Jiao, Shuo Zhang, Shunpu On correcting the overestimation of the permutation-based false discovery rate estimator
title	On correcting the overestimation of the permutation-based false discovery rate estimator
title_full	On correcting the overestimation of the permutation-based false discovery rate estimator
title_fullStr	On correcting the overestimation of the permutation-based false discovery rate estimator
title_full_unstemmed	On correcting the overestimation of the permutation-based false discovery rate estimator
title_short	On correcting the overestimation of the permutation-based false discovery rate estimator
title_sort	on correcting the overestimation of the permutation-based false discovery rate estimator
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638866/ https://www.ncbi.nlm.nih.gov/pubmed/18573796 http://dx.doi.org/10.1093/bioinformatics/btn310
work_keys_str_mv	AT jiaoshuo oncorrectingtheoverestimationofthepermutationbasedfalsediscoveryrateestimator AT zhangshunpu oncorrectingtheoverestimationofthepermutationbasedfalsediscoveryrateestimator

On correcting the overestimation of the permutation-based false discovery rate estimator

Ejemplares similares