Cargando…

Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data

BACKGROUND: The evaluation of statistical significance has become a critical process in identifying differentially expressed genes in microarray studies. Classical p-value adjustment methods for multiple comparisons such as family-wise error rate (FWER) have been found to be too conservative in anal...

Descripción completa

Detalles Bibliográficos
Autores principales: Jain, Nitin, Cho, HyungJun, O'Connell, Michael, Lee, Jae K
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1187876/
https://www.ncbi.nlm.nih.gov/pubmed/16042779
http://dx.doi.org/10.1186/1471-2105-6-187
_version_ 1782124757002485760
author Jain, Nitin
Cho, HyungJun
O'Connell, Michael
Lee, Jae K
author_facet Jain, Nitin
Cho, HyungJun
O'Connell, Michael
Lee, Jae K
author_sort Jain, Nitin
collection PubMed
description BACKGROUND: The evaluation of statistical significance has become a critical process in identifying differentially expressed genes in microarray studies. Classical p-value adjustment methods for multiple comparisons such as family-wise error rate (FWER) have been found to be too conservative in analyzing large-screening microarray data, and the False Discovery Rate (FDR), the expected proportion of false positives among all positives, has been recently suggested as an alternative for controlling false positives. Several statistical approaches have been used to estimate and control FDR, but these may not provide reliable FDR estimation when applied to microarray data sets with a small number of replicates. RESULTS: We propose a rank-invariant resampling (RIR) based approach to FDR evaluation. Our proposed method generates a biologically relevant null distribution, which maintains similar variability to observed microarray data. We compare the performance of our RIR-based FDR estimation with that of four other popular methods. Our approach outperforms the other methods both in simulated and real microarray data. CONCLUSION: We found that the SAM's random shuffling and SPLOSH approaches were liberal and the other two theoretical methods were too conservative while our RIR approach provided more accurate FDR estimation than the other approaches.
format Text
id pubmed-1187876
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11878762005-08-18 Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data Jain, Nitin Cho, HyungJun O'Connell, Michael Lee, Jae K BMC Bioinformatics Methodology Article BACKGROUND: The evaluation of statistical significance has become a critical process in identifying differentially expressed genes in microarray studies. Classical p-value adjustment methods for multiple comparisons such as family-wise error rate (FWER) have been found to be too conservative in analyzing large-screening microarray data, and the False Discovery Rate (FDR), the expected proportion of false positives among all positives, has been recently suggested as an alternative for controlling false positives. Several statistical approaches have been used to estimate and control FDR, but these may not provide reliable FDR estimation when applied to microarray data sets with a small number of replicates. RESULTS: We propose a rank-invariant resampling (RIR) based approach to FDR evaluation. Our proposed method generates a biologically relevant null distribution, which maintains similar variability to observed microarray data. We compare the performance of our RIR-based FDR estimation with that of four other popular methods. Our approach outperforms the other methods both in simulated and real microarray data. CONCLUSION: We found that the SAM's random shuffling and SPLOSH approaches were liberal and the other two theoretical methods were too conservative while our RIR approach provided more accurate FDR estimation than the other approaches. BioMed Central 2005-07-22 /pmc/articles/PMC1187876/ /pubmed/16042779 http://dx.doi.org/10.1186/1471-2105-6-187 Text en Copyright © 2005 Jain et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Jain, Nitin
Cho, HyungJun
O'Connell, Michael
Lee, Jae K
Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data
title Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data
title_full Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data
title_fullStr Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data
title_full_unstemmed Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data
title_short Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data
title_sort rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1187876/
https://www.ncbi.nlm.nih.gov/pubmed/16042779
http://dx.doi.org/10.1186/1471-2105-6-187
work_keys_str_mv AT jainnitin rankinvariantresamplingbasedestimationoffalsediscoveryrateforanalysisofsmallsamplemicroarraydata
AT chohyungjun rankinvariantresamplingbasedestimationoffalsediscoveryrateforanalysisofsmallsamplemicroarraydata
AT oconnellmichael rankinvariantresamplingbasedestimationoffalsediscoveryrateforanalysisofsmallsamplemicroarraydata
AT leejaek rankinvariantresamplingbasedestimationoffalsediscoveryrateforanalysisofsmallsamplemicroarraydata