Cargando…

An improved procedure for gene selection from microarray experiments using false discovery rate criterion

BACKGROUND: A large number of genes usually show differential expressions in a microarray experiment with two types of tissues, and the p-values of a proper statistical test are often used to quantify the significance of these differences. The genes with small p-values are then picked as the genes r...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, James J, Yang, Mark CK
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1388243/
https://www.ncbi.nlm.nih.gov/pubmed/16405735
http://dx.doi.org/10.1186/1471-2105-7-15
_version_ 1782126909004447744
author Yang, James J
Yang, Mark CK
author_facet Yang, James J
Yang, Mark CK
author_sort Yang, James J
collection PubMed
description BACKGROUND: A large number of genes usually show differential expressions in a microarray experiment with two types of tissues, and the p-values of a proper statistical test are often used to quantify the significance of these differences. The genes with small p-values are then picked as the genes responsible for the differences in the tissue RNA expressions. One key question is what should be the threshold to consider the p-values small. There is always a trade off between this threshold and the rate of false claims. Recent statistical literature shows that the false discovery rate (FDR) criterion is a powerful and reasonable criterion to pick those genes with differential expression. Moreover, the power of detection can be increased by knowing the number of non-differential expression genes. While this number is unknown in practice, there are methods to estimate it from data. The purpose of this paper is to present a new method of estimating this number and use it for the FDR procedure construction. RESULTS: A combination of test functions is used to estimate the number of differentially expressed genes. Simulation study shows that the proposed method has a higher power to detect these genes than other existing methods, while still keeping the FDR under control. The improvement can be substantial if the proportion of true differentially expressed genes is large. This procedure has also been tested with good results using a real dataset. CONCLUSION: For a given expected FDR, the method proposed in this paper has better power to pick genes that show differentiation in their expression than two other well known methods.
format Text
id pubmed-1388243
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-13882432006-04-21 An improved procedure for gene selection from microarray experiments using false discovery rate criterion Yang, James J Yang, Mark CK BMC Bioinformatics Research Article BACKGROUND: A large number of genes usually show differential expressions in a microarray experiment with two types of tissues, and the p-values of a proper statistical test are often used to quantify the significance of these differences. The genes with small p-values are then picked as the genes responsible for the differences in the tissue RNA expressions. One key question is what should be the threshold to consider the p-values small. There is always a trade off between this threshold and the rate of false claims. Recent statistical literature shows that the false discovery rate (FDR) criterion is a powerful and reasonable criterion to pick those genes with differential expression. Moreover, the power of detection can be increased by knowing the number of non-differential expression genes. While this number is unknown in practice, there are methods to estimate it from data. The purpose of this paper is to present a new method of estimating this number and use it for the FDR procedure construction. RESULTS: A combination of test functions is used to estimate the number of differentially expressed genes. Simulation study shows that the proposed method has a higher power to detect these genes than other existing methods, while still keeping the FDR under control. The improvement can be substantial if the proportion of true differentially expressed genes is large. This procedure has also been tested with good results using a real dataset. CONCLUSION: For a given expected FDR, the method proposed in this paper has better power to pick genes that show differentiation in their expression than two other well known methods. BioMed Central 2006-01-11 /pmc/articles/PMC1388243/ /pubmed/16405735 http://dx.doi.org/10.1186/1471-2105-7-15 Text en Copyright © 2006 Yang and Yang; licensee BioMed Central Ltd.
spellingShingle Research Article
Yang, James J
Yang, Mark CK
An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_full An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_fullStr An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_full_unstemmed An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_short An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_sort improved procedure for gene selection from microarray experiments using false discovery rate criterion
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1388243/
https://www.ncbi.nlm.nih.gov/pubmed/16405735
http://dx.doi.org/10.1186/1471-2105-7-15
work_keys_str_mv AT yangjamesj animprovedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion
AT yangmarkck animprovedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion
AT yangjamesj improvedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion
AT yangmarkck improvedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion