Cargando…
An improved procedure for gene selection from microarray experiments using false discovery rate criterion
BACKGROUND: A large number of genes usually show differential expressions in a microarray experiment with two types of tissues, and the p-values of a proper statistical test are often used to quantify the significance of these differences. The genes with small p-values are then picked as the genes r...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1388243/ https://www.ncbi.nlm.nih.gov/pubmed/16405735 http://dx.doi.org/10.1186/1471-2105-7-15 |
_version_ | 1782126909004447744 |
---|---|
author | Yang, James J Yang, Mark CK |
author_facet | Yang, James J Yang, Mark CK |
author_sort | Yang, James J |
collection | PubMed |
description | BACKGROUND: A large number of genes usually show differential expressions in a microarray experiment with two types of tissues, and the p-values of a proper statistical test are often used to quantify the significance of these differences. The genes with small p-values are then picked as the genes responsible for the differences in the tissue RNA expressions. One key question is what should be the threshold to consider the p-values small. There is always a trade off between this threshold and the rate of false claims. Recent statistical literature shows that the false discovery rate (FDR) criterion is a powerful and reasonable criterion to pick those genes with differential expression. Moreover, the power of detection can be increased by knowing the number of non-differential expression genes. While this number is unknown in practice, there are methods to estimate it from data. The purpose of this paper is to present a new method of estimating this number and use it for the FDR procedure construction. RESULTS: A combination of test functions is used to estimate the number of differentially expressed genes. Simulation study shows that the proposed method has a higher power to detect these genes than other existing methods, while still keeping the FDR under control. The improvement can be substantial if the proportion of true differentially expressed genes is large. This procedure has also been tested with good results using a real dataset. CONCLUSION: For a given expected FDR, the method proposed in this paper has better power to pick genes that show differentiation in their expression than two other well known methods. |
format | Text |
id | pubmed-1388243 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-13882432006-04-21 An improved procedure for gene selection from microarray experiments using false discovery rate criterion Yang, James J Yang, Mark CK BMC Bioinformatics Research Article BACKGROUND: A large number of genes usually show differential expressions in a microarray experiment with two types of tissues, and the p-values of a proper statistical test are often used to quantify the significance of these differences. The genes with small p-values are then picked as the genes responsible for the differences in the tissue RNA expressions. One key question is what should be the threshold to consider the p-values small. There is always a trade off between this threshold and the rate of false claims. Recent statistical literature shows that the false discovery rate (FDR) criterion is a powerful and reasonable criterion to pick those genes with differential expression. Moreover, the power of detection can be increased by knowing the number of non-differential expression genes. While this number is unknown in practice, there are methods to estimate it from data. The purpose of this paper is to present a new method of estimating this number and use it for the FDR procedure construction. RESULTS: A combination of test functions is used to estimate the number of differentially expressed genes. Simulation study shows that the proposed method has a higher power to detect these genes than other existing methods, while still keeping the FDR under control. The improvement can be substantial if the proportion of true differentially expressed genes is large. This procedure has also been tested with good results using a real dataset. CONCLUSION: For a given expected FDR, the method proposed in this paper has better power to pick genes that show differentiation in their expression than two other well known methods. BioMed Central 2006-01-11 /pmc/articles/PMC1388243/ /pubmed/16405735 http://dx.doi.org/10.1186/1471-2105-7-15 Text en Copyright © 2006 Yang and Yang; licensee BioMed Central Ltd. |
spellingShingle | Research Article Yang, James J Yang, Mark CK An improved procedure for gene selection from microarray experiments using false discovery rate criterion |
title | An improved procedure for gene selection from microarray experiments using false discovery rate criterion |
title_full | An improved procedure for gene selection from microarray experiments using false discovery rate criterion |
title_fullStr | An improved procedure for gene selection from microarray experiments using false discovery rate criterion |
title_full_unstemmed | An improved procedure for gene selection from microarray experiments using false discovery rate criterion |
title_short | An improved procedure for gene selection from microarray experiments using false discovery rate criterion |
title_sort | improved procedure for gene selection from microarray experiments using false discovery rate criterion |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1388243/ https://www.ncbi.nlm.nih.gov/pubmed/16405735 http://dx.doi.org/10.1186/1471-2105-7-15 |
work_keys_str_mv | AT yangjamesj animprovedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion AT yangmarkck animprovedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion AT yangjamesj improvedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion AT yangmarkck improvedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion |