Cargando…

Ranking analysis of F-statistics for microarray data

BACKGROUND: Microarray technology provides an efficient means for globally exploring physiological processes governed by the coordinated expression of multiple genes. However, identification of genes differentially expressed in microarray experiments is challenging because of their potentially high...

Descripción completa

Detalles Bibliográficos
Autores principales: Tan, Yuan-De, Fornage, Myriam, Xu, Hongyan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2323973/
https://www.ncbi.nlm.nih.gov/pubmed/18325100
http://dx.doi.org/10.1186/1471-2105-9-142
_version_ 1782152698310688768
author Tan, Yuan-De
Fornage, Myriam
Xu, Hongyan
author_facet Tan, Yuan-De
Fornage, Myriam
Xu, Hongyan
author_sort Tan, Yuan-De
collection PubMed
description BACKGROUND: Microarray technology provides an efficient means for globally exploring physiological processes governed by the coordinated expression of multiple genes. However, identification of genes differentially expressed in microarray experiments is challenging because of their potentially high type I error rate. Methods for large-scale statistical analyses have been developed but most of them are applicable to two-sample or two-condition data. RESULTS: We developed a large-scale multiple-group F-test based method, named ranking analysis of F-statistics (RAF), which is an extension of ranking analysis of microarray data (RAM) for two-sample t-test. In this method, we proposed a novel random splitting approach to generate the null distribution instead of using permutation, which may not be appropriate for microarray data. We also implemented a two-simulation strategy to estimate the false discovery rate. Simulation results suggested that it has higher efficiency in finding differentially expressed genes among multiple classes at a lower false discovery rate than some commonly used methods. By applying our method to the experimental data, we found 107 genes having significantly differential expressions among 4 treatments at <0.7% FDR, of which 31 belong to the expressed sequence tags (ESTs), 76 are unique genes who have known functions in the brain or central nervous system and belong to six major functional groups. CONCLUSION: Our method is suitable to identify differentially expressed genes among multiple groups, in particular, when sample size is small.
format Text
id pubmed-2323973
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23239732008-04-22 Ranking analysis of F-statistics for microarray data Tan, Yuan-De Fornage, Myriam Xu, Hongyan BMC Bioinformatics Methodology Article BACKGROUND: Microarray technology provides an efficient means for globally exploring physiological processes governed by the coordinated expression of multiple genes. However, identification of genes differentially expressed in microarray experiments is challenging because of their potentially high type I error rate. Methods for large-scale statistical analyses have been developed but most of them are applicable to two-sample or two-condition data. RESULTS: We developed a large-scale multiple-group F-test based method, named ranking analysis of F-statistics (RAF), which is an extension of ranking analysis of microarray data (RAM) for two-sample t-test. In this method, we proposed a novel random splitting approach to generate the null distribution instead of using permutation, which may not be appropriate for microarray data. We also implemented a two-simulation strategy to estimate the false discovery rate. Simulation results suggested that it has higher efficiency in finding differentially expressed genes among multiple classes at a lower false discovery rate than some commonly used methods. By applying our method to the experimental data, we found 107 genes having significantly differential expressions among 4 treatments at <0.7% FDR, of which 31 belong to the expressed sequence tags (ESTs), 76 are unique genes who have known functions in the brain or central nervous system and belong to six major functional groups. CONCLUSION: Our method is suitable to identify differentially expressed genes among multiple groups, in particular, when sample size is small. BioMed Central 2008-03-06 /pmc/articles/PMC2323973/ /pubmed/18325100 http://dx.doi.org/10.1186/1471-2105-9-142 Text en Copyright © 2008 Tan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Tan, Yuan-De
Fornage, Myriam
Xu, Hongyan
Ranking analysis of F-statistics for microarray data
title Ranking analysis of F-statistics for microarray data
title_full Ranking analysis of F-statistics for microarray data
title_fullStr Ranking analysis of F-statistics for microarray data
title_full_unstemmed Ranking analysis of F-statistics for microarray data
title_short Ranking analysis of F-statistics for microarray data
title_sort ranking analysis of f-statistics for microarray data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2323973/
https://www.ncbi.nlm.nih.gov/pubmed/18325100
http://dx.doi.org/10.1186/1471-2105-9-142
work_keys_str_mv AT tanyuande rankinganalysisoffstatisticsformicroarraydata
AT fornagemyriam rankinganalysisoffstatisticsformicroarraydata
AT xuhongyan rankinganalysisoffstatisticsformicroarraydata