Cargando…

Handling multiple testing while interpreting microarrays with the Gene Ontology Database

BACKGROUND: The development of software tools that analyze microarray data in the context of genetic knowledgebases is being pursued by multiple research groups using different methods. A common problem for many of these tools is how to correct for multiple statistical testing since simple correctio...

Descripción completa

Detalles Bibliográficos
Autores principales: Osier, Michael V, Zhao, Hongyu, Cheung, Kei-Hoi
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC518975/
https://www.ncbi.nlm.nih.gov/pubmed/15350198
http://dx.doi.org/10.1186/1471-2105-5-124
Descripción
Sumario:BACKGROUND: The development of software tools that analyze microarray data in the context of genetic knowledgebases is being pursued by multiple research groups using different methods. A common problem for many of these tools is how to correct for multiple statistical testing since simple corrections are overly conservative and more sophisticated corrections are currently impractical. A careful study of the nature of the distribution one would expect by chance, such as by a simulation study, may be able to guide the development of an appropriate correction that is not overly time consuming computationally. RESULTS: We present the results from a preliminary study of the distribution one would expect for analyzing sets of genes extracted from Drosophila, S. cerevisiae, Wormbase, and Gramene databases using the Gene Ontology Database. CONCLUSIONS: We found that the estimated distribution is not regular and is not predictable outside of a particular set of genes. Permutation-based simulations may be necessary to determine the confidence in results of such analyses.