Cargando…

Finding gene clusters for a replicated time course study

BACKGROUND: Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the obser...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Li-Xuan, Breeden, Linda, Self, Steven G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906880/
https://www.ncbi.nlm.nih.gov/pubmed/24460656
http://dx.doi.org/10.1186/1756-0500-7-60
Descripción
Sumario:BACKGROUND: Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the observed measurements and do not take into account the specific experimental design under which the microarray data were collected. A new model-based clustering method, the clustering of regression models method, takes into account the specific design of the microarray study and bases the clustering on how genes are related to sample covariates. It can find useful gene clusters for studies from complicated study designs such as replicated time course studies. FINDINGS: In this paper, we applied the clustering of regression models method to data from a time course study of yeast on two genotypes, wild type and YOX1 mutant, each with two technical replicates, and compared the clustering results with K-means clustering. We identified gene clusters that have similar expression patterns in wild type yeast, two of which were missed by K-means clustering. We further identified gene clusters whose expression patterns were changed in YOX1 mutant yeast compared to wild type yeast. CONCLUSIONS: The clustering of regression models method can be a valuable tool for identifying genes that are coordinately transcribed by a common mechanism.