Cargando…

Finding gene clusters for a replicated time course study

BACKGROUND: Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the obser...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Li-Xuan, Breeden, Linda, Self, Steven G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906880/
https://www.ncbi.nlm.nih.gov/pubmed/24460656
http://dx.doi.org/10.1186/1756-0500-7-60
_version_ 1782301534550228992
author Qin, Li-Xuan
Breeden, Linda
Self, Steven G
author_facet Qin, Li-Xuan
Breeden, Linda
Self, Steven G
author_sort Qin, Li-Xuan
collection PubMed
description BACKGROUND: Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the observed measurements and do not take into account the specific experimental design under which the microarray data were collected. A new model-based clustering method, the clustering of regression models method, takes into account the specific design of the microarray study and bases the clustering on how genes are related to sample covariates. It can find useful gene clusters for studies from complicated study designs such as replicated time course studies. FINDINGS: In this paper, we applied the clustering of regression models method to data from a time course study of yeast on two genotypes, wild type and YOX1 mutant, each with two technical replicates, and compared the clustering results with K-means clustering. We identified gene clusters that have similar expression patterns in wild type yeast, two of which were missed by K-means clustering. We further identified gene clusters whose expression patterns were changed in YOX1 mutant yeast compared to wild type yeast. CONCLUSIONS: The clustering of regression models method can be a valuable tool for identifying genes that are coordinately transcribed by a common mechanism.
format Online
Article
Text
id pubmed-3906880
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39068802014-02-12 Finding gene clusters for a replicated time course study Qin, Li-Xuan Breeden, Linda Self, Steven G BMC Res Notes Short Report BACKGROUND: Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the observed measurements and do not take into account the specific experimental design under which the microarray data were collected. A new model-based clustering method, the clustering of regression models method, takes into account the specific design of the microarray study and bases the clustering on how genes are related to sample covariates. It can find useful gene clusters for studies from complicated study designs such as replicated time course studies. FINDINGS: In this paper, we applied the clustering of regression models method to data from a time course study of yeast on two genotypes, wild type and YOX1 mutant, each with two technical replicates, and compared the clustering results with K-means clustering. We identified gene clusters that have similar expression patterns in wild type yeast, two of which were missed by K-means clustering. We further identified gene clusters whose expression patterns were changed in YOX1 mutant yeast compared to wild type yeast. CONCLUSIONS: The clustering of regression models method can be a valuable tool for identifying genes that are coordinately transcribed by a common mechanism. BioMed Central 2014-01-24 /pmc/articles/PMC3906880/ /pubmed/24460656 http://dx.doi.org/10.1186/1756-0500-7-60 Text en Copyright © 2014 Qin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Short Report
Qin, Li-Xuan
Breeden, Linda
Self, Steven G
Finding gene clusters for a replicated time course study
title Finding gene clusters for a replicated time course study
title_full Finding gene clusters for a replicated time course study
title_fullStr Finding gene clusters for a replicated time course study
title_full_unstemmed Finding gene clusters for a replicated time course study
title_short Finding gene clusters for a replicated time course study
title_sort finding gene clusters for a replicated time course study
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906880/
https://www.ncbi.nlm.nih.gov/pubmed/24460656
http://dx.doi.org/10.1186/1756-0500-7-60
work_keys_str_mv AT qinlixuan findinggeneclustersforareplicatedtimecoursestudy
AT breedenlinda findinggeneclustersforareplicatedtimecoursestudy
AT selfsteveng findinggeneclustersforareplicatedtimecoursestudy