Cargando…

From co-expression to co-regulation: how many microarray experiments do we need?

BACKGROUND: Cluster analysis is often used to infer regulatory modules or biological function by associating unknown genes with other genes that have similar expression patterns and known regulatory elements or functions. However, clustering results may not have any biological relevance. RESULTS: We...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yeung, Ka Yee, Medvedovic, Mario, Bumgarner, Roger E
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2004
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC463312/ https://www.ncbi.nlm.nih.gov/pubmed/15239833 http://dx.doi.org/10.1186/gb-2004-5-7-r48

_version_	1782121607221739520
author	Yeung, Ka Yee Medvedovic, Mario Bumgarner, Roger E
author_facet	Yeung, Ka Yee Medvedovic, Mario Bumgarner, Roger E
author_sort	Yeung, Ka Yee
collection	PubMed
description	BACKGROUND: Cluster analysis is often used to infer regulatory modules or biological function by associating unknown genes with other genes that have similar expression patterns and known regulatory elements or functions. However, clustering results may not have any biological relevance. RESULTS: We applied various clustering algorithms to microarray datasets with different sizes, and we evaluated the clustering results by determining the fraction of gene pairs from the same clusters that share at least one known common transcription factor. We used both yeast transcription factor databases (SCPD, YPD) and chromatin immunoprecipitation (ChIP) data to evaluate our clustering results. We showed that the ability to identify co-regulated genes from clustering results is strongly dependent on the number of microarray experiments used in cluster analysis and the accuracy of these associations plateaus at between 50 and 100 experiments on yeast data. Moreover, the model-based clustering algorithm MCLUST consistently outperforms more traditional methods in accurately assigning co-regulated genes to the same clusters on standardized data. CONCLUSIONS: Our results are consistent with respect to independent evaluation criteria that strengthen our confidence in our results. However, when one compares ChIP data to YPD, the false-negative rate is approximately 80% using the recommended p-value of 0.001. In addition, we showed that even with large numbers of experiments, the false-positive rate may exceed the true-positive rate. In particular, even when all experiments are included, the best results produce clusters with only a 28% true-positive rate using known gene transcription factor interactions.
format	Text
id	pubmed-463312
institution	National Center for Biotechnology Information
language	English
publishDate	2004
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-4633122004-07-16 From co-expression to co-regulation: how many microarray experiments do we need? Yeung, Ka Yee Medvedovic, Mario Bumgarner, Roger E Genome Biol Research BACKGROUND: Cluster analysis is often used to infer regulatory modules or biological function by associating unknown genes with other genes that have similar expression patterns and known regulatory elements or functions. However, clustering results may not have any biological relevance. RESULTS: We applied various clustering algorithms to microarray datasets with different sizes, and we evaluated the clustering results by determining the fraction of gene pairs from the same clusters that share at least one known common transcription factor. We used both yeast transcription factor databases (SCPD, YPD) and chromatin immunoprecipitation (ChIP) data to evaluate our clustering results. We showed that the ability to identify co-regulated genes from clustering results is strongly dependent on the number of microarray experiments used in cluster analysis and the accuracy of these associations plateaus at between 50 and 100 experiments on yeast data. Moreover, the model-based clustering algorithm MCLUST consistently outperforms more traditional methods in accurately assigning co-regulated genes to the same clusters on standardized data. CONCLUSIONS: Our results are consistent with respect to independent evaluation criteria that strengthen our confidence in our results. However, when one compares ChIP data to YPD, the false-negative rate is approximately 80% using the recommended p-value of 0.001. In addition, we showed that even with large numbers of experiments, the false-positive rate may exceed the true-positive rate. In particular, even when all experiments are included, the best results produce clusters with only a 28% true-positive rate using known gene transcription factor interactions. BioMed Central 2004 2004-06-28 /pmc/articles/PMC463312/ /pubmed/15239833 http://dx.doi.org/10.1186/gb-2004-5-7-r48 Text en Copyright © 2004 Yeung et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle	Research Yeung, Ka Yee Medvedovic, Mario Bumgarner, Roger E From co-expression to co-regulation: how many microarray experiments do we need?
title	From co-expression to co-regulation: how many microarray experiments do we need?
title_full	From co-expression to co-regulation: how many microarray experiments do we need?
title_fullStr	From co-expression to co-regulation: how many microarray experiments do we need?
title_full_unstemmed	From co-expression to co-regulation: how many microarray experiments do we need?
title_short	From co-expression to co-regulation: how many microarray experiments do we need?
title_sort	from co-expression to co-regulation: how many microarray experiments do we need?
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC463312/ https://www.ncbi.nlm.nih.gov/pubmed/15239833 http://dx.doi.org/10.1186/gb-2004-5-7-r48
work_keys_str_mv	AT yeungkayee fromcoexpressiontocoregulationhowmanymicroarrayexperimentsdoweneed AT medvedovicmario fromcoexpressiontocoregulationhowmanymicroarrayexperimentsdoweneed AT bumgarnerrogere fromcoexpressiontocoregulationhowmanymicroarrayexperimentsdoweneed

From co-expression to co-regulation: how many microarray experiments do we need?

Ejemplares similares