Cargando…

Determination of the minimum number of microarray experiments for discovery of gene expression patterns

BACKGROUND: One type of DNA microarray experiment is discovery of gene expression patterns for a cell line undergoing a biological process over a series of time points. Two important issues with such an experiment are the number of time points, and the interval between them. In the absence of biolog...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Fang-Xiang, Zhang, WJ, Kusalik, Anthony J
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1780120/
https://www.ncbi.nlm.nih.gov/pubmed/17217505
http://dx.doi.org/10.1186/1471-2105-7-S4-S13
_version_ 1782131848966569984
author Wu, Fang-Xiang
Zhang, WJ
Kusalik, Anthony J
author_facet Wu, Fang-Xiang
Zhang, WJ
Kusalik, Anthony J
author_sort Wu, Fang-Xiang
collection PubMed
description BACKGROUND: One type of DNA microarray experiment is discovery of gene expression patterns for a cell line undergoing a biological process over a series of time points. Two important issues with such an experiment are the number of time points, and the interval between them. In the absence of biological knowledge regarding appropriate values, it is natural to question whether the behaviour of progressively generated data may by itself determine a threshold beyond which further microarray experiments do not contribute to pattern discovery. Additionally, such a threshold implies a minimum number of microarray experiments, which is important given the cost of these experiments. RESULTS: We have developed a method for determining the minimum number of microarray experiments (i.e. time points) for temporal gene expression, assuming that the span between time points is given and the hierarchical clustering technique is used for gene expression pattern discovery. The key idea is a similarity measure for two clusterings which is expressed as a function of the data for progressive time points. While the experiments are underway, this function is evaluated. When the function reaches its maximum, it indicates the set of experiments reach a saturated state. Therefore, further experiments do not contribute to the discrimination of patterns. CONCLUSION: The method has been verified with two previously published gene expression datasets. For both experiments, the number of time points determined with our method is less than in the published experiments. It is noted that the overall approach is applicable to other clustering techniques.
format Text
id pubmed-1780120
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-17801202007-01-24 Determination of the minimum number of microarray experiments for discovery of gene expression patterns Wu, Fang-Xiang Zhang, WJ Kusalik, Anthony J BMC Bioinformatics Research BACKGROUND: One type of DNA microarray experiment is discovery of gene expression patterns for a cell line undergoing a biological process over a series of time points. Two important issues with such an experiment are the number of time points, and the interval between them. In the absence of biological knowledge regarding appropriate values, it is natural to question whether the behaviour of progressively generated data may by itself determine a threshold beyond which further microarray experiments do not contribute to pattern discovery. Additionally, such a threshold implies a minimum number of microarray experiments, which is important given the cost of these experiments. RESULTS: We have developed a method for determining the minimum number of microarray experiments (i.e. time points) for temporal gene expression, assuming that the span between time points is given and the hierarchical clustering technique is used for gene expression pattern discovery. The key idea is a similarity measure for two clusterings which is expressed as a function of the data for progressive time points. While the experiments are underway, this function is evaluated. When the function reaches its maximum, it indicates the set of experiments reach a saturated state. Therefore, further experiments do not contribute to the discrimination of patterns. CONCLUSION: The method has been verified with two previously published gene expression datasets. For both experiments, the number of time points determined with our method is less than in the published experiments. It is noted that the overall approach is applicable to other clustering techniques. BioMed Central 2006-12-12 /pmc/articles/PMC1780120/ /pubmed/17217505 http://dx.doi.org/10.1186/1471-2105-7-S4-S13 Text en Copyright © 2006 Wu et al; licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Wu, Fang-Xiang
Zhang, WJ
Kusalik, Anthony J
Determination of the minimum number of microarray experiments for discovery of gene expression patterns
title Determination of the minimum number of microarray experiments for discovery of gene expression patterns
title_full Determination of the minimum number of microarray experiments for discovery of gene expression patterns
title_fullStr Determination of the minimum number of microarray experiments for discovery of gene expression patterns
title_full_unstemmed Determination of the minimum number of microarray experiments for discovery of gene expression patterns
title_short Determination of the minimum number of microarray experiments for discovery of gene expression patterns
title_sort determination of the minimum number of microarray experiments for discovery of gene expression patterns
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1780120/
https://www.ncbi.nlm.nih.gov/pubmed/17217505
http://dx.doi.org/10.1186/1471-2105-7-S4-S13
work_keys_str_mv AT wufangxiang determinationoftheminimumnumberofmicroarrayexperimentsfordiscoveryofgeneexpressionpatterns
AT zhangwj determinationoftheminimumnumberofmicroarrayexperimentsfordiscoveryofgeneexpressionpatterns
AT kusalikanthonyj determinationoftheminimumnumberofmicroarrayexperimentsfordiscoveryofgeneexpressionpatterns