Cargando…

Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures

BACKGROUND: DNA microarray technology allows for the measurement of genome-wide expression patterns. Within the resultant mass of data lies the problem of analyzing and presenting information on this genomic scale, and a first step towards the rapid and comprehensive interpretation of this data is g...

Descripción completa

Detalles Bibliográficos
Autores principales: Tan, Meng P, Smith, Erin N, Broach, James R, Floudas, Christodoulos A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2442101/
https://www.ncbi.nlm.nih.gov/pubmed/18538024
http://dx.doi.org/10.1186/1471-2105-9-268
_version_ 1782156673444478976
author Tan, Meng P
Smith, Erin N
Broach, James R
Floudas, Christodoulos A
author_facet Tan, Meng P
Smith, Erin N
Broach, James R
Floudas, Christodoulos A
author_sort Tan, Meng P
collection PubMed
description BACKGROUND: DNA microarray technology allows for the measurement of genome-wide expression patterns. Within the resultant mass of data lies the problem of analyzing and presenting information on this genomic scale, and a first step towards the rapid and comprehensive interpretation of this data is gene clustering with respect to the expression patterns. Classifying genes into clusters can lead to interesting biological insights. In this study, we describe an iterative clustering approach to uncover biologically coherent structures from DNA microarray data based on a novel clustering algorithm EP_GOS_Clust. RESULTS: We apply our proposed iterative algorithm to three sets of experimental DNA microarray data from experiments with the yeast Saccharomyces cerevisiae and show that the proposed iterative approach improves biological coherence. Comparison with other clustering techniques suggests that our iterative algorithm provides superior performance with regard to biological coherence. An important consequence of our approach is that an increasing proportion of genes find membership in clusters of high biological coherence and that the average cluster specificity improves. CONCLUSION: The results from these clustering experiments provide a robust basis for extracting motifs and trans-acting factors that determine particular patterns of expression. In addition, the biological coherence of the clusters is iteratively assessed independently of the clustering. Thus, this method will not be severely impacted by functional annotations that are missing, inaccurate, or sparse.
format Text
id pubmed-2442101
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24421012008-07-01 Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures Tan, Meng P Smith, Erin N Broach, James R Floudas, Christodoulos A BMC Bioinformatics Methodology Article BACKGROUND: DNA microarray technology allows for the measurement of genome-wide expression patterns. Within the resultant mass of data lies the problem of analyzing and presenting information on this genomic scale, and a first step towards the rapid and comprehensive interpretation of this data is gene clustering with respect to the expression patterns. Classifying genes into clusters can lead to interesting biological insights. In this study, we describe an iterative clustering approach to uncover biologically coherent structures from DNA microarray data based on a novel clustering algorithm EP_GOS_Clust. RESULTS: We apply our proposed iterative algorithm to three sets of experimental DNA microarray data from experiments with the yeast Saccharomyces cerevisiae and show that the proposed iterative approach improves biological coherence. Comparison with other clustering techniques suggests that our iterative algorithm provides superior performance with regard to biological coherence. An important consequence of our approach is that an increasing proportion of genes find membership in clusters of high biological coherence and that the average cluster specificity improves. CONCLUSION: The results from these clustering experiments provide a robust basis for extracting motifs and trans-acting factors that determine particular patterns of expression. In addition, the biological coherence of the clusters is iteratively assessed independently of the clustering. Thus, this method will not be severely impacted by functional annotations that are missing, inaccurate, or sparse. BioMed Central 2008-06-06 /pmc/articles/PMC2442101/ /pubmed/18538024 http://dx.doi.org/10.1186/1471-2105-9-268 Text en Copyright © 2008 Tan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Tan, Meng P
Smith, Erin N
Broach, James R
Floudas, Christodoulos A
Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures
title Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures
title_full Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures
title_fullStr Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures
title_full_unstemmed Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures
title_short Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures
title_sort microarray data mining: a novel optimization-based approach to uncover biologically coherent structures
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2442101/
https://www.ncbi.nlm.nih.gov/pubmed/18538024
http://dx.doi.org/10.1186/1471-2105-9-268
work_keys_str_mv AT tanmengp microarraydatamininganoveloptimizationbasedapproachtouncoverbiologicallycoherentstructures
AT smitherinn microarraydatamininganoveloptimizationbasedapproachtouncoverbiologicallycoherentstructures
AT broachjamesr microarraydatamininganoveloptimizationbasedapproachtouncoverbiologicallycoherentstructures
AT floudaschristodoulosa microarraydatamininganoveloptimizationbasedapproachtouncoverbiologicallycoherentstructures