Cargando…

Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

BACKGROUND: A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisit...

Descripción completa

Detalles Bibliográficos
Autores principales: Chou, Jeff W, Zhou, Tong, Kaufmann, William K, Paules, Richard S, Bushel, Pierre R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2194742/
https://www.ncbi.nlm.nih.gov/pubmed/17980031
http://dx.doi.org/10.1186/1471-2105-8-427
_version_ 1782147686830440448
author Chou, Jeff W
Zhou, Tong
Kaufmann, William K
Paules, Richard S
Bushel, Pierre R
author_facet Chou, Jeff W
Zhou, Tong
Kaufmann, William K
Paules, Richard S
Bushel, Pierre R
author_sort Chou, Jeff W
collection PubMed
description BACKGROUND: A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. RESULTS: Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV) or ionizing radiation (IR)-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying biological processes affected by IR- and/or UV- induced DNA damage. CONCLUSION: EPIG competed with CLICK and performed better than CAST in extracting patterns from simulated data. EPIG extracted more biological informative patterns and co-expressed genes from both C. elegans and IR/UV-treated human fibroblasts. Using Gene Ontology analysis of the genes in the patterns extracted by EPIG, several key biological categories related to p53-dependent cell cycle control were revealed from the IR/UV data. Among them were mitotic cell cycle, DNA replication, DNA repair, cell cycle checkpoint, and G(0)-like status transition. EPIG can be applied to data sets from a variety of experimental designs.
format Text
id pubmed-2194742
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-21947422008-01-14 Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes Chou, Jeff W Zhou, Tong Kaufmann, William K Paules, Richard S Bushel, Pierre R BMC Bioinformatics Research Article BACKGROUND: A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. RESULTS: Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV) or ionizing radiation (IR)-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying biological processes affected by IR- and/or UV- induced DNA damage. CONCLUSION: EPIG competed with CLICK and performed better than CAST in extracting patterns from simulated data. EPIG extracted more biological informative patterns and co-expressed genes from both C. elegans and IR/UV-treated human fibroblasts. Using Gene Ontology analysis of the genes in the patterns extracted by EPIG, several key biological categories related to p53-dependent cell cycle control were revealed from the IR/UV data. Among them were mitotic cell cycle, DNA replication, DNA repair, cell cycle checkpoint, and G(0)-like status transition. EPIG can be applied to data sets from a variety of experimental designs. BioMed Central 2007-11-02 /pmc/articles/PMC2194742/ /pubmed/17980031 http://dx.doi.org/10.1186/1471-2105-8-427 Text en Copyright © 2007 Chou et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chou, Jeff W
Zhou, Tong
Kaufmann, William K
Paules, Richard S
Bushel, Pierre R
Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes
title Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes
title_full Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes
title_fullStr Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes
title_full_unstemmed Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes
title_short Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes
title_sort extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2194742/
https://www.ncbi.nlm.nih.gov/pubmed/17980031
http://dx.doi.org/10.1186/1471-2105-8-427
work_keys_str_mv AT choujeffw extractinggeneexpressionpatternsandidentifyingcoexpressedgenesfrommicroarraydatarevealsbiologicallyresponsiveprocesses
AT zhoutong extractinggeneexpressionpatternsandidentifyingcoexpressedgenesfrommicroarraydatarevealsbiologicallyresponsiveprocesses
AT kaufmannwilliamk extractinggeneexpressionpatternsandidentifyingcoexpressedgenesfrommicroarraydatarevealsbiologicallyresponsiveprocesses
AT paulesrichards extractinggeneexpressionpatternsandidentifyingcoexpressedgenesfrommicroarraydatarevealsbiologicallyresponsiveprocesses
AT bushelpierrer extractinggeneexpressionpatternsandidentifyingcoexpressedgenesfrommicroarraydatarevealsbiologicallyresponsiveprocesses