Cargando…

EDISA: extracting biclusters from multiple time-series of gene expression profiles

BACKGROUND: Cells dynamically adapt their gene expression patterns in response to various stimuli. This response is orchestrated into a number of gene expression modules consisting of co-regulated genes. A growing pool of publicly available microarray datasets allows the identification of modules by...

Descripción completa

Detalles Bibliográficos
Autores principales:	Supper, Jochen, Strauch, Martin, Wanke, Dierk, Harter, Klaus, Zell, Andreas
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central\|1 2007
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2063505/ https://www.ncbi.nlm.nih.gov/pubmed/17850657 http://dx.doi.org/10.1186/1471-2105-8-334

_version_	1782137336133320704
author	Supper, Jochen Strauch, Martin Wanke, Dierk Harter, Klaus Zell, Andreas
author_facet	Supper, Jochen Strauch, Martin Wanke, Dierk Harter, Klaus Zell, Andreas
author_sort	Supper, Jochen
collection	PubMed
description	BACKGROUND: Cells dynamically adapt their gene expression patterns in response to various stimuli. This response is orchestrated into a number of gene expression modules consisting of co-regulated genes. A growing pool of publicly available microarray datasets allows the identification of modules by monitoring expression changes over time. These time-series datasets can be searched for gene expression modules by one of the many clustering methods published to date. For an integrative analysis, several time-series datasets can be joined into a three-dimensional gene-condition-time dataset, to which standard clustering or biclustering methods are, however, not applicable. We thus devise a probabilistic clustering algorithm for gene-condition-time datasets. RESULTS: In this work, we present the EDISA (Extended Dimension Iterative Signature Algorithm), a novel probabilistic clustering approach for 3D gene-condition-time datasets. Based on mathematical definitions of gene expression modules, the EDISA samples initial modules from the dataset which are then refined by removing genes and conditions until they comply with the module definition. A subsequent extension step ensures gene and condition maximality. We applied the algorithm to a synthetic dataset and were able to successfully recover the implanted modules over a range of background noise intensities. Analysis of microarray datasets has lead us to define three biologically relevant module types: 1) We found modules with independent response profiles to be the most prevalent ones. These modules comprise genes which are co-regulated under several conditions, yet with a different response pattern under each condition. 2) Coherent modules with similar responses under all conditions occurred frequently, too, and were often contained within these modules. 3) A third module type, which covers a response specific to a single condition was also detected, but rarely. All of these modules are essentially different types of biclusters. CONCLUSION: We successfully applied the EDISA to different 3D datasets. While previous studies were mostly aimed at detecting coherent modules only, our results show that coherent responses are often part of a more general module type with independent response profiles under different conditions. Our approach thus allows for a more comprehensive view of the gene expression response. After subsequent analysis of the resulting modules, the EDISA helped to shed light on the global organization of transcriptional control. An implementation of the algorithm is available at http://www-ra.informatik.uni-tuebingen.de/software/IAGEN/.
format	Text
id	pubmed-2063505
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	BioMed Central\|1
record_format	MEDLINE/PubMed
spelling	pubmed-20635052007-11-06 EDISA: extracting biclusters from multiple time-series of gene expression profiles Supper, Jochen Strauch, Martin Wanke, Dierk Harter, Klaus Zell, Andreas BMC Bioinformatics Methodology Article BACKGROUND: Cells dynamically adapt their gene expression patterns in response to various stimuli. This response is orchestrated into a number of gene expression modules consisting of co-regulated genes. A growing pool of publicly available microarray datasets allows the identification of modules by monitoring expression changes over time. These time-series datasets can be searched for gene expression modules by one of the many clustering methods published to date. For an integrative analysis, several time-series datasets can be joined into a three-dimensional gene-condition-time dataset, to which standard clustering or biclustering methods are, however, not applicable. We thus devise a probabilistic clustering algorithm for gene-condition-time datasets. RESULTS: In this work, we present the EDISA (Extended Dimension Iterative Signature Algorithm), a novel probabilistic clustering approach for 3D gene-condition-time datasets. Based on mathematical definitions of gene expression modules, the EDISA samples initial modules from the dataset which are then refined by removing genes and conditions until they comply with the module definition. A subsequent extension step ensures gene and condition maximality. We applied the algorithm to a synthetic dataset and were able to successfully recover the implanted modules over a range of background noise intensities. Analysis of microarray datasets has lead us to define three biologically relevant module types: 1) We found modules with independent response profiles to be the most prevalent ones. These modules comprise genes which are co-regulated under several conditions, yet with a different response pattern under each condition. 2) Coherent modules with similar responses under all conditions occurred frequently, too, and were often contained within these modules. 3) A third module type, which covers a response specific to a single condition was also detected, but rarely. All of these modules are essentially different types of biclusters. CONCLUSION: We successfully applied the EDISA to different 3D datasets. While previous studies were mostly aimed at detecting coherent modules only, our results show that coherent responses are often part of a more general module type with independent response profiles under different conditions. Our approach thus allows for a more comprehensive view of the gene expression response. After subsequent analysis of the resulting modules, the EDISA helped to shed light on the global organization of transcriptional control. An implementation of the algorithm is available at http://www-ra.informatik.uni-tuebingen.de/software/IAGEN/. BioMed Central\|1 2007-09-12 /pmc/articles/PMC2063505/ /pubmed/17850657 http://dx.doi.org/10.1186/1471-2105-8-334 Text en Copyright © 2007 Supper et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Supper, Jochen Strauch, Martin Wanke, Dierk Harter, Klaus Zell, Andreas EDISA: extracting biclusters from multiple time-series of gene expression profiles
title	EDISA: extracting biclusters from multiple time-series of gene expression profiles
title_full	EDISA: extracting biclusters from multiple time-series of gene expression profiles
title_fullStr	EDISA: extracting biclusters from multiple time-series of gene expression profiles
title_full_unstemmed	EDISA: extracting biclusters from multiple time-series of gene expression profiles
title_short	EDISA: extracting biclusters from multiple time-series of gene expression profiles
title_sort	edisa: extracting biclusters from multiple time-series of gene expression profiles
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2063505/ https://www.ncbi.nlm.nih.gov/pubmed/17850657 http://dx.doi.org/10.1186/1471-2105-8-334
work_keys_str_mv	AT supperjochen edisaextractingbiclustersfrommultipletimeseriesofgeneexpressionprofiles AT strauchmartin edisaextractingbiclustersfrommultipletimeseriesofgeneexpressionprofiles AT wankedierk edisaextractingbiclustersfrommultipletimeseriesofgeneexpressionprofiles AT harterklaus edisaextractingbiclustersfrommultipletimeseriesofgeneexpressionprofiles AT zellandreas edisaextractingbiclustersfrommultipletimeseriesofgeneexpressionprofiles

EDISA: extracting biclusters from multiple time-series of gene expression profiles

Ejemplares similares