Cargando…

Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis

BACKGROUND: Gene clustering has been widely used to group genes with similar expression pattern in microarray data analysis. Subsequent enrichment analysis using predefined gene sets can provide clues on which functional themes or regulatory sequence motifs are associated with individual gene cluste...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Tae-Min, Chung, Yeun-Jun, Rhyu, Mun-Gan, Ho Jung, Myeong
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2217565/
https://www.ncbi.nlm.nih.gov/pubmed/18021416
http://dx.doi.org/10.1186/1471-2105-8-453
_version_ 1782149282303836160
author Kim, Tae-Min
Chung, Yeun-Jun
Rhyu, Mun-Gan
Ho Jung, Myeong
author_facet Kim, Tae-Min
Chung, Yeun-Jun
Rhyu, Mun-Gan
Ho Jung, Myeong
author_sort Kim, Tae-Min
collection PubMed
description BACKGROUND: Gene clustering has been widely used to group genes with similar expression pattern in microarray data analysis. Subsequent enrichment analysis using predefined gene sets can provide clues on which functional themes or regulatory sequence motifs are associated with individual gene clusters. In spite of the potential utility, gene clustering and enrichment analysis have been used in separate platforms, thus, the development of integrative algorithm linking both methods is highly challenging. RESULTS: In this study, we propose an algorithm for discovery of molecular functions and elucidation of transcriptional logics using two kinds of gene information, functional and regulatory motif gene sets. The algorithm, termed gene set expression coherence analysis first selects functional gene sets with significantly high expression coherences. Those candidate gene sets are further processed into a number of functionally related themes or functional clusters according to the expression similarities. Each functional cluster is then, investigated for the enrichment of transcriptional regulatory motifs using modified gene set enrichment analysis and regulatory motif gene sets. The method was tested for two publicly available expression profiles representing murine myogenesis and erythropoiesis. For respective profiles, our algorithm identified myocyte- and erythrocyte-related molecular functions, along with the putative transcriptional regulators for the corresponding molecular functions. CONCLUSION: As an integrative and comprehensive method for the analysis of large-scaled gene expression profiles, our method is able to generate a set of testable hypotheses: the transcriptional regulator X regulates function Y under cellular condition Z. GSECA algorithm is implemented into freely available software package.
format Text
id pubmed-2217565
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22175652008-01-30 Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis Kim, Tae-Min Chung, Yeun-Jun Rhyu, Mun-Gan Ho Jung, Myeong BMC Bioinformatics Methodology Article BACKGROUND: Gene clustering has been widely used to group genes with similar expression pattern in microarray data analysis. Subsequent enrichment analysis using predefined gene sets can provide clues on which functional themes or regulatory sequence motifs are associated with individual gene clusters. In spite of the potential utility, gene clustering and enrichment analysis have been used in separate platforms, thus, the development of integrative algorithm linking both methods is highly challenging. RESULTS: In this study, we propose an algorithm for discovery of molecular functions and elucidation of transcriptional logics using two kinds of gene information, functional and regulatory motif gene sets. The algorithm, termed gene set expression coherence analysis first selects functional gene sets with significantly high expression coherences. Those candidate gene sets are further processed into a number of functionally related themes or functional clusters according to the expression similarities. Each functional cluster is then, investigated for the enrichment of transcriptional regulatory motifs using modified gene set enrichment analysis and regulatory motif gene sets. The method was tested for two publicly available expression profiles representing murine myogenesis and erythropoiesis. For respective profiles, our algorithm identified myocyte- and erythrocyte-related molecular functions, along with the putative transcriptional regulators for the corresponding molecular functions. CONCLUSION: As an integrative and comprehensive method for the analysis of large-scaled gene expression profiles, our method is able to generate a set of testable hypotheses: the transcriptional regulator X regulates function Y under cellular condition Z. GSECA algorithm is implemented into freely available software package. BioMed Central 2007-11-17 /pmc/articles/PMC2217565/ /pubmed/18021416 http://dx.doi.org/10.1186/1471-2105-8-453 Text en Copyright © 2007 Kim et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Kim, Tae-Min
Chung, Yeun-Jun
Rhyu, Mun-Gan
Ho Jung, Myeong
Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis
title Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis
title_full Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis
title_fullStr Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis
title_full_unstemmed Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis
title_short Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis
title_sort inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2217565/
https://www.ncbi.nlm.nih.gov/pubmed/18021416
http://dx.doi.org/10.1186/1471-2105-8-453
work_keys_str_mv AT kimtaemin inferringbiologicalfunctionsandassociatedtranscriptionalregulatorsusinggenesetexpressioncoherenceanalysis
AT chungyeunjun inferringbiologicalfunctionsandassociatedtranscriptionalregulatorsusinggenesetexpressioncoherenceanalysis
AT rhyumungan inferringbiologicalfunctionsandassociatedtranscriptionalregulatorsusinggenesetexpressioncoherenceanalysis
AT hojungmyeong inferringbiologicalfunctionsandassociatedtranscriptionalregulatorsusinggenesetexpressioncoherenceanalysis