Cargando…

An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information

MOTIVATION: Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Ao, Tuck, David
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2758278/
https://www.ncbi.nlm.nih.gov/pubmed/19838334
_version_ 1782172581461229568
author Li, Ao
Tuck, David
author_facet Li, Ao
Tuck, David
author_sort Li, Ao
collection PubMed
description MOTIVATION: Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may also be available, most commonly about which transcription factors may bind to the promoter region and thus control the expression level of a gene. Thus a method to integrate gene expression and gene regulation information is desirable for clustering and analyzing. METHODS: By incorporating gene regulatory information with gene expression data, we define regulated expression values (REV) as indicators of how a gene is regulated by a specific factor. Existing bi-clustering methods are extended to a three dimensional data space by developing a heuristic TRI-Clustering algorithm. An additional approach named Automatic Boundary Searching algorithm (ABS) is introduced to automatically determine the boundary threshold. RESULTS: Results based on incorporating ChIP-chip data representing transcription factor-gene interactions show that the algorithms are efficient and robust for detecting tri-clusters. Detailed analysis of the tri-cluster extracted from yeast sporulation REV data shows genes in this cluster exhibited significant differences during the middle and late stages. The implicated regulatory network was then reconstructed for further study of defined regulatory mechanisms. Topological and statistical analysis of this network demonstrated evidence of significant changes of TF activities during the different stages of yeast sporulation, and suggests this approach might be a general way to study regulatory networks undergoing transformations.
format Text
id pubmed-2758278
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-27582782009-10-16 An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information Li, Ao Tuck, David Gene Regul Syst Bio Original Research MOTIVATION: Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may also be available, most commonly about which transcription factors may bind to the promoter region and thus control the expression level of a gene. Thus a method to integrate gene expression and gene regulation information is desirable for clustering and analyzing. METHODS: By incorporating gene regulatory information with gene expression data, we define regulated expression values (REV) as indicators of how a gene is regulated by a specific factor. Existing bi-clustering methods are extended to a three dimensional data space by developing a heuristic TRI-Clustering algorithm. An additional approach named Automatic Boundary Searching algorithm (ABS) is introduced to automatically determine the boundary threshold. RESULTS: Results based on incorporating ChIP-chip data representing transcription factor-gene interactions show that the algorithms are efficient and robust for detecting tri-clusters. Detailed analysis of the tri-cluster extracted from yeast sporulation REV data shows genes in this cluster exhibited significant differences during the middle and late stages. The implicated regulatory network was then reconstructed for further study of defined regulatory mechanisms. Topological and statistical analysis of this network demonstrated evidence of significant changes of TF activities during the different stages of yeast sporulation, and suggests this approach might be a general way to study regulatory networks undergoing transformations. Libertas Academica 2009-04-15 /pmc/articles/PMC2758278/ /pubmed/19838334 Text en © 2009 by the authors http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Original Research
Li, Ao
Tuck, David
An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information
title An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information
title_full An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information
title_fullStr An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information
title_full_unstemmed An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information
title_short An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information
title_sort effective tri-clustering algorithm combining expression data with gene regulation information
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2758278/
https://www.ncbi.nlm.nih.gov/pubmed/19838334
work_keys_str_mv AT liao aneffectivetriclusteringalgorithmcombiningexpressiondatawithgeneregulationinformation
AT tuckdavid aneffectivetriclusteringalgorithmcombiningexpressiondatawithgeneregulationinformation
AT liao effectivetriclusteringalgorithmcombiningexpressiondatawithgeneregulationinformation
AT tuckdavid effectivetriclusteringalgorithmcombiningexpressiondatawithgeneregulationinformation