Cargando…

A temporal precedence based clustering method for gene expression microarray data

BACKGROUND: Time-course microarray experiments can produce useful data which can help in understanding the underlying dynamics of the system. Clustering is an important stage in microarray data analysis where the data is grouped together according to certain characteristics. The majority of clusteri...

Descripción completa

Detalles Bibliográficos
Autores principales: Krishna, Ritesh, Li, Chang-Tsun, Buchanan-Wollaston, Vicky
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2841598/
https://www.ncbi.nlm.nih.gov/pubmed/20113513
http://dx.doi.org/10.1186/1471-2105-11-68
_version_ 1782179134290526208
author Krishna, Ritesh
Li, Chang-Tsun
Buchanan-Wollaston, Vicky
author_facet Krishna, Ritesh
Li, Chang-Tsun
Buchanan-Wollaston, Vicky
author_sort Krishna, Ritesh
collection PubMed
description BACKGROUND: Time-course microarray experiments can produce useful data which can help in understanding the underlying dynamics of the system. Clustering is an important stage in microarray data analysis where the data is grouped together according to certain characteristics. The majority of clustering techniques are based on distance or visual similarity measures which may not be suitable for clustering of temporal microarray data where the sequential nature of time is important. We present a Granger causality based technique to cluster temporal microarray gene expression data, which measures the interdependence between two time-series by statistically testing if one time-series can be used for forecasting the other time-series or not. RESULTS: A gene-association matrix is constructed by testing temporal relationships between pairs of genes using the Granger causality test. The association matrix is further analyzed using a graph-theoretic technique to detect highly connected components representing interesting biological modules. We test our approach on synthesized datasets and real biological datasets obtained for Arabidopsis thaliana. We show the effectiveness of our approach by analyzing the results using the existing biological literature. We also report interesting structural properties of the association network commonly desired in any biological system. CONCLUSIONS: Our experiments on synthesized and real microarray datasets show that our approach produces encouraging results. The method is simple in implementation and is statistically traceable at each step. The method can produce sets of functionally related genes which can be further used for reverse-engineering of gene circuits.
format Text
id pubmed-2841598
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28415982010-03-19 A temporal precedence based clustering method for gene expression microarray data Krishna, Ritesh Li, Chang-Tsun Buchanan-Wollaston, Vicky BMC Bioinformatics Research article BACKGROUND: Time-course microarray experiments can produce useful data which can help in understanding the underlying dynamics of the system. Clustering is an important stage in microarray data analysis where the data is grouped together according to certain characteristics. The majority of clustering techniques are based on distance or visual similarity measures which may not be suitable for clustering of temporal microarray data where the sequential nature of time is important. We present a Granger causality based technique to cluster temporal microarray gene expression data, which measures the interdependence between two time-series by statistically testing if one time-series can be used for forecasting the other time-series or not. RESULTS: A gene-association matrix is constructed by testing temporal relationships between pairs of genes using the Granger causality test. The association matrix is further analyzed using a graph-theoretic technique to detect highly connected components representing interesting biological modules. We test our approach on synthesized datasets and real biological datasets obtained for Arabidopsis thaliana. We show the effectiveness of our approach by analyzing the results using the existing biological literature. We also report interesting structural properties of the association network commonly desired in any biological system. CONCLUSIONS: Our experiments on synthesized and real microarray datasets show that our approach produces encouraging results. The method is simple in implementation and is statistically traceable at each step. The method can produce sets of functionally related genes which can be further used for reverse-engineering of gene circuits. BioMed Central 2010-01-30 /pmc/articles/PMC2841598/ /pubmed/20113513 http://dx.doi.org/10.1186/1471-2105-11-68 Text en Copyright ©2010 Krishna et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Krishna, Ritesh
Li, Chang-Tsun
Buchanan-Wollaston, Vicky
A temporal precedence based clustering method for gene expression microarray data
title A temporal precedence based clustering method for gene expression microarray data
title_full A temporal precedence based clustering method for gene expression microarray data
title_fullStr A temporal precedence based clustering method for gene expression microarray data
title_full_unstemmed A temporal precedence based clustering method for gene expression microarray data
title_short A temporal precedence based clustering method for gene expression microarray data
title_sort temporal precedence based clustering method for gene expression microarray data
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2841598/
https://www.ncbi.nlm.nih.gov/pubmed/20113513
http://dx.doi.org/10.1186/1471-2105-11-68
work_keys_str_mv AT krishnaritesh atemporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT lichangtsun atemporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT buchananwollastonvicky atemporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT krishnaritesh temporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT lichangtsun temporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT buchananwollastonvicky temporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata