Cargando…

Listen to Genes: Dealing with Microarray Data in the Frequency Domain

BACKGROUND: We present a novel and systematic approach to analyze temporal microarray data. The approach includes normalization, clustering and network analysis of genes. METHODOLOGY: Genes are normalized using an error model based uniform normalization method aimed at identifying and estimating the...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Jianfeng, Yi, Dongyun, Krishna, Ritesh, Guo, Shuixia, Buchanan-Wollaston, Vicky
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3383793/
https://www.ncbi.nlm.nih.gov/pubmed/22745650
http://dx.doi.org/10.1371/journal.pone.0005098
_version_ 1782236654519451648
author Feng, Jianfeng
Yi, Dongyun
Krishna, Ritesh
Guo, Shuixia
Buchanan-Wollaston, Vicky
author_facet Feng, Jianfeng
Yi, Dongyun
Krishna, Ritesh
Guo, Shuixia
Buchanan-Wollaston, Vicky
author_sort Feng, Jianfeng
collection PubMed
description BACKGROUND: We present a novel and systematic approach to analyze temporal microarray data. The approach includes normalization, clustering and network analysis of genes. METHODOLOGY: Genes are normalized using an error model based uniform normalization method aimed at identifying and estimating the sources of variations. The model minimizes the correlation among error terms across replicates. The normalized gene expressions are then clustered in terms of their power spectrum density. The method of complex Granger causality is introduced to reveal interactions between sets of genes. Complex Granger causality along with partial Granger causality is applied in both time and frequency domains to selected as well as all the genes to reveal the interesting networks of interactions. The approach is successfully applied to Arabidopsis leaf microarray data generated from 31,000 genes observed over 22 time points over 22 days. Three circuits: a circadian gene circuit, an ethylene circuit and a new global circuit showing a hierarchical structure to determine the initiators of leaf senescence are analyzed in detail. CONCLUSIONS: We use a totally data-driven approach to form biological hypothesis. Clustering using the power-spectrum analysis helps us identify genes of potential interest. Their dynamics can be captured accurately in the time and frequency domain using the methods of complex and partial Granger causality. With the rise in availability of temporal microarray data, such methods can be useful tools in uncovering the hidden biological interactions. We show our method in a step by step manner with help of toy models as well as a real biological dataset. We also analyse three distinct gene circuits of potential interest to Arabidopsis researchers.
format Online
Article
Text
id pubmed-3383793
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33837932012-06-28 Listen to Genes: Dealing with Microarray Data in the Frequency Domain Feng, Jianfeng Yi, Dongyun Krishna, Ritesh Guo, Shuixia Buchanan-Wollaston, Vicky PLoS One Research Article BACKGROUND: We present a novel and systematic approach to analyze temporal microarray data. The approach includes normalization, clustering and network analysis of genes. METHODOLOGY: Genes are normalized using an error model based uniform normalization method aimed at identifying and estimating the sources of variations. The model minimizes the correlation among error terms across replicates. The normalized gene expressions are then clustered in terms of their power spectrum density. The method of complex Granger causality is introduced to reveal interactions between sets of genes. Complex Granger causality along with partial Granger causality is applied in both time and frequency domains to selected as well as all the genes to reveal the interesting networks of interactions. The approach is successfully applied to Arabidopsis leaf microarray data generated from 31,000 genes observed over 22 time points over 22 days. Three circuits: a circadian gene circuit, an ethylene circuit and a new global circuit showing a hierarchical structure to determine the initiators of leaf senescence are analyzed in detail. CONCLUSIONS: We use a totally data-driven approach to form biological hypothesis. Clustering using the power-spectrum analysis helps us identify genes of potential interest. Their dynamics can be captured accurately in the time and frequency domain using the methods of complex and partial Granger causality. With the rise in availability of temporal microarray data, such methods can be useful tools in uncovering the hidden biological interactions. We show our method in a step by step manner with help of toy models as well as a real biological dataset. We also analyse three distinct gene circuits of potential interest to Arabidopsis researchers. Public Library of Science 2009-04-06 /pmc/articles/PMC3383793/ /pubmed/22745650 http://dx.doi.org/10.1371/journal.pone.0005098 Text en Feng et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Feng, Jianfeng
Yi, Dongyun
Krishna, Ritesh
Guo, Shuixia
Buchanan-Wollaston, Vicky
Listen to Genes: Dealing with Microarray Data in the Frequency Domain
title Listen to Genes: Dealing with Microarray Data in the Frequency Domain
title_full Listen to Genes: Dealing with Microarray Data in the Frequency Domain
title_fullStr Listen to Genes: Dealing with Microarray Data in the Frequency Domain
title_full_unstemmed Listen to Genes: Dealing with Microarray Data in the Frequency Domain
title_short Listen to Genes: Dealing with Microarray Data in the Frequency Domain
title_sort listen to genes: dealing with microarray data in the frequency domain
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3383793/
https://www.ncbi.nlm.nih.gov/pubmed/22745650
http://dx.doi.org/10.1371/journal.pone.0005098
work_keys_str_mv AT fengjianfeng listentogenesdealingwithmicroarraydatainthefrequencydomain
AT yidongyun listentogenesdealingwithmicroarraydatainthefrequencydomain
AT krishnaritesh listentogenesdealingwithmicroarraydatainthefrequencydomain
AT guoshuixia listentogenesdealingwithmicroarraydatainthefrequencydomain
AT buchananwollastonvicky listentogenesdealingwithmicroarraydatainthefrequencydomain