Cargando…

Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data

BACKGROUND: Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expression datasets gives researchers the ability to formal...

Descripción completa

Detalles Bibliográficos
Autores principales: Wilson, Tyler J, Lai, Liming, Ban, Yuguang, Ge, Steven X
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3536586/
https://www.ncbi.nlm.nih.gov/pubmed/22694750
http://dx.doi.org/10.1186/1471-2164-13-237
_version_ 1782254762754834432
author Wilson, Tyler J
Lai, Liming
Ban, Yuguang
Ge, Steven X
author_facet Wilson, Tyler J
Lai, Liming
Ban, Yuguang
Ge, Steven X
author_sort Wilson, Tyler J
collection PubMed
description BACKGROUND: Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expression datasets gives researchers the ability to formalize hypotheses concerning the functionality and interaction between different groups of correlated genes. RESULTS: We applied the non-negative matrix factorization (NMF) algorithm to the AtGenExpress dataset which consists of 783 microarray samples (29 separate experimental series) conducted on the model plant Arabidopsis thaliana. We identified 15 metagenes, which are groups of genes with correlated expression. Functional roles of these metagenes are established by observing the enriched gene ontology (GO) categories using gene set enrichment analyses (GSEA). Activity levels of these metagenes in various experimental conditions are also analyzed to associate metagenes with stimuli/conditions. A metagene correlation network, constructed based on the results of NMF analysis, revealed many new interactions between the metagenes. Comparison of these metagenes with an earlier large-scale clustering analysis indicates many statistically significant overlaps. CONCLUSIONS: This study identifies a network of correlated metagenes composed of Arabidopsis genes acting in a highly correlated fashion across a broad spectrum of experimental stimuli, which may shed some light on the function of many of the un-annotated genes.
format Online
Article
Text
id pubmed-3536586
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35365862013-01-08 Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data Wilson, Tyler J Lai, Liming Ban, Yuguang Ge, Steven X BMC Genomics Research Article BACKGROUND: Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expression datasets gives researchers the ability to formalize hypotheses concerning the functionality and interaction between different groups of correlated genes. RESULTS: We applied the non-negative matrix factorization (NMF) algorithm to the AtGenExpress dataset which consists of 783 microarray samples (29 separate experimental series) conducted on the model plant Arabidopsis thaliana. We identified 15 metagenes, which are groups of genes with correlated expression. Functional roles of these metagenes are established by observing the enriched gene ontology (GO) categories using gene set enrichment analyses (GSEA). Activity levels of these metagenes in various experimental conditions are also analyzed to associate metagenes with stimuli/conditions. A metagene correlation network, constructed based on the results of NMF analysis, revealed many new interactions between the metagenes. Comparison of these metagenes with an earlier large-scale clustering analysis indicates many statistically significant overlaps. CONCLUSIONS: This study identifies a network of correlated metagenes composed of Arabidopsis genes acting in a highly correlated fashion across a broad spectrum of experimental stimuli, which may shed some light on the function of many of the un-annotated genes. BioMed Central 2012-06-13 /pmc/articles/PMC3536586/ /pubmed/22694750 http://dx.doi.org/10.1186/1471-2164-13-237 Text en Copyright ©2012 Wilson et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wilson, Tyler J
Lai, Liming
Ban, Yuguang
Ge, Steven X
Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data
title Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data
title_full Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data
title_fullStr Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data
title_full_unstemmed Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data
title_short Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data
title_sort identification of metagenes and their interactions through large-scale analysis of arabidopsis gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3536586/
https://www.ncbi.nlm.nih.gov/pubmed/22694750
http://dx.doi.org/10.1186/1471-2164-13-237
work_keys_str_mv AT wilsontylerj identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofarabidopsisgeneexpressiondata
AT lailiming identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofarabidopsisgeneexpressiondata
AT banyuguang identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofarabidopsisgeneexpressiondata
AT gestevenx identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofarabidopsisgeneexpressiondata