Cargando…

Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana

BACKGROUND: Several large-scale gene co-expression networks have been constructed successfully for predicting gene functional modules and cis-regulatory elements in Arabidopsis (Arabidopsis thaliana). However, these networks are usually constructed and analyzed in an ad hoc manner. In this study, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruan, Jianhua, Perez, Joseph, Hernandez, Brian, Lei, Chengwei, Sunter, Garry, Sponsel, Valerie M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3247083/
https://www.ncbi.nlm.nih.gov/pubmed/22168340
http://dx.doi.org/10.1186/1471-2105-12-S12-S2
_version_ 1782220038407716864
author Ruan, Jianhua
Perez, Joseph
Hernandez, Brian
Lei, Chengwei
Sunter, Garry
Sponsel, Valerie M
author_facet Ruan, Jianhua
Perez, Joseph
Hernandez, Brian
Lei, Chengwei
Sunter, Garry
Sponsel, Valerie M
author_sort Ruan, Jianhua
collection PubMed
description BACKGROUND: Several large-scale gene co-expression networks have been constructed successfully for predicting gene functional modules and cis-regulatory elements in Arabidopsis (Arabidopsis thaliana). However, these networks are usually constructed and analyzed in an ad hoc manner. In this study, we propose a completely parameter-free and systematic method for constructing gene co-expression networks and predicting functional modules as well as cis-regulatory elements. RESULTS: Our novel method consists of an automated network construction algorithm, a parameter-free procedure to predict functional modules, and a strategy for finding known cis-regulatory elements that is suitable for consensus scanning without prior knowledge of the allowed extent of degeneracy of the motif. We apply the method to study a large collection of gene expression microarray data in Arabidopsis. We estimate that our co-expression network has ~94% of accuracy, and has topological properties similar to other biological networks, such as being scale-free and having a high clustering coefficient. Remarkably, among the ~300 predicted modules whose sizes are at least 20, 88% have at least one significantly enriched functions, including a few extremely significant ones (ribosome, p < 1E-300, photosynthetic membrane, p < 1.3E-137, proteasome complex, p < 5.9E-126). In addition, we are able to predict cis-regulatory elements for 66.7% of the modules, and the association between the enriched cis-regulatory elements and the enriched functional terms can often be confirmed by the literature. Overall, our results are much more significant than those reported by several previous studies on similar data sets. Finally, we utilize the co-expression network to dissect the promoters of 19 Arabidopsis genes involved in the metabolism and signaling of the important plant hormone gibberellin, and achieved promising results that reveal interesting insight into the biosynthesis and signaling of gibberellin. CONCLUSIONS: The results show that our method is highly effective in finding functional modules from real microarray data. Our application on Arabidopsis leads to the discovery of the largest number of annotated Arabidopsis functional modules in the literature. Given the high statistical significance of functional enrichment and the agreement between cis-regulatory and functional annotations, we believe our Arabidopsis gene modules can be used to predict the functions of unknown genes in Arabidopsis, and to understand the regulatory mechanisms of many genes.
format Online
Article
Text
id pubmed-3247083
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32470832011-12-29 Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana Ruan, Jianhua Perez, Joseph Hernandez, Brian Lei, Chengwei Sunter, Garry Sponsel, Valerie M BMC Bioinformatics Proceedings BACKGROUND: Several large-scale gene co-expression networks have been constructed successfully for predicting gene functional modules and cis-regulatory elements in Arabidopsis (Arabidopsis thaliana). However, these networks are usually constructed and analyzed in an ad hoc manner. In this study, we propose a completely parameter-free and systematic method for constructing gene co-expression networks and predicting functional modules as well as cis-regulatory elements. RESULTS: Our novel method consists of an automated network construction algorithm, a parameter-free procedure to predict functional modules, and a strategy for finding known cis-regulatory elements that is suitable for consensus scanning without prior knowledge of the allowed extent of degeneracy of the motif. We apply the method to study a large collection of gene expression microarray data in Arabidopsis. We estimate that our co-expression network has ~94% of accuracy, and has topological properties similar to other biological networks, such as being scale-free and having a high clustering coefficient. Remarkably, among the ~300 predicted modules whose sizes are at least 20, 88% have at least one significantly enriched functions, including a few extremely significant ones (ribosome, p < 1E-300, photosynthetic membrane, p < 1.3E-137, proteasome complex, p < 5.9E-126). In addition, we are able to predict cis-regulatory elements for 66.7% of the modules, and the association between the enriched cis-regulatory elements and the enriched functional terms can often be confirmed by the literature. Overall, our results are much more significant than those reported by several previous studies on similar data sets. Finally, we utilize the co-expression network to dissect the promoters of 19 Arabidopsis genes involved in the metabolism and signaling of the important plant hormone gibberellin, and achieved promising results that reveal interesting insight into the biosynthesis and signaling of gibberellin. CONCLUSIONS: The results show that our method is highly effective in finding functional modules from real microarray data. Our application on Arabidopsis leads to the discovery of the largest number of annotated Arabidopsis functional modules in the literature. Given the high statistical significance of functional enrichment and the agreement between cis-regulatory and functional annotations, we believe our Arabidopsis gene modules can be used to predict the functions of unknown genes in Arabidopsis, and to understand the regulatory mechanisms of many genes. BioMed Central 2011-11-24 /pmc/articles/PMC3247083/ /pubmed/22168340 http://dx.doi.org/10.1186/1471-2105-12-S12-S2 Text en Copyright ©2011 Ruan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Ruan, Jianhua
Perez, Joseph
Hernandez, Brian
Lei, Chengwei
Sunter, Garry
Sponsel, Valerie M
Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana
title Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana
title_full Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana
title_fullStr Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana
title_full_unstemmed Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana
title_short Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana
title_sort systematic identification of functional modules and cis-regulatory elements in arabidopsis thaliana
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3247083/
https://www.ncbi.nlm.nih.gov/pubmed/22168340
http://dx.doi.org/10.1186/1471-2105-12-S12-S2
work_keys_str_mv AT ruanjianhua systematicidentificationoffunctionalmodulesandcisregulatoryelementsinarabidopsisthaliana
AT perezjoseph systematicidentificationoffunctionalmodulesandcisregulatoryelementsinarabidopsisthaliana
AT hernandezbrian systematicidentificationoffunctionalmodulesandcisregulatoryelementsinarabidopsisthaliana
AT leichengwei systematicidentificationoffunctionalmodulesandcisregulatoryelementsinarabidopsisthaliana
AT suntergarry systematicidentificationoffunctionalmodulesandcisregulatoryelementsinarabidopsisthaliana
AT sponselvaleriem systematicidentificationoffunctionalmodulesandcisregulatoryelementsinarabidopsisthaliana