Cargando…

Gene set internal coherence in the context of functional profiling

BACKGROUND: Functional profiling methods have been extensively used in the context of high-throughput experiments and, in particular, in microarray data analysis. Such methods use available biological information to define different types of functional gene modules (e.g. gene ontology -GO-, KEGG pat...

Descripción completa

Detalles Bibliográficos
Autores principales: Montaner, David, Minguez, Pablo, Al-Shahrour, Fátima, Dopazo, Joaquín
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2680416/
https://www.ncbi.nlm.nih.gov/pubmed/19397819
http://dx.doi.org/10.1186/1471-2164-10-197
_version_ 1782166953935241216
author Montaner, David
Minguez, Pablo
Al-Shahrour, Fátima
Dopazo, Joaquín
author_facet Montaner, David
Minguez, Pablo
Al-Shahrour, Fátima
Dopazo, Joaquín
author_sort Montaner, David
collection PubMed
description BACKGROUND: Functional profiling methods have been extensively used in the context of high-throughput experiments and, in particular, in microarray data analysis. Such methods use available biological information to define different types of functional gene modules (e.g. gene ontology -GO-, KEGG pathways, etc.) whose representation in a pre-defined list of genes is further studied. In the most popular type of microarray experimental designs (e.g. up- or down-regulated genes, clusters of co-expressing genes, etc.) or in other genomic experiments (e.g. Chip-on-chip, epigenomics, etc.) these lists are composed by genes with a high degree of co-expression. Therefore, an implicit assumption in the application of functional profiling methods within this context is that the genes corresponding to the modules tested are effectively defining sets of co-expressing genes. Nevertheless not all the functional modules are biologically coherent entities in terms of co-expression, which will eventually hinder its detection with conventional methods of functional enrichment. RESULTS: Using a large collection of microarray data we have carried out a detailed survey of internal correlation in GO terms and KEGG pathways, providing a coherence index to be used for measuring functional module co-regulation. An unexpected low level of internal correlation was found among the modules studied. Only around 30% of the modules defined by GO terms and 57% of the modules defined by KEGG pathways display an internal correlation higher than the expected by chance. This information on the internal correlation of the genes within the functional modules can be used in the context of a logistic regression model in a simple way to improve their detection in gene expression experiments. CONCLUSION: For the first time, an exhaustive study on the internal co-expression of the most popular functional categories has been carried out. Interestingly, the real level of coexpression within many of them is lower than expected (or even inexistent), which will preclude its detection by means of most conventional functional profiling methods. If the gene-to-function correlation information is used in functional profiling methods, the results obtained improve the ones obtained by conventional enrichment methods.
format Text
id pubmed-2680416
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26804162009-05-12 Gene set internal coherence in the context of functional profiling Montaner, David Minguez, Pablo Al-Shahrour, Fátima Dopazo, Joaquín BMC Genomics Research Article BACKGROUND: Functional profiling methods have been extensively used in the context of high-throughput experiments and, in particular, in microarray data analysis. Such methods use available biological information to define different types of functional gene modules (e.g. gene ontology -GO-, KEGG pathways, etc.) whose representation in a pre-defined list of genes is further studied. In the most popular type of microarray experimental designs (e.g. up- or down-regulated genes, clusters of co-expressing genes, etc.) or in other genomic experiments (e.g. Chip-on-chip, epigenomics, etc.) these lists are composed by genes with a high degree of co-expression. Therefore, an implicit assumption in the application of functional profiling methods within this context is that the genes corresponding to the modules tested are effectively defining sets of co-expressing genes. Nevertheless not all the functional modules are biologically coherent entities in terms of co-expression, which will eventually hinder its detection with conventional methods of functional enrichment. RESULTS: Using a large collection of microarray data we have carried out a detailed survey of internal correlation in GO terms and KEGG pathways, providing a coherence index to be used for measuring functional module co-regulation. An unexpected low level of internal correlation was found among the modules studied. Only around 30% of the modules defined by GO terms and 57% of the modules defined by KEGG pathways display an internal correlation higher than the expected by chance. This information on the internal correlation of the genes within the functional modules can be used in the context of a logistic regression model in a simple way to improve their detection in gene expression experiments. CONCLUSION: For the first time, an exhaustive study on the internal co-expression of the most popular functional categories has been carried out. Interestingly, the real level of coexpression within many of them is lower than expected (or even inexistent), which will preclude its detection by means of most conventional functional profiling methods. If the gene-to-function correlation information is used in functional profiling methods, the results obtained improve the ones obtained by conventional enrichment methods. BioMed Central 2009-04-27 /pmc/articles/PMC2680416/ /pubmed/19397819 http://dx.doi.org/10.1186/1471-2164-10-197 Text en Copyright © 2009 Montaner et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Montaner, David
Minguez, Pablo
Al-Shahrour, Fátima
Dopazo, Joaquín
Gene set internal coherence in the context of functional profiling
title Gene set internal coherence in the context of functional profiling
title_full Gene set internal coherence in the context of functional profiling
title_fullStr Gene set internal coherence in the context of functional profiling
title_full_unstemmed Gene set internal coherence in the context of functional profiling
title_short Gene set internal coherence in the context of functional profiling
title_sort gene set internal coherence in the context of functional profiling
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2680416/
https://www.ncbi.nlm.nih.gov/pubmed/19397819
http://dx.doi.org/10.1186/1471-2164-10-197
work_keys_str_mv AT montanerdavid genesetinternalcoherenceinthecontextoffunctionalprofiling
AT minguezpablo genesetinternalcoherenceinthecontextoffunctionalprofiling
AT alshahrourfatima genesetinternalcoherenceinthecontextoffunctionalprofiling
AT dopazojoaquin genesetinternalcoherenceinthecontextoffunctionalprofiling