Cargando…

Assessing the Biological Significance of Gene Expression Signatures and Co-Expression Modules by Studying Their Network Properties

Microarray experiments have been extensively used to define signatures, which are sets of genes that can be considered markers of experimental conditions (typically diseases). Paradoxically, in spite of the apparent functional role that might be attributed to such gene sets, signatures do not seem t...

Descripción completa

Detalles Bibliográficos
Autores principales: Minguez, Pablo, Dopazo, Joaquin
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3049771/
https://www.ncbi.nlm.nih.gov/pubmed/21408226
http://dx.doi.org/10.1371/journal.pone.0017474
_version_ 1782199271068532736
author Minguez, Pablo
Dopazo, Joaquin
author_facet Minguez, Pablo
Dopazo, Joaquin
author_sort Minguez, Pablo
collection PubMed
description Microarray experiments have been extensively used to define signatures, which are sets of genes that can be considered markers of experimental conditions (typically diseases). Paradoxically, in spite of the apparent functional role that might be attributed to such gene sets, signatures do not seem to be reproducible across experiments. Given the close relationship between function and protein interaction, network properties can be used to study to what extent signatures are composed of genes whose resulting proteins show a considerable level of interaction (and consequently a putative common functional role). We have analysed 618 signatures and 507 modules of co-expression in cancer looking for significant values of four main protein-protein interaction (PPI) network parameters: connection degree, cluster coefficient, betweenness and number of components. A total of 3904 gene ontology (GO) modules, 146 KEGG pathways, and 263 Biocarta pathways have been used as functional modules of reference. Co-expression modules found in microarray experiments display a high level of connectivity, similar to the one shown by conventional modules based on functional definitions (GO, KEGG and Biocarta). A general observation for all the classes studied is that the networks formed by the modules improve their topological parameters when an external protein is allowed to be introduced within the paths (up to the 70% of GO modules show network parameters beyond the random expectation). This fact suggests that functional definitions are incomplete and some genes might still be missing. Conversely, signatures are clearly not capturing the altered functions in the corresponding studies. This is probably because the way in which the genes have been selected in the signatures is too conservative. These results suggest that gene selection methods which take into account relationships among genes should be superior to methods that assume independence among genes outside their functional contexts.
format Text
id pubmed-3049771
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30497712011-03-15 Assessing the Biological Significance of Gene Expression Signatures and Co-Expression Modules by Studying Their Network Properties Minguez, Pablo Dopazo, Joaquin PLoS One Research Article Microarray experiments have been extensively used to define signatures, which are sets of genes that can be considered markers of experimental conditions (typically diseases). Paradoxically, in spite of the apparent functional role that might be attributed to such gene sets, signatures do not seem to be reproducible across experiments. Given the close relationship between function and protein interaction, network properties can be used to study to what extent signatures are composed of genes whose resulting proteins show a considerable level of interaction (and consequently a putative common functional role). We have analysed 618 signatures and 507 modules of co-expression in cancer looking for significant values of four main protein-protein interaction (PPI) network parameters: connection degree, cluster coefficient, betweenness and number of components. A total of 3904 gene ontology (GO) modules, 146 KEGG pathways, and 263 Biocarta pathways have been used as functional modules of reference. Co-expression modules found in microarray experiments display a high level of connectivity, similar to the one shown by conventional modules based on functional definitions (GO, KEGG and Biocarta). A general observation for all the classes studied is that the networks formed by the modules improve their topological parameters when an external protein is allowed to be introduced within the paths (up to the 70% of GO modules show network parameters beyond the random expectation). This fact suggests that functional definitions are incomplete and some genes might still be missing. Conversely, signatures are clearly not capturing the altered functions in the corresponding studies. This is probably because the way in which the genes have been selected in the signatures is too conservative. These results suggest that gene selection methods which take into account relationships among genes should be superior to methods that assume independence among genes outside their functional contexts. Public Library of Science 2011-03-07 /pmc/articles/PMC3049771/ /pubmed/21408226 http://dx.doi.org/10.1371/journal.pone.0017474 Text en Minguez, Dopazo. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Minguez, Pablo
Dopazo, Joaquin
Assessing the Biological Significance of Gene Expression Signatures and Co-Expression Modules by Studying Their Network Properties
title Assessing the Biological Significance of Gene Expression Signatures and Co-Expression Modules by Studying Their Network Properties
title_full Assessing the Biological Significance of Gene Expression Signatures and Co-Expression Modules by Studying Their Network Properties
title_fullStr Assessing the Biological Significance of Gene Expression Signatures and Co-Expression Modules by Studying Their Network Properties
title_full_unstemmed Assessing the Biological Significance of Gene Expression Signatures and Co-Expression Modules by Studying Their Network Properties
title_short Assessing the Biological Significance of Gene Expression Signatures and Co-Expression Modules by Studying Their Network Properties
title_sort assessing the biological significance of gene expression signatures and co-expression modules by studying their network properties
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3049771/
https://www.ncbi.nlm.nih.gov/pubmed/21408226
http://dx.doi.org/10.1371/journal.pone.0017474
work_keys_str_mv AT minguezpablo assessingthebiologicalsignificanceofgeneexpressionsignaturesandcoexpressionmodulesbystudyingtheirnetworkproperties
AT dopazojoaquin assessingthebiologicalsignificanceofgeneexpressionsignaturesandcoexpressionmodulesbystudyingtheirnetworkproperties