Cargando…

Reassessing the modularity of gene co-expression networks using the Stochastic Block Model

Finding communities in gene co-expression networks is a common first step toward extracting biological insight from such complex datasets. Most community detection algorithms expect genes to be organized into assortative modules, that is, groups of genes that are more associated with each other than...

Descripción completa

Detalles Bibliográficos
Autores principales: Melo, Diogo, Pallares, Luisa F., Ayroles, Julien F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10312592/
https://www.ncbi.nlm.nih.gov/pubmed/37398186
http://dx.doi.org/10.1101/2023.05.31.542906
_version_ 1785066955063951360
author Melo, Diogo
Pallares, Luisa F.
Ayroles, Julien F.
author_facet Melo, Diogo
Pallares, Luisa F.
Ayroles, Julien F.
author_sort Melo, Diogo
collection PubMed
description Finding communities in gene co-expression networks is a common first step toward extracting biological insight from such complex datasets. Most community detection algorithms expect genes to be organized into assortative modules, that is, groups of genes that are more associated with each other than with genes in other groups. While it is reasonable to expect that these modules exist, using methods that assume they exist a priori is risky, as it guarantees that alternative organizations of gene interactions will be ignored. Here, we ask: can we find meaningful communities without imposing a modular organization on gene co-expression networks, and how modular are these communities? For this, we use a recently developed community detection method, the weighted degree corrected stochastic block model (SBM), that does not assume that assortative modules exist. Instead, the SBM attempts to efficiently use all information contained in the co-expression network to separate the genes into hierarchically organized blocks of genes. Using RNA-seq gene expression data measured in two tissues derived from an outbred population of Drosophila melanogaster, we show that (a) the SBM is able to find ten times as many groups as competing methods, that (b) several of those gene groups are not modular, and that (c) the functional enrichment for non-modular groups is as strong as for modular communities. These results show that the transcriptome is structured in more complex ways than traditionally thought and that we should revisit the long-standing assumption that modularity is the main driver of the structuring of gene co-expression networks.
format Online
Article
Text
id pubmed-10312592
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-103125922023-07-01 Reassessing the modularity of gene co-expression networks using the Stochastic Block Model Melo, Diogo Pallares, Luisa F. Ayroles, Julien F. bioRxiv Article Finding communities in gene co-expression networks is a common first step toward extracting biological insight from such complex datasets. Most community detection algorithms expect genes to be organized into assortative modules, that is, groups of genes that are more associated with each other than with genes in other groups. While it is reasonable to expect that these modules exist, using methods that assume they exist a priori is risky, as it guarantees that alternative organizations of gene interactions will be ignored. Here, we ask: can we find meaningful communities without imposing a modular organization on gene co-expression networks, and how modular are these communities? For this, we use a recently developed community detection method, the weighted degree corrected stochastic block model (SBM), that does not assume that assortative modules exist. Instead, the SBM attempts to efficiently use all information contained in the co-expression network to separate the genes into hierarchically organized blocks of genes. Using RNA-seq gene expression data measured in two tissues derived from an outbred population of Drosophila melanogaster, we show that (a) the SBM is able to find ten times as many groups as competing methods, that (b) several of those gene groups are not modular, and that (c) the functional enrichment for non-modular groups is as strong as for modular communities. These results show that the transcriptome is structured in more complex ways than traditionally thought and that we should revisit the long-standing assumption that modularity is the main driver of the structuring of gene co-expression networks. Cold Spring Harbor Laboratory 2023-06-01 /pmc/articles/PMC10312592/ /pubmed/37398186 http://dx.doi.org/10.1101/2023.05.31.542906 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Melo, Diogo
Pallares, Luisa F.
Ayroles, Julien F.
Reassessing the modularity of gene co-expression networks using the Stochastic Block Model
title Reassessing the modularity of gene co-expression networks using the Stochastic Block Model
title_full Reassessing the modularity of gene co-expression networks using the Stochastic Block Model
title_fullStr Reassessing the modularity of gene co-expression networks using the Stochastic Block Model
title_full_unstemmed Reassessing the modularity of gene co-expression networks using the Stochastic Block Model
title_short Reassessing the modularity of gene co-expression networks using the Stochastic Block Model
title_sort reassessing the modularity of gene co-expression networks using the stochastic block model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10312592/
https://www.ncbi.nlm.nih.gov/pubmed/37398186
http://dx.doi.org/10.1101/2023.05.31.542906
work_keys_str_mv AT melodiogo reassessingthemodularityofgenecoexpressionnetworksusingthestochasticblockmodel
AT pallaresluisaf reassessingthemodularityofgenecoexpressionnetworksusingthestochasticblockmodel
AT ayrolesjulienf reassessingthemodularityofgenecoexpressionnetworksusingthestochasticblockmodel